developer_uid: chai_evaluation_service
submission_id: function_gutof_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-13T09:47:07+00:00
num_battles: 9939
num_wins: 5034
celo_rating: 1256.33
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.5064895864775129
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
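The `formatter` templates above read as Python `str.format` patterns. As a minimal sketch of how they might combine into a single model input: the assembly order (memory, prompt, conversation turns, response header), the `build_prompt` helper, and the `turns` structure are all assumptions for illustration, not the service's confirmed behavior. Truncation to `max_input_tokens` is not modeled.

```python
# Templates copied from the `formatter` field of the record above.
formatter = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(formatter, memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user". This shape is an assumption.
    """
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues
    # the bot's line; with stopping_words=['\n'] generation would end at
    # the first newline.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    example = build_prompt(
        formatter,
        memory="A friendly assistant.",
        prompt="Casual chat.",
        turns=[("user", "hi"), ("bot", "hello there")],
        bot_name="richard",
        user_name="Ann",
    )
    print(example)
```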
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.0660691261291504s
Received healthy response to inference request in 2.7494561672210693s
Received healthy response to inference request in 2.744192123413086s
Received healthy response to inference request in 1.912003755569458s
Received healthy response to inference request in 3.194181442260742s
Received healthy response to inference request in 2.980771780014038s
Received healthy response to inference request in 2.7479476928710938s
Received healthy response to inference request in 3.45546555519104s
Received healthy response to inference request in 3.733464479446411s
Received healthy response to inference request in 3.0908024311065674s
10 requests
0 failed requests
5th percentile: 2.2864885210990904
10th percentile: 2.6609732866287232
20th percentile: 2.747196578979492
30th percentile: 2.7490036249160767
40th percentile: 2.8882455348968508
50th percentile: 3.0234204530715942
60th percentile: 3.075962448120117
70th percentile: 3.12181613445282
80th percentile: 3.2464382648468018
90th percentile: 3.483265447616577
95th percentile: 3.608364963531494
99th percentile: 3.7084445762634277
mean time: 2.9674354553222657
Pipeline stage StressChecker completed in 31.65s
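The StressChecker percentiles and mean above can be reproduced from the ten logged response times. The sketch below assumes linear interpolation between order statistics (the default behavior of `numpy.percentile`), which appears consistent with the logged values; the `percentile` helper is hypothetical and not taken from the pipeline's code.

```python
# Response times (seconds) copied from the ten "healthy response" log lines.
latencies = [
    3.0660691261291504,
    2.7494561672210693,
    2.744192123413086,
    1.912003755569458,
    3.194181442260742,
    2.980771780014038,
    2.7479476928710938,
    3.45546555519104,
    3.733464479446411,
    3.0908024311065674,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into ordered
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])


mean_time = sum(latencies) / len(latencies)

if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {mean_time}")
```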
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.65s
Shutdown handler de-registered
function_gutof_2025-12-13 status is now deployed due to DeploymentManager action
function_gutof_2025-12-13 status is now inactive due to auto deactivation (removal of underperforming models)