developer_uid: chai_evaluation_service
submission_id: function_gijam_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T10:51:19+00:00
num_battles: 7853
num_wins: 3981
celo_rating: 1290.54
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-17
win_ratio: 0.5069400229211766
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
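The win_ratio field above is consistent with num_wins divided by num_battles. A minimal sketch of that check, assuming no ties or weighting enter the calculation (the record does not say how the service computes the field):

```python
# Values copied from the metadata fields above.
num_battles = 7853
num_wins = 3981
reported_win_ratio = 0.5069400229211766

# Assumption: win_ratio is a plain wins / battles fraction.
win_ratio = num_wins / num_battles

# Compare against the reported value with a small tolerance.
matches = abs(win_ratio - reported_win_ratio) < 1e-12
print(f"win_ratio={win_ratio:.6f} matches_reported={matches}")
```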
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2353622913360596s
Received healthy response to inference request in 1.622225284576416s
Received healthy response to inference request in 2.1661159992218018s
Received healthy response to inference request in 3.1062469482421875s
Received healthy response to inference request in 2.6223487854003906s
Received healthy response to inference request in 1.9134044647216797s
Received healthy response to inference request in 2.344122886657715s
Received healthy response to inference request in 2.011268138885498s
Received healthy response to inference request in 2.08290433883667s
Received healthy response to inference request in 2.0007412433624268s
10 requests
0 failed requests
5th percentile: 1.7532559156417846
10th percentile: 1.8842865467071532
20th percentile: 1.9832738876342773
30th percentile: 2.0081100702285766
40th percentile: 2.0542498588562013
50th percentile: 2.124510169029236
60th percentile: 2.193814516067505
70th percentile: 2.267990469932556
80th percentile: 2.39976806640625
90th percentile: 2.6707386016845702
95th percentile: 2.888492774963378
99th percentile: 3.062696113586426
mean time: 2.2104740381240844
Pipeline stage StressChecker completed in 23.53s
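The StressChecker percentiles and mean above can be reproduced from the ten logged response times. A minimal sketch, assuming linear interpolation between the nearest ranks (this reproduces the logged figures, but the log does not state which method the checker uses; the `percentile` helper is hypothetical, not part of the pipeline):

```python
import math

# Response times in seconds, copied from the log lines above.
latencies = [
    2.2353622913360596, 1.622225284576416, 2.1661159992218018,
    3.1062469482421875, 2.6223487854003906, 1.9134044647216797,
    2.344122886657715, 2.011268138885498, 2.08290433883667,
    2.0007412433624268,
]

def percentile(values, q):
    """Return the q-th percentile using linear interpolation between ranks."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into sorted data
    lo = math.floor(rank)
    hi = math.ceil(rank)
    frac = rank - lo
    return ordered[lo] + (ordered[hi] - ordered[lo]) * frac

mean_time = sum(latencies) / len(latencies)

for q in (5, 50, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only ten samples, the upper percentiles are driven almost entirely by the single slowest request (3.106 s), so they are a rough indication rather than a stable tail-latency estimate.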
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.89s
Shutdown handler de-registered
function_gijam_2025-12-17 status is now deployed due to DeploymentManager action
function_gijam_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_gijam_2025-12-17 status is now torndown due to DeploymentManager action