developer_uid: chai_evaluation_service
submission_id: function_jagut_2025-12-14
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-17T14:01:12+00:00
num_battles: 9439
num_wins: 4704
celo_rating: 1299.93
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.4983578768937387
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.796337604522705s
Received healthy response to inference request in 3.7230515480041504s
Received healthy response to inference request in 2.3093323707580566s
Received healthy response to inference request in 2.8461833000183105s
Received healthy response to inference request in 2.0879340171813965s
Received healthy response to inference request in 2.0494956970214844s
Received healthy response to inference request in 1.6748852729797363s
Received healthy response to inference request in 2.0120556354522705s
Received healthy response to inference request in 2.3344297409057617s
Received healthy response to inference request in 1.9666621685028076s
10 requests
0 failed requests
5th percentile: 1.7295388221740722
10th percentile: 1.7841923713684082
20th percentile: 1.9325972557067872
30th percentile: 1.9984375953674316
40th percentile: 2.034519672393799
50th percentile: 2.0687148571014404
60th percentile: 2.1764933586120603
70th percentile: 2.3168615818023683
80th percentile: 2.4367804527282715
90th percentile: 2.9338701248168944
95th percentile: 3.3284608364105215
99th percentile: 3.644133405685425
mean time: 2.280036735534668
Pipeline stage StressChecker completed in 24.51s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.67s
Shutdown handler de-registered
function_jagut_2025-12-14 status is now deployed due to DeploymentManager action
function_jagut_2025-12-14 status is now inactive due to auto deactivation removed underperforming models
function_jagut_2025-12-14 status is now torndown due to DeploymentManager action