developer_uid: chai_evaluation_service
submission_id: function_terol_2025-12-18
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-21T23:31:15+00:00
num_battles: 6490
num_wins: 3258
celo_rating: 1294.55
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-21
win_ratio: 0.5020030816640986
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.332944869995117s
Received healthy response to inference request in 3.615039825439453s
Received healthy response to inference request in 2.623535394668579s
Received healthy response to inference request in 2.0488243103027344s
Received healthy response to inference request in 2.3924999237060547s
Received healthy response to inference request in 3.445949077606201s
Received healthy response to inference request in 2.8670148849487305s
Received healthy response to inference request in 2.076660633087158s
Received healthy response to inference request in 2.0981521606445312s
Received healthy response to inference request in 2.199888229370117s
10 requests
0 failed requests
5th percentile: 2.0613506555557253
10th percentile: 2.0738770008087157
20th percentile: 2.0938538551330566
30th percentile: 2.1693674087524415
40th percentile: 2.279722213745117
50th percentile: 2.362722396850586
60th percentile: 2.4849141120910643
70th percentile: 2.6965792417526244
80th percentile: 2.982801723480225
90th percentile: 3.462858152389526
95th percentile: 3.5389489889144894
99th percentile: 3.5998216581344606
mean time: 2.5700509309768678
Pipeline stage StressChecker completed in 27.33s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_terol_2025-12-18 status is now deployed due to DeploymentManager action
function_terol_2025-12-18 status is now inactive due to auto deactivation removed underperforming models
function_terol_2025-12-18 status is now torndown due to DeploymentManager action