developer_uid: chai_evaluation_service
submission_id: function_nadeb_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T10:21:24+00:00
num_battles: 8355
num_wins: 4140
celo_rating: 1289.97
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.4955116696588869
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
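The `formatter` entry above is a set of string templates. As a rough illustration of how such templates could be combined into one model input, here is a minimal sketch. The assembly order (memory, then prompt, then conversation turns, then the response stub), the `build_prompt` helper, and the `turns` structure are assumptions made for illustration and are not confirmed by this record. Truncation to `max_input_tokens` and the `truncate_by_message` flag are not modelled.

```python
# Templates copied from the `formatter` field of this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(formatter, memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response stub ends with "{bot_name}:" so the model continues as the bot.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    example = build_prompt(
        FORMATTER,
        memory="A friendly assistant persona.",
        prompt="A casual chat.",
        turns=[("user", "hi"), ("bot", "hello")],
        bot_name="richard",
        user_name="User",
    )
    print(example)
```

Because `stopping_words` is `['\n']` in `generation_params`, a completion generated from a prompt of this shape would presumably end at the first newline, yielding a single bot message.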
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.396908760070801s
Received healthy response to inference request in 1.8325626850128174s
Received healthy response to inference request in 2.2813665866851807s
Received healthy response to inference request in 3.36547589302063s
Received healthy response to inference request in 2.1995859146118164s
Received healthy response to inference request in 2.445632219314575s
Received healthy response to inference request in 1.7873194217681885s
Received healthy response to inference request in 2.3232884407043457s
Received healthy response to inference request in 1.661994457244873s
Received healthy response to inference request in 1.925938367843628s
10 requests
0 failed requests
5th percentile: 1.718390691280365
10th percentile: 1.774786925315857
20th percentile: 1.8235140323638916
30th percentile: 1.8979256629943848
40th percentile: 2.090126895904541
50th percentile: 2.2404762506484985
60th percentile: 2.2981353282928465
70th percentile: 2.3453745365142824
80th percentile: 2.4066534519195555
90th percentile: 2.5376165866851803
95th percentile: 2.951546239852904
99th percentile: 3.282689962387085
mean time: 2.2220072746276855
Pipeline stage StressChecker completed in 23.99s
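The StressChecker percentiles and mean above can be reproduced from the ten logged response times. The sketch below uses linear interpolation between order statistics, which matches the logged values for this sample. Whether the pipeline itself computes them this way is an assumption, and the `percentile` helper is hypothetical.

```python
import math

# Response times in seconds, copied from the StressChecker log lines.
LATENCIES = [
    2.396908760070801,
    1.8325626850128174,
    2.2813665866851807,
    3.36547589302063,
    2.1995859146118164,
    2.445632219314575,
    1.7873194217681885,
    2.3232884407043457,
    1.661994457244873,
    1.925938367843628,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def mean(values):
    """Arithmetic mean of the sample."""
    return sum(values) / len(values)


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(LATENCIES, q)}")
    print(f"mean time: {mean(LATENCIES)}")
```

With only 10 requests, the upper percentiles are dominated by the single 3.37 s response, so the 95th and 99th values are interpolations toward that one sample rather than stable tail estimates.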
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
function_nadeb_2025-12-16 status is now deployed due to DeploymentManager action
function_nadeb_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_nadeb_2025-12-16 status is now torndown due to DeploymentManager action