developer_uid: chai_evaluation_service
submission_id: function_metok_2025-12-14
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-17T14:36:27+00:00
num_battles: 8401
num_wins: 4210
celo_rating: 1252.31
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.5011308177597905
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
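The formatter templates above can be read as the pieces of a single prompt string. The sketch below shows one plausible way they combine. The assembly order (memory, prompt, chat turns, response header) and the `build_prompt` helper are assumptions for illustration, not the service's confirmed implementation.

```python
# Template strings copied from the formatter entry above.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, bot_name, user_name, turns):
    """Assemble a model input from the templates (hypothetical helper).

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The ordering of sections is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues
    # the bot's next line; generation stops at the first "\n" stopping word.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    memory="Richard is a friendly assistant.",
    prompt="A casual chat.",
    bot_name="richard",
    user_name="User",
    turns=[("user", "hi"), ("bot", "hello")],
)
print(example)
```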
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2574520111083984s
Received healthy response to inference request in 2.199568271636963s
Received healthy response to inference request in 2.127823829650879s
Received healthy response to inference request in 2.4301352500915527s
Received healthy response to inference request in 2.077731132507324s
Received healthy response to inference request in 2.8415801525115967s
Received healthy response to inference request in 2.851261615753174s
Received healthy response to inference request in 2.902456521987915s
Received healthy response to inference request in 2.1225228309631348s
Received healthy response to inference request in 2.713108539581299s
10 requests
0 failed requests
5th percentile: 2.097887396812439
10th percentile: 2.1180436611175537
20th percentile: 2.12676362991333
30th percentile: 2.1780449390411376
40th percentile: 2.234298515319824
50th percentile: 2.3437936305999756
60th percentile: 2.543324565887451
70th percentile: 2.751650023460388
80th percentile: 2.843516445159912
90th percentile: 2.8563811063766478
95th percentile: 2.8794188141822814
99th percentile: 2.8978489804267884
mean time: 2.4523640155792235
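The percentile and mean figures above can be reproduced from the ten logged response times using linear interpolation between sorted samples. The sketch below assumes that interpolation method because it matches the reported values. The `percentile` helper is illustrative and is not taken from the StressChecker source.

```python
import math

# Response times (seconds) from the ten "healthy response" log lines above.
latencies = [
    2.2574520111083984,
    2.199568271636963,
    2.127823829650879,
    2.4301352500915527,
    2.077731132507324,
    2.8415801525115967,
    2.851261615753174,
    2.902456521987915,
    2.1225228309631348,
    2.713108539581299,
]


def percentile(values, q):
    """Return the q-th percentile using linear interpolation."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into sorted data
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```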
Pipeline stage StressChecker completed in 26.01s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_metok_2025-12-14 status is now deployed due to DeploymentManager action
function_metok_2025-12-14 status is now inactive due to auto deactivation (removed underperforming models)
function_metok_2025-12-14 status is now torndown due to DeploymentManager action