developer_uid: chai_evaluation_service
submission_id: function_mesat_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T12:31:07+00:00
num_battles: 7938
num_wins: 4006
celo_rating: 1296.47
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.5046611237087427
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 7.906052589416504s
Received healthy response to inference request in 2.503331422805786s
Received healthy response to inference request in 2.6023507118225098s
Received healthy response to inference request in 3.202890634536743s
Received healthy response to inference request in 2.51550555229187s
Received healthy response to inference request in 3.0591483116149902s
Received healthy response to inference request in 2.3467867374420166s
Received healthy response to inference request in 3.1410157680511475s
Received healthy response to inference request in 2.049140453338623s
Received healthy response to inference request in 3.830239772796631s
10 requests
0 failed requests
5th percentile: 2.1830812811851503
10th percentile: 2.317022109031677
20th percentile: 2.4720224857330324
30th percentile: 2.511853313446045
40th percentile: 2.567612648010254
50th percentile: 2.83074951171875
60th percentile: 3.0918952941894533
70th percentile: 3.159578227996826
80th percentile: 3.3283604621887206
90th percentile: 4.237821054458617
95th percentile: 6.071936821937557
99th percentile: 7.539229435920716
mean time: 3.315646195411682
Pipeline stage StressChecker completed in 34.55s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_mesat_2025-12-17 status is now deployed due to DeploymentManager action
function_mesat_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_mesat_2025-12-17 status is now torndown due to DeploymentManager action