developer_uid: chai_backend_admin
submission_id: function_mesit_2025-12-08
model_name: function_mesit_2025-12-08
model_group:
status: torndown
timestamp: 2025-12-12T18:29:39+00:00
num_battles: 6515
num_wins: 3490
celo_rating: 1318.45
family_friendly_score: 0.5114000000000001
family_friendly_standard_error: 0.007069229661002675
submission_type: function
display_name: function_mesit_2025-12-08
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-08
win_ratio: 0.5356868764389869
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.9200313091278076s
Received healthy response to inference request in 2.5978100299835205s
Received healthy response to inference request in 2.2579855918884277s
Received healthy response to inference request in 2.7472331523895264s
Received healthy response to inference request in 1.6369001865386963s
Received healthy response to inference request in 3.3401694297790527s
Received healthy response to inference request in 1.7311577796936035s
Received healthy response to inference request in 2.0174400806427s
Received healthy response to inference request in 2.1689889430999756s
Received healthy response to inference request in 2.8520233631134033s
10 requests
0 failed requests
5th percentile: 1.6793161034584045
10th percentile: 1.7217320203781128
20th percentile: 1.9601836204528809
30th percentile: 2.123524284362793
40th percentile: 2.222386932373047
50th percentile: 2.427897810935974
60th percentile: 2.657579278945923
70th percentile: 2.7786702156066894
80th percentile: 2.865624952316284
90th percentile: 2.962045121192932
95th percentile: 3.151107275485992
99th percentile: 3.3023569989204407
mean time: 2.426973986625671
Pipeline stage StressChecker completed in 25.83s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
function_mesit_2025-12-08 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 3089.52s
Shutdown handler de-registered
function_mesit_2025-12-08 status is now inactive due to auto deactivation removed underperforming models
function_mesit_2025-12-08 status is now torndown due to DeploymentManager action