developer_uid: chai_backend_admin
submission_id: function_mutus_2025-12-21
model_name: function_mutus_2025-12-21
model_group:
status: torndown
timestamp: 2025-12-24T23:21:24+00:00
num_battles: 6525
num_wins: 3474
celo_rating: 1315.67
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_mutus_2025-12-21
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-24
win_ratio: 0.5324137931034483
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
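The formatter entry above only lists template strings, so the following is a minimal sketch of how they might be combined into one model input. The concatenation order (memory, prompt, chat turns, response header), the function name `build_prompt`, and the sample values are assumptions for illustration; the metadata does not show the backend's actual assembly code. Truncation to `max_input_tokens` is not modeled.

```python
# Template strings copied from the formatter entry in the metadata.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
    "truncate_by_message": False,
}


def build_prompt(formatter, memory, prompt, bot_name, user_name, turns):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user".
    """
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues
    # the bot's next line.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    # Sample values are made up for the illustration.
    print(build_prompt(FORMATTER, "You are a friendly guide.", "A quiet library.",
                       "Guide", "Visitor", [("user", "Hello?")]))
```

Because `stopping_words` in `generation_params` is `['\n']`, a prompt ending in `{bot_name}:` would yield at most one line of bot output per request.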
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.805166006088257s
Received healthy response to inference request in 8.630069255828857s
Received healthy response to inference request in 3.4741404056549072s
Received healthy response to inference request in 3.8842203617095947s
Received healthy response to inference request in 6.529425621032715s
Received healthy response to inference request in 5.9253809452056885s
Received healthy response to inference request in 7.73692774772644s
Received healthy response to inference request in 3.944843292236328s
Received healthy response to inference request in 8.480316400527954s
Received healthy response to inference request in 8.782042503356934s
10 requests
0 failed requests
5th percentile: 3.6231019258499146
10th percentile: 3.772063446044922
20th percentile: 3.8684094905853272
30th percentile: 3.926656413078308
40th percentile: 5.133165884017944
50th percentile: 6.227403283119202
60th percentile: 7.012426471710205
70th percentile: 7.959944343566894
80th percentile: 8.510266971588134
90th percentile: 8.645266580581666
95th percentile: 8.7136545419693
99th percentile: 8.768364911079408
mean time: 6.119253253936767
Pipeline stage StressChecker completed in 62.46s
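The StressChecker summary can be reproduced from the ten logged response times. The sketch below assumes percentiles are computed by linear interpolation between the closest ranks; the logged values are consistent with that method, but the log does not show the stage's own code, so the helper `percentile` is hypothetical.

```python
# Response times in seconds, copied from the "Received healthy response" lines.
LATENCIES = [
    3.805166006088257, 8.630069255828857, 3.4741404056549072,
    3.8842203617095947, 6.529425621032715, 5.9253809452056885,
    7.73692774772644, 3.944843292236328, 8.480316400527954,
    8.782042503356934,
]


def percentile(sorted_values, q):
    """Return the q-th percentile (0-100) of an ascending list.

    Assumption: linear interpolation between the two nearest ranks.
    """
    rank = (len(sorted_values) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = rank - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * fraction


if __name__ == "__main__":
    ordered = sorted(LATENCIES)
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(ordered, q)}")
    print(f"mean time: {sum(ordered) / len(ordered)}")
```

With only ten samples, the upper percentiles are interpolated between the two slowest requests, so the 95th and 99th values say little about tail latency.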
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.56s
Shutdown handler de-registered
function_mutus_2025-12-21 status is now deployed due to DeploymentManager action
function_mutus_2025-12-21 status is now inactive due to auto deactivation removed underperforming models
function_mutus_2025-12-21 status is now torndown due to DeploymentManager action