developer_uid: chai_backend_admin
submission_id: function_tutom_2025-12-14
model_name: function_tutom_2025-12-14
model_group:
status: inactive
timestamp: 2025-12-14T03:36:51+00:00
num_battles: 6407
num_wins: 2490
celo_rating: 1221.03
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_tutom_2025-12-14
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.38863742781332916
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
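The formatter entry above only lists the templates; it does not say how they are combined. A minimal sketch of one plausible assembly follows, assuming the order memory, prompt, chat turns, response header. The function name `build_prompt`, its arguments, and that ordering are hypothetical and not confirmed by this record; truncation to `max_input_tokens` is not modelled.

```python
# Templates copied from the formatter entry in this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Assemble a model input string (hypothetical helper).

    turns is a list of (speaker, message) pairs where speaker is
    "bot" or "user". The ordering of sections is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues
    # the bot's line; generation then stops at the first "\n"
    # (see stopping_words in generation_params).
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    print(build_prompt("You are Ada.", "Greet the user.",
                       [("user", "hi"), ("bot", "hello")], "Ada", "Sam"))
```

Because every template except the response header ends in a newline, plain concatenation is enough and no extra separators are needed.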
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 8.838281869888306s
Received healthy response to inference request in 6.401893377304077s
Received healthy response to inference request in 1.9522888660430908s
Received healthy response to inference request in 4.41216516494751s
Received healthy response to inference request in 7.70810604095459s
Received healthy response to inference request in 7.668372869491577s
Received healthy response to inference request in 6.381713628768921s
Received healthy response to inference request in 2.077974796295166s
Received healthy response to inference request in 4.634469509124756s
Received healthy response to inference request in 1.8179140090942383s
10 requests
0 failed requests
5th percentile: 1.8783826947212219
10th percentile: 1.9388513803482055
20th percentile: 2.052837610244751
30th percentile: 3.711908054351806
40th percentile: 4.545547771453857
50th percentile: 5.508091568946838
60th percentile: 6.389785528182983
70th percentile: 6.781837224960327
80th percentile: 7.67631950378418
90th percentile: 7.821123623847961
95th percentile: 8.329702746868133
99th percentile: 8.736566045284272
mean time: 5.189318013191223
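The percentile and mean figures above are consistent with linear interpolation between the sorted response times (the default method of NumPy's `percentile`). The sketch below recomputes them from the ten logged latencies using only the standard library. The helper name `percentile` is illustrative, and whether StressChecker computes its summary this way internally is an assumption.

```python
import math

# Response times in seconds, copied from the ten log lines above.
latencies = [
    8.838281869888306, 6.401893377304077, 1.9522888660430908,
    4.41216516494751, 7.70810604095459, 7.668372869491577,
    6.381713628768921, 2.077974796295166, 4.634469509124756,
    1.8179140090942383,
]


def percentile(values, q):
    """Return the q-th percentile (0..100) using linear interpolation."""
    ordered = sorted(values)
    # Fractional position of the percentile within the sorted sample.
    rank = (len(ordered) - 1) * q / 100
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only ten samples, the high percentiles are interpolated between the two slowest requests, so the 95th and 99th values say little about tail latency beyond the observed maximum.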
Pipeline stage StressChecker completed in 53.29s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.97s
Shutdown handler de-registered
function_tutom_2025-12-14 status is now deployed due to DeploymentManager action
function_tutom_2025-12-14 status is now inactive due to auto deactivation removed underperforming models