developer_uid: chai_backend_admin
submission_id: function_dedal_2025-12-18
model_name: function_dedal_2025-12-18
model_group:
status: torndown
timestamp: 2025-12-21T18:21:23+00:00
num_battles: 10246
num_wins: 6060
celo_rating: 1357.74
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_dedal_2025-12-18
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-21
win_ratio: 0.5914503220769081
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 15.441465616226196s
Received healthy response to inference request in 16.422095775604248s
Received healthy response to inference request in 6.536503791809082s
Received healthy response to inference request in 5.641752243041992s
Received healthy response to inference request in 4.312408447265625s
Received healthy response to inference request in 2.8273262977600098s
Received healthy response to inference request in 3.525475263595581s
Received healthy response to inference request in 3.205488681793213s
Received healthy response to inference request in 3.0815927982330322s
Received healthy response to inference request in 2.8218019008636475s
10 requests
0 failed requests
5th percentile: 2.8242878794670103
10th percentile: 2.8267738580703736
20th percentile: 3.030739498138428
30th percentile: 3.168319916725159
40th percentile: 3.3974806308746337
50th percentile: 3.918941855430603
60th percentile: 4.844145965576171
70th percentile: 5.910177707672119
80th percentile: 8.317496156692506
90th percentile: 15.539528632164002
95th percentile: 15.980812203884124
99th percentile: 16.333839061260225
mean time: 6.381591081619263
Pipeline stage StressChecker completed in 65.45s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 8.04s
Shutdown handler de-registered
function_dedal_2025-12-18 status is now deployed due to DeploymentManager action
function_dedal_2025-12-18 status is now inactive due to auto deactivation removed underperforming models
function_dedal_2025-12-18 status is now torndown due to DeploymentManager action