developer_uid: chai_backend_admin
submission_id: function_muhel_2025-12-18
model_name: function_muhel_2025-12-18
model_group:
status: torndown
timestamp: 2026-01-14T16:59:57+00:00
num_battles: 1434
num_wins: 763
celo_rating: 1302.17
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_muhel_2025-12-18
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-17
win_ratio: 0.5320781032078103
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.3721051216125488s
Received healthy response to inference request in 2.4282736778259277s
Received healthy response to inference request in 2.725734233856201s
Received healthy response to inference request in 1.858433485031128s
Received healthy response to inference request in 2.0476748943328857s
Received healthy response to inference request in 2.726893186569214s
Received healthy response to inference request in 3.120760679244995s
Received healthy response to inference request in 1.4979338645935059s
Received healthy response to inference request in 2.060392379760742s
Received healthy response to inference request in 1.8692479133605957s
10 requests
0 failed requests
5th percentile: 1.4287280559539794
10th percentile: 1.48535099029541
20th percentile: 1.7863335609436035
30th percentile: 1.8660035848617553
40th percentile: 1.9763041019439698
50th percentile: 2.054033637046814
60th percentile: 2.207544898986816
70th percentile: 2.5175118446350098
80th percentile: 2.7259660243988035
90th percentile: 2.766279935836792
95th percentile: 2.943520307540893
99th percentile: 3.085312604904175
mean time: 2.1707449436187742
Pipeline stage StressChecker completed in 23.12s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_muhel_2025-12-18 status is now deployed due to DeploymentManager action
function_muhel_2025-12-18 status is now protected due to ABTestQueueItem
function_muhel_2025-12-18 status is now inactive due to ABTestQueueItem
function_muhel_2025-12-18 status is now torndown due to DeploymentManager action