developer_uid: chai_backend_admin
submission_id: function_hutef_2026-01-17
model_name: function_hutef_2026-01-17
model_group:
status: torndown
timestamp: 2026-01-20T01:13:33+00:00
num_battles: 14500
num_wins: 4319
celo_rating: 9999.0
family_friendly_score: 0.7063999999999999
family_friendly_standard_error: 0.006440481969542342
submission_type: function
display_name: function_hutef_2026-01-17
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-01-16
win_ratio: 0.29786206896551726
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Falling back to EndpointApi.from_submission implementation
Running pipeline stage StressChecker
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 0.9930241107940674s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 0.7174291610717773s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 0.7537217140197754s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 0.6465966701507568s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 0.6213216781616211s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 0.6244218349456787s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 0.6972570419311523s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 1.0117809772491455s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 0.6533455848693848s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 0.7100772857666016s
Falling back to EndpointApi.from_submission implementation
10 requests
Falling back to EndpointApi.from_submission implementation
0 failed requests
Falling back to EndpointApi.from_submission implementation
5th percentile: 0.622716748714447
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
10th percentile: 0.624111819267273
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
20th percentile: 0.6421617031097412
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
30th percentile: 0.6513209104537964
Falling back to EndpointApi.from_submission implementation
40th percentile: 0.6796924591064453
Falling back to EndpointApi.from_submission implementation
50th percentile: 0.703667163848877
Falling back to EndpointApi.from_submission implementation
60th percentile: 0.7130180358886719
70th percentile: 0.7283169269561768
80th percentile: 0.8015821933746339
90th percentile: 0.9948997974395752
95th percentile: 1.0033403873443603
99th percentile: 1.0100928592681884
mean time: 0.7428976058959961
Pipeline stage StressChecker completed in 10.54s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
Falling back to EndpointApi.from_submission implementation
run_pipeline:run_in_cloud %s
Falling back to EndpointApi.from_submission implementation
starting trigger_guanaco_pipeline args=%s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.65s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Shutdown handler de-registered
Falling back to EndpointApi.from_submission implementation
function_hutef_2026-01-17 status is now deployed due to DeploymentManager action
Falling back to EndpointApi.from_submission implementation
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2208.52s
Shutdown handler de-registered
function_hutef_2026-01-17 status is now torndown due to DeploymentManager action