developer_uid: chai_backend_admin
submission_id: function_gelab_2026-01-15
model_name: function_gelab_2026-01-15
model_group:
status: torndown
timestamp: 2026-01-18T16:13:30+00:00
num_battles: 114
num_wins: 56
family_friendly_score: 0.5218
family_friendly_standard_error: 0.007064343706247595
submission_type: function
display_name: function_gelab_2026-01-15
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-01-15
win_ratio: 0.49122807017543857
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': 'CUSTOM', 'prompt_template': 'CUSTOM', 'bot_template': 'CUSTOM', 'user_template': 'CUSTOM', 'response_template': 'CUSTOM', 'truncate_by_message': False}
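The win_ratio field above is consistent with num_wins divided by num_battles. A minimal sketch of that arithmetic follows; the function name `win_ratio` is hypothetical and is not taken from the pipeline's code.

```python
import math


def win_ratio(num_wins: int, num_battles: int) -> float:
    """Fraction of battles won; hypothetical helper, not the pipeline's own code."""
    if num_battles <= 0:
        raise ValueError("num_battles must be positive")
    return num_wins / num_battles


# Values copied from the metadata above.
ratio = win_ratio(num_wins=56, num_battles=114)
reported = 0.49122807017543857

# Compare with a tolerance rather than relying on exact float formatting.
print(math.isclose(ratio, reported, rel_tol=1e-12))
```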
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.7283413410186768s
Received healthy response to inference request in 1.8834326267242432s
Received healthy response to inference request in 2.1036930084228516s
Received healthy response to inference request in 1.6440112590789795s
Received healthy response to inference request in 2.289067506790161s
Received healthy response to inference request in 1.641252040863037s
Received healthy response to inference request in 1.6460309028625488s
Received healthy response to inference request in 1.4986076354980469s
Received healthy response to inference request in 1.6496593952178955s
Received healthy response to inference request in 1.6813194751739502s
10 requests
0 failed requests
5th percentile: 1.5627976179122924
10th percentile: 1.626987600326538
20th percentile: 1.643459415435791
30th percentile: 1.645425009727478
40th percentile: 1.648207998275757
50th percentile: 1.6654894351959229
60th percentile: 1.7001282215118407
70th percentile: 1.7748687267303467
80th percentile: 1.9274847030639648
90th percentile: 2.1222304582595823
95th percentile: 2.2056489825248717
99th percentile: 2.2723838019371034
mean time: 1.776541519165039
Pipeline stage StressChecker completed in 19.04s
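The StressChecker summary above can be reproduced from the ten logged latencies using percentiles with linear interpolation between order statistics. The sketch below assumes that method because it matches the reported figures; the log does not say how StressChecker actually computes them, and the `percentile` helper is hypothetical.

```python
import math

# Latencies in seconds, copied from the "Received healthy response" lines above.
latencies = [
    1.7283413410186768, 1.8834326267242432, 2.1036930084228516,
    1.6440112590789795, 2.289067506790161, 1.641252040863037,
    1.6460309028625488, 1.4986076354980469, 1.6496593952178955,
    1.6813194751739502,
]


def percentile(values, q):
    """q-th percentile (0..100) using linear interpolation between sorted samples.

    Hypothetical helper; assumed to mirror the method behind the logged summary.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


mean_time = sum(latencies) / len(latencies)

for q in (5, 50, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only ten samples, the 95th and 99th percentiles are interpolated almost entirely from the two slowest requests, so they are a rough indication rather than a stable tail-latency estimate.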
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.57s
Shutdown handler de-registered
function_gelab_2026-01-15 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 4029.36s
Shutdown handler de-registered
admin requested tearing down of function_gelab_2026-01-15
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
Shutdown handler de-registered
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
function_gelab_2026-01-15 status is now torndown due to DeploymentManager action
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation