developer_uid: chai_backend_admin
submission_id: function_gimar_2026-01-15
model_name: function_gimar_2026-01-15
model_group:
status: torndown
timestamp: 2026-01-18T05:23:32+00:00
num_battles: 11079
num_wins: 5857
celo_rating: 1313.78
family_friendly_score: 0.5414
family_friendly_standard_error: 0.007046787069296192
submission_type: function
display_name: function_gimar_2026-01-15
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-01-14
win_ratio: 0.5286578211029876
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': 'CUSTOM', 'prompt_template': 'CUSTOM', 'bot_template': 'CUSTOM', 'user_template': 'CUSTOM', 'response_template': 'CUSTOM', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.682391405105591s
Received healthy response to inference request in 2.574735403060913s
Received healthy response to inference request in 3.1500582695007324s
Received healthy response to inference request in 3.5049219131469727s
Received healthy response to inference request in 3.8011531829833984s
Received healthy response to inference request in 2.8409910202026367s
Received healthy response to inference request in 3.2596700191497803s
Received healthy response to inference request in 3.124786138534546s
Received healthy response to inference request in 2.4151947498321533s
Received healthy response to inference request in 2.7967610359191895s
10 requests
0 failed requests
5th percentile: 2.4869880437850953
10th percentile: 2.5587813377380373
20th percentile: 2.7523559093475343
30th percentile: 2.8277220249176027
40th percentile: 3.0112680912017824
50th percentile: 3.137422204017639
60th percentile: 3.1939029693603516
70th percentile: 3.333245587348938
80th percentile: 3.540415811538696
90th percentile: 3.6942675828933718
95th percentile: 3.747710382938385
99th percentile: 3.7904646229743957
mean time: 3.115066313743591
Pipeline stage StressChecker completed in 32.65s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_gimar_2026-01-15 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 3631.99s
Shutdown handler de-registered
function_gimar_2026-01-15 status is now inactive due to auto deactivation removed underperforming models
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
function_gimar_2026-01-15 status is now torndown due to DeploymentManager action
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation