developer_uid: chai_backend_admin
submission_id: function_gomas_2025-12-09
model_name: function_gomas_2025-12-09
model_group:
status: torndown
timestamp: 2025-12-12T22:21:10+00:00
num_battles: 8120
num_wins: 4216
celo_rating: 1307.59
family_friendly_score: 0.5802
family_friendly_standard_error: 0.006979512303879119
submission_type: function
display_name: function_gomas_2025-12-09
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-09
win_ratio: 0.5192118226600986
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.8229706287384033s
Received healthy response to inference request in 1.711782455444336s
Received healthy response to inference request in 2.1766512393951416s
Received healthy response to inference request in 1.9423203468322754s
Received healthy response to inference request in 1.607513666152954s
Received healthy response to inference request in 1.6773109436035156s
Received healthy response to inference request in 2.2307562828063965s
Received healthy response to inference request in 1.7232520580291748s
Received healthy response to inference request in 1.646693468093872s
Received healthy response to inference request in 1.9006707668304443s
10 requests
0 failed requests
5th percentile: 1.6251445770263673
10th percentile: 1.6427754878997802
20th percentile: 1.671187448501587
30th percentile: 1.7014410018920898
40th percentile: 1.7186642169952393
50th percentile: 1.773111343383789
60th percentile: 1.8540506839752198
70th percentile: 1.9131656408309936
80th percentile: 1.9891865253448486
90th percentile: 2.1820617437362673
95th percentile: 2.206409013271332
99th percentile: 2.2258868288993834
mean time: 1.8439921855926513
Pipeline stage StressChecker completed in 19.74s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_gomas_2025-12-09 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2220.05s
Shutdown handler de-registered
function_gomas_2025-12-09 status is now inactive due to auto deactivation removed underperforming models
function_gomas_2025-12-09 status is now torndown due to DeploymentManager action