developer_uid: chai_backend_admin
submission_id: function_nagab_2026-03-22
model_name: function_nagab_2026-03-22
model_group:
status: torndown
timestamp: 2026-03-25T08:21:41+00:00
num_battles: 11781
num_wins: 6986
celo_rating: 8507.58
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_nagab_2026-03-22
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-03-22
win_ratio: 0.5929887106357694
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
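
The `formatter` entry above is easier to read once its templates are expanded. The following is a minimal sketch, not the platform's actual implementation: the template strings are copied from the record, but the assembly order (memory, prompt, chat turns, response header), the function name `build_prompt`, and the sample conversation are assumptions made for illustration. Truncation to `max_input_tokens` is not modelled.

```python
# Template strings copied from the `formatter` entry of the record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Assemble one model input (hypothetical helper).

    `turns` is a list of (speaker, message) pairs where speaker is
    either "bot" or "user". The ordering of sections is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "<bot_name>:" so the model continues the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Illustrative values only; none of these appear in the record.
example = build_prompt(
    memory="A friendly guide.",
    prompt="Greeting scene.",
    turns=[("user", "hi"), ("bot", "hello")],
    bot_name="Guide",
    user_name="Visitor",
)
print(example)
```

Because `stopping_words` is `['\n']` in `generation_params`, generation under this layout would stop at the end of the bot's single line.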
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 3.3481781482696533s
Received healthy response to inference request in 3.354583740234375s
Received healthy response to inference request in 4.03613805770874s
Received healthy response to inference request in 3.4082319736480713s
Received healthy response to inference request in 3.3022632598876953s
Received healthy response to inference request in 3.3755738735198975s
Received healthy response to inference request in 3.674551248550415s
Received healthy response to inference request in 3.1691863536834717s
Received healthy response to inference request in 3.241938591003418s
10 requests
1 failed requests
5th percentile: 3.2019248604774475
10th percentile: 3.2346633672714233
20th percentile: 3.29019832611084
30th percentile: 3.3344036817550657
40th percentile: 3.352021503448486
50th percentile: 3.3650788068771362
60th percentile: 3.388637113571167
70th percentile: 3.4881277561187742
80th percentile: 3.74686861038208
90th percentile: 5.64207174777984
95th percentile: 12.868773353099806
99th percentile: 18.650134637355805
mean time: 5.100612020492553
%s, retrying in %s seconds...
Received healthy response to inference request in 2.9427692890167236s
Received healthy response to inference request in 3.4722506999969482s
Received healthy response to inference request in 4.0103137493133545s
Received healthy response to inference request in 3.1220285892486572s
Received healthy response to inference request in 3.4692490100860596s
Received healthy response to inference request in 3.507798671722412s
Received healthy response to inference request in 3.601055145263672s
Received healthy response to inference request in 3.3750500679016113s
Received healthy response to inference request in 2.8308238983154297s
Received healthy response to inference request in 3.272127389907837s
10 requests
0 failed requests
5th percentile: 2.881199324131012
10th percentile: 2.931574749946594
20th percentile: 3.0861767292022706
30th percentile: 3.227097749710083
40th percentile: 3.3338809967041017
50th percentile: 3.4221495389938354
60th percentile: 3.470449686050415
70th percentile: 3.4829150915145872
80th percentile: 3.526449966430664
90th percentile: 3.64198100566864
95th percentile: 3.826147377490997
99th percentile: 3.9734804749488832
mean time: 3.3603466510772706
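
The percentile and mean figures of this second StressChecker run can be re-derived from the ten latencies logged above. The sketch below assumes linear interpolation between order statistics (the default behaviour of `numpy.percentile`). Whether the pipeline actually uses NumPy is not stated in the log, and the helper name `percentile` is hypothetical.

```python
import math

# Latencies (seconds) of the ten healthy responses in the second run, as logged.
latencies = [
    2.9427692890167236, 3.4722506999969482, 4.0103137493133545,
    3.1220285892486572, 3.4692490100860596, 3.507798671722412,
    3.601055145263672, 3.3750500679016113, 2.8308238983154297,
    3.272127389907837,
]


def percentile(values, q):
    """q-th percentile (0..100) using linear interpolation between ranks."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for q in (5, 50, 90, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

Applying the same function to the first run reproduces its logged percentiles only if the failed request is counted at roughly 20.1 s, a value inferred from the logged mean rather than printed in the log. This suggests the timed-out request is included in that run's statistics, which would explain its inflated 90th to 99th percentiles.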
Pipeline stage StressChecker completed in 89.77s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.63s
Shutdown handler de-registered
function_nagab_2026-03-22 status is now deployed due to DeploymentManager action
function_nagab_2026-03-22 status is now inactive due to auto deactivation removed underperforming models
function_nagab_2026-03-22 status is now torndown due to DeploymentManager action