developer_uid: chai_backend_admin
submission_id: function_nabof_2025-12-18
model_name: function_nabof_2025-12-18
model_group:
status: torndown
timestamp: 2025-12-21T17:51:15+00:00
num_battles: 7313
num_wins: 4372
celo_rating: 1361.93
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_nabof_2025-12-18
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-21
win_ratio: 0.5978394639682757
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 3.8612704277038574s
Received healthy response to inference request in 2.466526508331299s
Received healthy response to inference request in 2.853973865509033s
Received healthy response to inference request in 3.0001986026763916s
Received healthy response to inference request in 2.679882287979126s
Received healthy response to inference request in 3.3007545471191406s
Received healthy response to inference request in 3.4183363914489746s
Received healthy response to inference request in 2.9562816619873047s
Received healthy response to inference request in 3.1734254360198975s
10 requests
1 failed requests
5th percentile: 2.5625366091728212
10th percentile: 2.658546710014343
20th percentile: 2.819155550003052
30th percentile: 2.9255893230438232
40th percentile: 2.9826318264007567
50th percentile: 3.0868120193481445
60th percentile: 3.2243570804595945
70th percentile: 3.3360291004180906
80th percentile: 3.506923198699951
90th percentile: 5.484675121307367
95th percentile: 12.789996242523177
99th percentile: 18.63425313949585
mean time: 4.7805967092514035
%s, retrying in %s seconds...
Received healthy response to inference request in 2.445157289505005s
Received healthy response to inference request in 2.752265214920044s
Received healthy response to inference request in 3.8618602752685547s
Received healthy response to inference request in 2.6042511463165283s
Received healthy response to inference request in 2.8159992694854736s
Received healthy response to inference request in 2.846195697784424s
Received healthy response to inference request in 2.772341728210449s
Received healthy response to inference request in 3.1298515796661377s
Received healthy response to inference request in 3.0026350021362305s
Received healthy response to inference request in 3.1146371364593506s
10 requests
0 failed requests
5th percentile: 2.5167495250701903
10th percentile: 2.588341760635376
20th percentile: 2.722662401199341
30th percentile: 2.7663187742233277
40th percentile: 2.798536252975464
50th percentile: 2.8310974836349487
60th percentile: 2.9087714195251464
70th percentile: 3.0362356424331667
80th percentile: 3.117680025100708
90th percentile: 3.203052449226379
95th percentile: 3.5324563622474665
99th percentile: 3.7959794926643373
mean time: 2.9345194339752196
Pipeline stage StressChecker completed in 79.92s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_nabof_2025-12-18 status is now deployed due to DeploymentManager action
function_nabof_2025-12-18 status is now inactive due to auto deactivation removed underperforming models
function_nabof_2025-12-18 status is now torndown due to DeploymentManager action