developer_uid: chai_backend_admin
submission_id: function_japib_2025-12-16
model_name: function_japib_2025-12-16
model_group:
status: torndown
timestamp: 2025-12-19T03:41:45+00:00
num_battles: 7257
num_wins: 3905
celo_rating: 1319.63
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_japib_2025-12-16
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-18
win_ratio: 0.5381011437233016
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
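The formatter entry above defines five string templates. The sketch below shows one plausible way they could be combined into a single model input. The function name `build_prompt`, the ordering of the pieces (memory, prompt, conversation turns, response header), and the `turns` structure are assumptions made for illustration, not the platform's confirmed behaviour. Truncation to `max_input_tokens` and the `truncate_by_message` flag are not modelled.

```python
# Templates copied from the formatter entry in the metadata above.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, bot_name, user_name, turns):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (role, message) pairs, where role is "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for role, message in turns:
        if role == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends without a newline so the model continues the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    memory="Be kind.",
    prompt="A chat.",
    bot_name="Bot",
    user_name="User",
    turns=[("user", "Hi"), ("bot", "Hello")],
)
```

With `stopping_words` set to `['\n']` in the generation parameters, a completion would end at the first newline, which fits a template whose response header leaves the bot's line open.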
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 3.680398464202881s
Received healthy response to inference request in 3.7072083950042725s
Received healthy response to inference request in 4.11907696723938s
Received healthy response to inference request in 4.146904230117798s
Received healthy response to inference request in 3.186937093734741s
Received healthy response to inference request in 3.352876663208008s
Received healthy response to inference request in 5.949553966522217s
Received healthy response to inference request in 3.1243951320648193s
Received healthy response to inference request in 3.479990005493164s
10 requests
1 failed requests
5th percentile: 3.152539014816284
10th percentile: 3.180682897567749
20th percentile: 3.3196887493133547
30th percentile: 3.4418560028076173
40th percentile: 3.6002350807189942
50th percentile: 3.6938034296035767
60th percentile: 3.871955823898315
70th percentile: 4.1274251461029055
80th percentile: 4.507434177398682
90th percentile: 7.369958519935603
95th percentile: 13.761779010295854
99th percentile: 18.875235402584078
mean time: 5.4900940418243405
%s, retrying in %s seconds...
Received healthy response to inference request in 4.522564649581909s
Received healthy response to inference request in 2.9744179248809814s
Received healthy response to inference request in 3.0238380432128906s
Received healthy response to inference request in 3.9901084899902344s
Received healthy response to inference request in 4.628681898117065s
Received healthy response to inference request in 2.5961015224456787s
Received healthy response to inference request in 4.491536855697632s
Received healthy response to inference request in 3.337998390197754s
Received healthy response to inference request in 2.8223512172698975s
Received healthy response to inference request in 3.0797324180603027s
10 requests
0 failed requests
5th percentile: 2.697913885116577
10th percentile: 2.7997262477874756
20th percentile: 2.9440045833587645
30th percentile: 3.009012007713318
40th percentile: 3.057374668121338
50th percentile: 3.2088654041290283
60th percentile: 3.598842430114746
70th percentile: 4.140536999702453
80th percentile: 4.497742414474487
90th percentile: 4.533176374435425
95th percentile: 4.580929136276245
99th percentile: 4.619131345748901
mean time: 3.5467331409454346
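The percentile and mean figures in the second StressChecker run above can be reproduced from the ten logged response times with linear interpolation between sorted samples. This is a minimal sketch: the helper name `percentile` is hypothetical, and the interpolation rule is inferred from the logged numbers rather than taken from the StressChecker source.

```python
def percentile(values, q):
    """Percentile by linear interpolation between closest ranks (q in 0..100)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile() needs at least one value")
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


# Response times (seconds) from the second run, in logged order.
latencies = [
    4.522564649581909,
    2.9744179248809814,
    3.0238380432128906,
    3.9901084899902344,
    4.628681898117065,
    2.5961015224456787,
    4.491536855697632,
    3.337998390197754,
    2.8223512172698975,
    3.0797324180603027,
]

p5 = percentile(latencies, 5)
p50 = percentile(latencies, 50)
p90 = percentile(latencies, 90)
mean_time = sum(latencies) / len(latencies)
```

The same rule also fits the first run, provided the failed request is counted at roughly the 20 s read timeout. That explains why its 95th and 99th percentiles sit far above every healthy response time.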
Pipeline stage StressChecker completed in 93.37s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.61s
Shutdown handler de-registered
function_japib_2025-12-16 status is now deployed due to DeploymentManager action
function_japib_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_japib_2025-12-16 status is now torndown due to DeploymentManager action