developer_uid: chai_backend_admin
submission_id: function_ganub_2025-12-16
model_name: function_ganub_2025-12-16
model_group:
status: torndown
timestamp: 2025-12-19T03:01:14+00:00
num_battles: 5084
num_wins: 2744
celo_rating: 1320.9
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_ganub_2025-12-16
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-18
win_ratio: 0.5397324940991345
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.7886734008789062s
Received healthy response to inference request in 3.1719095706939697s
Received healthy response to inference request in 10.34981107711792s
Received healthy response to inference request in 3.619753837585449s
Received healthy response to inference request in 3.497573137283325s
Received healthy response to inference request in 3.7948031425476074s
Received healthy response to inference request in 3.1916708946228027s
Received healthy response to inference request in 2.418794631958008s
Received healthy response to inference request in 2.8520960807800293s
10 requests
1 failed requests
5th percentile: 2.5852400779724123
10th percentile: 2.7516855239868163
20th percentile: 2.8394115447998045
30th percentile: 3.0759655237197876
40th percentile: 3.1837663650512695
50th percentile: 3.344622015953064
60th percentile: 3.5464454174041746
70th percentile: 3.6722686290740967
80th percentile: 5.105804729461671
90th percentile: 11.325810384750362
95th percentile: 15.717807269096365
99th percentile: 19.231404776573182
mean time: 5.57948899269104
%s, retrying in %s seconds...
Received healthy response to inference request in 2.7833681106567383s
Received healthy response to inference request in 3.346017837524414s
Received healthy response to inference request in 3.5999598503112793s
Received healthy response to inference request in 3.160688877105713s
Received healthy response to inference request in 3.7823691368103027s
Received healthy response to inference request in 2.403754472732544s
Received healthy response to inference request in 2.743940591812134s
Received healthy response to inference request in 3.8746323585510254s
Received healthy response to inference request in 3.6858489513397217s
Received healthy response to inference request in 2.1721017360687256s
10 requests
0 failed requests
5th percentile: 2.276345467567444
10th percentile: 2.3805891990661623
20th percentile: 2.6759033679962156
30th percentile: 2.7715398550033568
40th percentile: 3.0097605705261232
50th percentile: 3.2533533573150635
60th percentile: 3.44759464263916
70th percentile: 3.625726580619812
80th percentile: 3.705152988433838
90th percentile: 3.791595458984375
95th percentile: 3.8331139087677
99th percentile: 3.8663286685943605
mean time: 3.1552681922912598
Pipeline stage StressChecker completed in 90.23s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.55s
Shutdown handler de-registered
function_ganub_2025-12-16 status is now deployed due to DeploymentManager action
function_ganub_2025-12-16 status is now inactive due to admin request
function_ganub_2025-12-16 status is now torndown due to DeploymentManager action