developer_uid: chai_backend_admin
submission_id: function_duhef_2026-03-14
model_name: function_duhef_2026-03-14
model_group:
status: inactive
timestamp: 2026-03-15T00:17:50+00:00
num_battles: 10118
num_wins: 5123
celo_rating: 1298.23
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_duhef_2026-03-14
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-03-14
win_ratio: 0.5063253607432299
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
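The formatter templates above can be read as pieces of one prompt string. The following is a minimal sketch of how they might be combined, assuming the order memory, prompt, chat turns, response cue. The function name `render_prompt`, its arguments, and that assembly order are hypothetical and are not confirmed by this log; only the template strings are copied from the metadata.

```python
# Template strings copied from the formatter entry in the metadata.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def render_prompt(memory, prompt, turns, bot_name, user_name, fmt=FORMATTER):
    """Hypothetical assembly: memory, prompt, chat turns, then the response cue.

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". This ordering is an assumption, not taken from the log.
    """
    parts = [
        fmt["memory_template"].format(memory=memory),
        fmt["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(fmt["bot_template"].format(bot_name=bot_name, message=message))
        else:
            parts.append(fmt["user_template"].format(user_name=user_name, message=message))
    # The response cue ends with "<bot_name>:" so the model continues the bot's line.
    parts.append(fmt["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    print(render_prompt("Be kind.", "A chat.", [("user", "hi")], "Bot", "User"))
```

Since stopping_words is ['\n'] in generation_params, a completion under this layout would end at the first newline, that is, after a single bot message.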
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 4.089763164520264s
Received healthy response to inference request in 3.162933349609375s
Received healthy response to inference request in 2.255812168121338s
Received healthy response to inference request in 3.2808594703674316s
Received healthy response to inference request in 2.1573164463043213s
Received healthy response to inference request in 2.5549111366271973s
Received healthy response to inference request in 8.649261236190796s
Received healthy response to inference request in 13.479644536972046s
Received healthy response to inference request in 2.501100778579712s
10 requests
1 failed requests
5th percentile: 2.2016395211219786
10th percentile: 2.2459625959396363
20th percentile: 2.452043056488037
30th percentile: 2.5387680292129517
40th percentile: 2.919724464416504
50th percentile: 3.2218964099884033
60th percentile: 3.604420948028564
70th percentile: 5.457612586021423
80th percentile: 9.615337896347047
90th percentile: 14.434988880157468
95th percentile: 18.73403842449187
99th percentile: 22.17327805995941
mean time: 6.516469025611878
%s, retrying in %s seconds...
Received healthy response to inference request in 7.225813388824463s
Received healthy response to inference request in 3.863793134689331s
Received healthy response to inference request in 2.5644643306732178s
Received healthy response to inference request in 3.2489356994628906s
Received healthy response to inference request in 2.673135995864868s
Received healthy response to inference request in 2.164222240447998s
Received healthy response to inference request in 3.1200907230377197s
Received healthy response to inference request in 3.6494147777557373s
Received healthy response to inference request in 3.176819324493408s
Received healthy response to inference request in 3.290041208267212s
10 requests
0 failed requests
5th percentile: 2.344331181049347
10th percentile: 2.524440121650696
20th percentile: 2.651401662826538
30th percentile: 2.986004304885864
40th percentile: 3.1541278839111326
50th percentile: 3.2128775119781494
60th percentile: 3.265377902984619
70th percentile: 3.3978532791137694
80th percentile: 3.692290449142456
90th percentile: 4.199995160102843
95th percentile: 5.71290427446365
99th percentile: 6.923231565952301
mean time: 3.4976730823516844
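The percentile and mean figures for this second run are consistent with linear interpolation between sorted samples, which is the default behaviour of numpy.percentile. Below is a minimal pure-Python sketch that reproduces them from the ten latencies logged above. The helper name `percentile` is hypothetical; the StressChecker's actual implementation is not shown in this log.

```python
# Latencies (seconds) of the ten healthy responses in the second run.
LATENCIES = [
    7.225813388824463, 3.863793134689331, 2.5644643306732178,
    3.2489356994628906, 2.673135995864868, 2.164222240447998,
    3.1200907230377197, 3.6494147777557373, 3.176819324493408,
    3.290041208267212,
]


def percentile(values, q):
    """q-th percentile (0-100) using linear interpolation between ranks."""
    ordered = sorted(values)
    # Fractional rank into the sorted list: 0 for q=0, len-1 for q=100.
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(LATENCIES, q)}")
    print(f"mean time: {sum(LATENCIES) / len(LATENCIES)}")
```

With only ten samples, the 95th and 99th percentiles are interpolated between the two slowest requests, so a single slow response dominates the tail figures.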
Pipeline stage StressChecker completed in 187.50s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 8.06s
Shutdown handler de-registered
function_duhef_2026-03-14 status is now deployed due to DeploymentManager action
function_duhef_2026-03-14 status is now inactive due to auto deactivation removed underperforming models