developer_uid: chai_backend_admin
submission_id: function_hejas_2026-03-18
model_name: function_hejas_2026-03-18
model_group:
status: torndown
timestamp: 2026-03-21T15:21:26+00:00
num_battles: 10394
num_wins: 6052
celo_rating: 1351.94
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_hejas_2026-03-18
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-03-18
win_ratio: 0.5822589955743698
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 3.461315870285034s
Received healthy response to inference request in 4.6858954429626465s
Received healthy response to inference request in 3.2507553100585938s
Received healthy response to inference request in 2.813408851623535s
Received healthy response to inference request in 3.6432688236236572s
Received healthy response to inference request in 3.7010817527770996s
Received healthy response to inference request in 3.902419328689575s
Received healthy response to inference request in 2.9074387550354004s
Received healthy response to inference request in 3.5119383335113525s
10 requests
1 failed requests
5th percentile: 2.8557223081588745
10th percentile: 2.898035764694214
20th percentile: 3.182091999053955
30th percentile: 3.398147702217102
40th percentile: 3.491689348220825
50th percentile: 3.577603578567505
60th percentile: 3.666393995285034
70th percentile: 3.7614830255508425
80th percentile: 4.05911455154419
90th percentile: 6.230919528007502
95th percentile: 13.183527910709365
99th percentile: 18.74561461687088
mean time: 5.201365876197815
%s, retrying in %s seconds...
Received healthy response to inference request in 4.945973634719849s
Received healthy response to inference request in 3.5929527282714844s
Received healthy response to inference request in 4.047801971435547s
Received healthy response to inference request in 3.8745226860046387s
Received healthy response to inference request in 3.4462873935699463s
Received healthy response to inference request in 4.040365695953369s
Received healthy response to inference request in 3.335975170135498s
Received healthy response to inference request in 4.06137490272522s
Received healthy response to inference request in 5.2073423862457275s
Received healthy response to inference request in 3.416839599609375s
10 requests
0 failed requests
5th percentile: 3.3723641633987427
10th percentile: 3.4087531566619873
20th percentile: 3.440397834777832
30th percentile: 3.548953127861023
40th percentile: 3.761894702911377
50th percentile: 3.957444190979004
60th percentile: 4.04334020614624
70th percentile: 4.0518738508224486
80th percentile: 4.238294649124145
90th percentile: 4.9721105098724365
95th percentile: 5.089726448059082
99th percentile: 5.183819198608399
mean time: 3.9969436168670653
Pipeline stage StressChecker completed in 96.54s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.63s
Shutdown handler de-registered
function_hejas_2026-03-18 status is now deployed due to DeploymentManager action
function_hejas_2026-03-18 status is now inactive due to auto deactivation removed underperforming models
function_hejas_2026-03-18 status is now torndown due to DeploymentManager action