developer_uid: chai_backend_admin
submission_id: function_romuf_2026-03-23
model_name: function_romuf_2026-03-23
model_group:
status: torndown
timestamp: 2026-04-03T00:07:03+00:00
num_battles: 10553
num_wins: 5974
celo_rating: 8489.56
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_romuf_2026-03-23
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-03-23
win_ratio: 0.5660949493035156
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
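The `formatter` templates above can be read as pieces of a single prompt string. The following is a minimal sketch of how they might be assembled, not the platform's actual code. The assembly order (memory, prompt, chat turns, response header), the `build_prompt` helper, and the sample names and messages are all assumptions for illustration. Truncation to `max_input_tokens` is not modeled.

```python
# Templates copied from the `formatter` field of this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name, formatter=FORMATTER):
    """Hypothetical helper: join the templates into one prompt string.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user". The ordering used here is an assumption.
    """
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues
    # the bot's next line; generation stops at the first "\n" stopping word.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Illustrative values only; none of these come from the record.
example = build_prompt(
    memory="Ava is a friendly guide.",
    prompt="Ava greets a traveller.",
    turns=[("bot", "Hello there!"), ("user", "Hi, who are you?")],
    bot_name="Ava",
    user_name="Sam",
)
print(example)
```

Because the response template leaves the bot's line open and `stopping_words` is `['\n']`, each generation under this layout would be a single line of at most `max_output_tokens` tokens.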
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 11.885101318359375s
Received healthy response to inference request in 10.084250688552856s
Received healthy response to inference request in 9.719635963439941s
Received healthy response to inference request in 7.412137508392334s
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 7.997490167617798s
Received healthy response to inference request in 6.01354193687439s
Received healthy response to inference request in 6.087342262268066s
Received healthy response to inference request in 11.13361120223999s
Received healthy response to inference request in 8.681321144104004s
10 requests
1 failed requests
5th percentile: 6.046752083301544
10th percentile: 6.079962229728698
20th percentile: 7.14717845916748
30th percentile: 7.821884369850158
40th percentile: 8.407788753509521
50th percentile: 9.200478553771973
60th percentile: 9.865481853485107
70th percentile: 10.399058842658997
80th percentile: 11.283909225463868
90th percentile: 12.712831830978391
95th percentile: 16.437619137763967
99th percentile: 19.417448983192443
mean time: 9.917683863639832
%s, retrying in %s seconds...
Received healthy response to inference request in 6.121027708053589s
Received healthy response to inference request in 9.99253511428833s
Received healthy response to inference request in 8.484567403793335s
Received healthy response to inference request in 9.861217737197876s
Received healthy response to inference request in 7.003811359405518s
Received healthy response to inference request in 9.158156633377075s
Received healthy response to inference request in 6.049315452575684s
Received healthy response to inference request in 7.179944038391113s
Received healthy response to inference request in 6.113134145736694s
Received healthy response to inference request in 7.383630275726318s
10 requests
0 failed requests
5th percentile: 6.078033864498138
10th percentile: 6.106752276420593
20th percentile: 6.11944899559021
30th percentile: 6.738976263999938
40th percentile: 7.109490966796875
50th percentile: 7.281787157058716
60th percentile: 7.824005126953124
70th percentile: 8.686644172668457
80th percentile: 9.298768854141235
90th percentile: 9.874349474906921
95th percentile: 9.933442294597626
99th percentile: 9.98071655035019
mean time: 7.734733986854553
Pipeline stage StressChecker completed in 181.44s
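The percentile and mean figures of the second StressChecker run can be reproduced from its ten logged latencies. The sketch below assumes linear interpolation between closest ranks, which is inferred from the numbers and not confirmed by the log; the `percentile` helper is hypothetical. The first run cannot be recomputed the same way because the duration of its timed-out request is not logged.

```python
import math

# Latencies (seconds) of the ten healthy responses in the second run,
# copied from the log lines above.
LATENCIES_S = [
    6.121027708053589,
    9.99253511428833,
    8.484567403793335,
    9.861217737197876,
    7.003811359405518,
    9.158156633377075,
    6.049315452575684,
    7.179944038391113,
    6.113134145736694,
    7.383630275726318,
]


def percentile(values, q):
    """Return the q-th percentile (0 <= q <= 100) of `values`.

    Uses linear interpolation between the two closest ranks. This method
    is an assumption chosen because it matches the logged figures.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(LATENCIES_S, q)}")

mean_time = sum(LATENCIES_S) / len(LATENCIES_S)
print(f"mean time: {mean_time}")
```

With only ten samples, the upper percentiles are dominated by the one or two slowest requests, so the 95th and 99th values in either run are best read as rough indicators.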
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.62s
Shutdown handler de-registered
function_romuf_2026-03-23 status is now deployed due to DeploymentManager action
function_romuf_2026-03-23 status is now inactive due to auto deactivation removed underperforming models
function_romuf_2026-03-23 status is now torndown due to DeploymentManager action