developer_uid: chai_backend_admin
submission_id: function_dudol_2026-03-09
model_name: function_dudol_2026-03-09
model_group:
status: inactive
timestamp: 2026-03-09T05:27:21+00:00
num_battles: 10489
num_wins: 4990
celo_rating: 1277.52
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_dudol_2026-03-09
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-03-08
win_ratio: 0.475736485842311
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 1.8109254837036133s
Received healthy response to inference request in 2.2586541175842285s
Received healthy response to inference request in 3.5174362659454346s
Received healthy response to inference request in 3.256331205368042s
Received healthy response to inference request in 1.7752246856689453s
Received healthy response to inference request in 15.868122339248657s
Received healthy response to inference request in 8.486266613006592s
Received healthy response to inference request in 2.7517316341400146s
Received healthy response to inference request in 13.114710330963135s
10 requests
1 failed requests
5th percentile: 1.791290044784546
10th percentile: 1.8073554039001465
20th percentile: 2.1691083908081055
30th percentile: 2.6038083791732785
40th percentile: 3.054491376876831
50th percentile: 3.3868837356567383
60th percentile: 5.504968404769895
70th percentile: 9.874799728393555
80th percentile: 13.66539273262024
90th percentile: 16.44151337146759
95th percentile: 19.021773016452784
99th percentile: 21.085980732440948
mean time: 7.444143533706665
%s, retrying in %s seconds...
Received healthy response to inference request in 1.5379810333251953s
Received healthy response to inference request in 1.9485857486724854s
Received healthy response to inference request in 12.260023832321167s
Received healthy response to inference request in 6.045171737670898s
Received healthy response to inference request in 3.110175609588623s
Received healthy response to inference request in 3.801027536392212s
Received healthy response to inference request in 2.5093801021575928s
Received healthy response to inference request in 3.105769395828247s
Received healthy response to inference request in 2.6452078819274902s
Received healthy response to inference request in 12.535400867462158s
10 requests
0 failed requests
5th percentile: 1.7227531552314759
10th percentile: 1.9075252771377564
20th percentile: 2.397221231460571
30th percentile: 2.604459547996521
40th percentile: 2.9215447902679443
50th percentile: 3.107972502708435
60th percentile: 3.3865163803100584
70th percentile: 4.4742707967758175
80th percentile: 7.288142156600953
90th percentile: 12.287561535835266
95th percentile: 12.411481201648712
99th percentile: 12.510616934299469
mean time: 4.949872374534607
Pipeline stage StressChecker completed in 197.72s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 6.29s
Shutdown handler de-registered
function_dudol_2026-03-09 status is now deployed due to DeploymentManager action
function_dudol_2026-03-09 status is now inactive due to auto deactivation removed underperforming models