developer_uid: chai_backend_admin
submission_id: function_kupas_2026-04-17
model_name: function_kupas_2026-04-17
model_group:
status: deployed
timestamp: 2026-04-17T00:20:40+00:00
num_battles: 5488
num_wins: 2599
celo_rating: 0.0
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_kupas_2026-04-17
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-04-16
win_ratio: 0.4735787172011662
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 19.342326641082764s
Received healthy response to inference request in 11.462585926055908s
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 14.440340757369995s
Received healthy response to inference request in 19.440311193466187s
Received healthy response to inference request in 12.246062994003296s
Received healthy response to inference request in 10.319799184799194s
Received healthy response to inference request in 5.052949666976929s
10 requests
1 failed requests
5th percentile: 5.505553770065307
10th percentile: 5.958157873153686
20th percentile: 7.046320104598999
30th percentile: 10.05763440132141
40th percentile: 11.374505615234375
50th percentile: 11.854324460029602
60th percentile: 12.529909563064574
70th percentile: 13.401077818870544
80th percentile: 15.42073793411255
90th percentile: 19.419272208213805
95th percentile: 19.765527260303497
99th percentile: 20.04253130197525
mean time: 12.020606541633606
%s, retrying in %s seconds...
Received healthy response to inference request in 14.851062774658203s
Received healthy response to inference request in 18.91890835762024s
Received healthy response to inference request in 4.648874521255493s
Received healthy response to inference request in 14.849674701690674s
Received healthy response to inference request in 16.49593210220337s
Received healthy response to inference request in 9.256165504455566s
Received healthy response to inference request in 19.848787307739258s
Received healthy response to inference request in 7.8036274909973145s
Received healthy response to inference request in 7.609707355499268s
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 9.18372106552124s
Received healthy response to inference request in 14.064853429794312s
Received healthy response to inference request in 6.145359754562378s
10 requests
2 failed requests
5th percentile: 5.3222928762435915
10th percentile: 5.99571123123169
20th percentile: 8.576048803329467
30th percentile: 9.234432172775268
40th percentile: 9.894345712661742
50th percentile: 13.407865643501282
60th percentile: 17.465122604370116
70th percentile: 19.075329208374022
80th percentile: 19.58931245803833
90th percentile: 20.18565378189087
95th percentile: 20.18716697692871
99th percentile: 20.188377532958985
mean time: 13.478306937217713
%s, retrying in %s seconds...
Received healthy response to inference request in 11.682910919189453s
Received healthy response to inference request in 7.03119158744812s
Received healthy response to inference request in 19.90036916732788s
Received healthy response to inference request in 8.310869693756104s
Received healthy response to inference request in 15.179409742355347s
Received healthy response to inference request in 3.5084731578826904s
Received healthy response to inference request in 4.692002534866333s
Received healthy response to inference request in 7.155503749847412s
Received healthy response to inference request in 19.60551142692566s
10 requests
0 failed requests
5th percentile: 7.291523683071136
10th percentile: 7.5518557786941525
20th percentile: 7.764843463897705
30th percentile: 10.51912589073181
40th percentile: 13.112076425552369
50th percentile: 14.457264065742493
60th percentile: 14.850229930877685
70th percentile: 14.949566864967347
80th percentile: 16.06463007926941
90th percentile: 19.62983901500702
95th percentile: 19.73931316137314
99th percentile: 19.826892478466036
mean time: 13.252673673629761
%s, retrying in %s seconds...
Received healthy response to inference request in 5.208148717880249s
Received healthy response to inference request in 8.371894121170044s
Received healthy response to inference request in 14.466280698776245s
Received healthy response to inference request in 6.648926734924316s
Received healthy response to inference request in 4.394593000411987s
Received healthy response to inference request in 6.257838010787964s
Received healthy response to inference request in 7.895441770553589s
Received healthy response to inference request in 18.995288610458374s
Received healthy response to inference request in 4.275354385375977s
Received healthy response to inference request in 4.324937343597412s
10 requests
0 failed requests
5th percentile: 3.8758820414543154
10th percentile: 4.24329092502594
20th percentile: 4.6185894966125485
30th percentile: 5.053304862976074
40th percentile: 6.07261552810669
50th percentile: 6.902215242385864
60th percentile: 7.617650127410888
70th percentile: 8.329177021980286
80th percentile: 10.496573019027712
90th percentile: 19.085796666145324
95th percentile: 19.4930829167366
99th percentile: 19.818911917209626
mean time: 8.71164138317108
Pipeline stage StressChecker completed in 239.80s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.88s
Shutdown handler de-registered
function_kupas_2026-04-17 status is now deployed due to DeploymentManager action