developer_uid: chai_backend_admin
submission_id: function_fufof_2025-12-20
model_name: function_fufof_2025-12-20
model_group:
status: torndown
timestamp: 2025-12-23T15:41:19+00:00
num_battles: 6356
num_wins: 3213
celo_rating: 1296.96
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_fufof_2025-12-20
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-23
win_ratio: 0.5055066079295154
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
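
The `formatter` entry above only lists templates; the log does not show how they are combined. A minimal sketch follows, assuming the pieces are concatenated in the order memory, prompt, chat turns, response header. The function name `build_prompt`, the `turns` tuple layout, and the concatenation order are hypothetical and for illustration only. Truncation (`max_input_tokens`, `truncate_by_message`) is not modeled.

```python
# Templates copied from the `formatter` line of the log.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(formatter, memory, prompt, bot_name, turns):
    """Assemble one model input string from the templates.

    `turns` is a list of (role, name, message) tuples, role being
    "bot" or "user". The ordering used here is an assumption, not
    something the log confirms.
    """
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for role, name, message in turns:
        if role == "bot":
            parts.append(formatter["bot_template"].format(bot_name=name, message=message))
        else:
            parts.append(formatter["user_template"].format(user_name=name, message=message))
    # The response header ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    example = build_prompt(
        FORMATTER,
        memory="Bot is a friendly guide.",
        prompt="A short chat.",
        bot_name="Bot",
        turns=[("user", "Ann", "hi"), ("bot", "Bot", "hello")],
    )
    print(example)
```

Because `stopping_words` is `['\n']` in `generation_params`, a completion generated after the trailing `{bot_name}:` would stop at the first newline, which fits the one-message-per-line layout of the templates.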
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 9.81348967552185s
Received healthy response to inference request in 10.587587833404541s
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 11.372879981994629s
Received healthy response to inference request in 7.17824125289917s
Received healthy response to inference request in 18.260130405426025s
Received healthy response to inference request in 3.433598279953003s
Received healthy response to inference request in 5.477665662765503s
Received healthy response to inference request in 6.48881196975708s
Received healthy response to inference request in 11.179973125457764s
10 requests
1 failed requests
5th percentile: 4.353428602218628
10th percentile: 5.273258924484253
20th percentile: 6.286582708358765
30th percentile: 6.971412467956543
40th percentile: 8.759390306472778
50th percentile: 10.200538754463196
60th percentile: 10.82454195022583
70th percentile: 11.237845182418823
80th percentile: 12.75033006668091
90th percentile: 18.7575670003891
95th percentile: 20.996031677722925
99th percentile: 22.786803419589997
mean time: 10.702687454223632
%s, retrying in %s seconds...
Received healthy response to inference request in 3.3695147037506104s
Received healthy response to inference request in 6.377387285232544s
Received healthy response to inference request in 10.572070598602295s
Received healthy response to inference request in 18.9285306930542s
Received healthy response to inference request in 1.6225254535675049s
Received healthy response to inference request in 6.213213205337524s
Received healthy response to inference request in 14.852409362792969s
Received healthy response to inference request in 5.439704179763794s
Received healthy response to inference request in 1.7161118984222412s
Received healthy response to inference request in 8.893122673034668s
10 requests
0 failed requests
5th percentile: 1.6646393537521362
10th percentile: 1.7067532539367676
20th percentile: 3.038834142684937
30th percentile: 4.818647336959838
40th percentile: 5.903809595108032
50th percentile: 6.295300245285034
60th percentile: 7.383681440353392
70th percentile: 9.396807050704956
80th percentile: 11.428138351440431
90th percentile: 15.26002149581909
95th percentile: 17.09427609443664
99th percentile: 18.561679773330688
mean time: 7.798459005355835
Pipeline stage StressChecker completed in 187.74s
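
The percentile and mean lines of the second StressChecker run can be recomputed from its ten logged latencies. The sketch below assumes linear interpolation between order statistics (the default behaviour of `numpy.percentile`). The log does not say which method StressChecker uses, and the helper name `percentile` is hypothetical. With that assumption the results should agree with the logged figures up to floating-point rounding.

```python
import math


def percentile(values, q):
    """Percentile q (0..100) of a non-empty list, using linear
    interpolation between the two nearest order statistics."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


# Latencies (seconds) of the second run, copied from the log above.
latencies = [
    3.3695147037506104,
    6.377387285232544,
    10.572070598602295,
    18.9285306930542,
    1.6225254535675049,
    6.213213205337524,
    14.852409362792969,
    5.439704179763794,
    1.7161118984222412,
    8.893122673034668,
]

if __name__ == "__main__":
    print(f"{len(latencies)} requests")
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {sum(latencies) / len(latencies)}")
```

The first run cannot be reproduced the same way from the log alone, because the duration recorded for the timed-out request is not printed.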
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.15s
Shutdown handler de-registered
function_fufof_2025-12-20 status is now deployed due to DeploymentManager action
function_fufof_2025-12-20 status is now inactive due to auto deactivation removed underperforming models
function_fufof_2025-12-20 status is now torndown due to DeploymentManager action