developer_uid: chai_backend_admin
submission_id: function_jibin_2026-03-20
model_name: function_jibin_2026-03-20
model_group:
status: torndown
timestamp: 2026-03-23T23:51:34+00:00
num_battles: 10271
num_wins: 5615
celo_rating: 1328.87
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_jibin_2026-03-20
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-03-20
win_ratio: 0.5466848408139422
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
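The formatter templates above can be illustrated with a minimal sketch of how a prompt might be assembled from them. The template strings are copied from the `formatter` entry. The assembly order (memory, prompt, chat turns, response header), the `build_prompt` helper, and the sample names and messages are assumptions made for illustration, not the backend's confirmed behaviour.

```python
# Template strings copied from the `formatter` metadata entry.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order."""
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Sample values are invented for the example.
text = build_prompt(
    memory="You are Ava.",
    prompt="A chat.",
    turns=[("user", "Hi"), ("bot", "Hello")],
    bot_name="Ava",
    user_name="Sam",
)
print(text)
```

Because `stopping_words` in `generation_params` is `['\n']`, generation would stop at the first newline, so each completion is a single line continuing the final `{bot_name}:` header.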
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.866692304611206s
Received healthy response to inference request in 3.636582612991333s
Received healthy response to inference request in 6.689462661743164s
Received healthy response to inference request in 3.234739303588867s
Received healthy response to inference request in 3.741097927093506s
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 18.027227640151978s
Received healthy response to inference request in 3.001507520675659s
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 7.66003942489624s
10 requests
2 failed requests
5th percentile: 3.106461822986603
10th percentile: 3.2114161252975464
20th percentile: 3.55621395111084
30th percentile: 3.709743332862854
40th percentile: 3.816454553604126
50th percentile: 5.278077483177185
60th percentile: 7.077693367004394
70th percentile: 10.770195889472959
80th percentile: 18.44410071372986
90th percentile: 20.114186239242553
95th percentile: 20.12585577964783
99th percentile: 20.135191411972045
mean time: 9.010646772384643
%s, retrying in %s seconds...
Received healthy response to inference request in 6.135896444320679s
Received healthy response to inference request in 17.532520532608032s
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 5.945607662200928s
Received healthy response to inference request in 3.9504337310791016s
Received healthy response to inference request in 16.958690404891968s
Received healthy response to inference request in 17.293620824813843s
Received healthy response to inference request in 3.3700244426727295s
Received healthy response to inference request in 5.006876468658447s
10 requests
2 failed requests
5th percentile: 3.631208622455597
10th percentile: 3.892392802238464
20th percentile: 4.7955879211425785
30th percentile: 5.663988304138183
40th percentile: 6.059780931472778
50th percentile: 11.547293424606323
60th percentile: 17.092662572860718
70th percentile: 17.3652907371521
80th percentile: 18.074865579605103
90th percentile: 20.60370123386383
95th percentile: 22.221250832080838
99th percentile: 23.51529051065445
mean time: 12.027671670913696
%s, retrying in %s seconds...
Received healthy response to inference request in 3.132253408432007s
Received healthy response to inference request in 2.6242640018463135s
Received healthy response to inference request in 6.228342533111572s
Received healthy response to inference request in 8.525884866714478s
Received healthy response to inference request in 3.6715121269226074s
Received healthy response to inference request in 3.8815221786499023s
Received healthy response to inference request in 3.051013708114624s
Received healthy response to inference request in 2.862942695617676s
Received healthy response to inference request in 3.960287570953369s
Received healthy response to inference request in 3.3070476055145264s
10 requests
0 failed requests
5th percentile: 2.7316694140434263
10th percentile: 2.8390748262405396
20th percentile: 3.0133995056152343
30th percentile: 3.107881498336792
40th percentile: 3.2371299266815186
50th percentile: 3.489279866218567
60th percentile: 3.7555161476135255
70th percentile: 3.9051517963409426
80th percentile: 4.41389856338501
90th percentile: 6.458096766471862
95th percentile: 7.491990816593168
99th percentile: 8.319106056690217
mean time: 4.1245070695877075
Pipeline stage StressChecker completed in 257.53s
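The percentile and mean figures reported for the third StressChecker run (ten healthy responses, zero failures) are consistent with linearly interpolated percentiles over the sorted latencies. The sketch below reproduces them from the logged response times. The `percentile` helper is hypothetical, and the interpolation scheme is inferred from the numbers, not taken from the StressChecker source.

```python
def percentile(values, q):
    """Linearly interpolated percentile, with q given in the range 0 to 100."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


# Latencies in seconds, copied from the ten healthy responses of the third run.
latencies = [
    3.132253408432007,
    2.6242640018463135,
    6.228342533111572,
    8.525884866714478,
    3.6715121269226074,
    3.8815221786499023,
    3.051013708114624,
    2.862942695617676,
    3.960287570953369,
    3.3070476055145264,
]

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")

mean_time = sum(latencies) / len(latencies)
print(f"mean time: {mean_time}")
```

In the two earlier runs, each read timeout (limit 20 s) appears to be counted with an elapsed time slightly above 20 s, which would explain why the upper percentiles of those runs sit just over the timeout value.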
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.64s
Shutdown handler de-registered
function_jibin_2026-03-20 status is now deployed due to DeploymentManager action
function_jibin_2026-03-20 status is now inactive due to auto deactivation removed underperforming models
function_jibin_2026-03-20 status is now torndown due to DeploymentManager action