developer_uid: chai_backend_admin
submission_id: function_gujis_2026-03-18
model_name: function_gujis_2026-03-18
model_group:
status: inactive
timestamp: 2026-03-19T18:35:05+00:00
num_battles: 6434
num_wins: 3457
celo_rating: 1320.15
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_gujis_2026-03-18
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-03-19
win_ratio: 0.5373018340068386
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 6.5757036209106445s
Received healthy response to inference request in 6.367870330810547s
Received healthy response to inference request in 2.798578977584839s
Received healthy response to inference request in 5.194252014160156s
Received healthy response to inference request in 3.5168285369873047s
Received healthy response to inference request in 5.993249177932739s
Received healthy response to inference request in 5.286722660064697s
Received healthy response to inference request in 4.701222896575928s
Received healthy response to inference request in 2.672816753387451s
10 requests
1 failed requests
5th percentile: 2.7294097542762756
10th percentile: 2.7860027551651
20th percentile: 3.3731786251068114
30th percentile: 4.345904588699341
40th percentile: 4.9970403671264645
50th percentile: 5.240487337112427
60th percentile: 5.569333267211913
70th percentile: 6.1056355237960815
80th percentile: 6.409436988830566
90th percentile: 7.92925837039947
95th percentile: 14.020254743099198
99th percentile: 18.893051841259005
mean time: 6.321849608421326
%s, retrying in %s seconds...
Received healthy response to inference request in 3.2211198806762695s
Received healthy response to inference request in 3.5244874954223633s
Received healthy response to inference request in 3.7857108116149902s
Received healthy response to inference request in 3.072842836380005s
Received healthy response to inference request in 3.128237009048462s
Received healthy response to inference request in 3.0465009212493896s
Received healthy response to inference request in 3.143993854522705s
Received healthy response to inference request in 3.1629509925842285s
Received healthy response to inference request in 2.7248148918151855s
Received healthy response to inference request in 4.164239168167114s
10 requests
0 failed requests
5th percentile: 2.8695736050605776
10th percentile: 3.014332318305969
20th percentile: 3.067574453353882
30th percentile: 3.111618757247925
40th percentile: 3.137691116333008
50th percentile: 3.153472423553467
60th percentile: 3.1862185478210447
70th percentile: 3.3121301651000974
80th percentile: 3.5767321586608887
90th percentile: 3.8235636472702024
95th percentile: 3.993901407718658
99th percentile: 4.130171616077424
mean time: 3.297489786148071
Pipeline stage StressChecker completed in 99.44s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.18s
Shutdown handler de-registered
function_gujis_2026-03-18 status is now deployed due to DeploymentManager action
function_gujis_2026-03-18 status is now inactive due to system request