developer_uid: chai_backend_admin
submission_id: function_silas_2026-03-11
model_name: function_silas_2026-03-11
model_group:
status: inactive
timestamp: 2026-03-11T04:37:42+00:00
num_battles: 11725
num_wins: 5921
celo_rating: 1297.77
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_silas_2026-03-11
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-03-10
win_ratio: 0.5049893390191897
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.114189386367798s
Received healthy response to inference request in 4.383909225463867s
Received healthy response to inference request in 5.874801397323608s
Received healthy response to inference request in 4.909602165222168s
Received healthy response to inference request in 2.288288116455078s
Received healthy response to inference request in 4.002796411514282s
Received healthy response to inference request in 3.1849465370178223s
Received healthy response to inference request in 6.09546160697937s
Received healthy response to inference request in 4.335964679718018s
10 requests
1 failed requests
5th percentile: 2.192533814907074
10th percentile: 2.27087824344635
20th percentile: 3.0056148529052735
30th percentile: 3.757441449165344
40th percentile: 4.202697372436523
50th percentile: 4.359936952590942
60th percentile: 4.594186401367187
70th percentile: 5.1991619348526
80th percentile: 5.918933439254761
90th percentile: 7.894390773773187
95th percentile: 15.989572024345378
99th percentile: 22.465717024803165
mean time: 6.127471280097962
%s, retrying in %s seconds...
Received healthy response to inference request in 5.3409583568573s
Received healthy response to inference request in 3.1952853202819824s
Received healthy response to inference request in 3.424325704574585s
Received healthy response to inference request in 5.796450138092041s
Received healthy response to inference request in 4.683617115020752s
Received healthy response to inference request in 2.2998476028442383s
Received healthy response to inference request in 3.531553268432617s
Received healthy response to inference request in 5.381633043289185s
Received healthy response to inference request in 4.485506772994995s
Received healthy response to inference request in 5.49631667137146s
10 requests
0 failed requests
5th percentile: 2.702794575691223
10th percentile: 3.105741548538208
20th percentile: 3.3785176277160645
30th percentile: 3.4993849992752075
40th percentile: 4.103925371170044
50th percentile: 4.5845619440078735
60th percentile: 4.946553611755371
70th percentile: 5.353160762786866
80th percentile: 5.40456976890564
90th percentile: 5.5263300180435175
95th percentile: 5.661390078067779
99th percentile: 5.769438126087189
mean time: 4.363549399375915
Pipeline stage StressChecker completed in 186.28s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 7.37s
Shutdown handler de-registered
function_silas_2026-03-11 status is now deployed due to DeploymentManager action
function_silas_2026-03-11 status is now inactive due to auto deactivation removed underperforming models