developer_uid: chai_backend_admin
submission_id: function_rohem_2025-09-08
model_name: function_rohem_2025-09-08
model_group:
status: torndown
timestamp: 2025-09-08T21:30:55+00:00
num_battles: 5760
num_wins: 3312
celo_rating: 1303.85
family_friendly_score: 0.5232
family_friendly_standard_error: 0.007063451847361883
submission_type: function
display_name: function_rohem_2025-09-08
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-09-08
win_ratio: 0.575
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
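Note on the formatter: the templates above can be read as pieces of one prompt string. The sketch below is a minimal illustration, not the platform's actual code. The helper name `build_prompt`, the concatenation order (memory, prompt, chat turns, response header), and the sample persona and messages are all assumptions made for the example; only the template strings are copied from the record.

```python
# Template strings copied from the `formatter` field of this record.
formatter = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order."""
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "<bot_name>:" so the model continues the bot's line.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Illustrative inputs only; none of these values come from the record.
example = build_prompt(
    memory="Ava is a cheerful guide.",
    prompt="Ava greets a traveller.",
    turns=[("user", "Hi!"), ("bot", "Welcome!")],
    bot_name="Ava",
    user_name="Sam",
)
print(example)
```

Because `stopping_words` is `['\n']` and every chat template ends in a newline, a generation under these settings would presumably stop at the end of a single bot line.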
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 4.052913427352905s
Received healthy response to inference request in 2.0063586235046387s
Received healthy response to inference request in 1.888012409210205s
Received healthy response to inference request in 3.6612796783447266s
5 requests
1 failed requests
5th percentile: 1.9116816520690918
10th percentile: 1.9353508949279785
20th percentile: 1.982689380645752
30th percentile: 2.3373428344726563
40th percentile: 2.9993112564086917
50th percentile: 3.6612796783447266
60th percentile: 3.817933177947998
70th percentile: 3.9745866775512693
80th percentile: 7.268728542327883
90th percentile: 13.700358772277834
95th percentile: 16.916173887252803
99th percentile: 19.488825979232786
mean time: 6.348110628128052
%s, retrying in %s seconds...
Received healthy response to inference request in 1.7945740222930908s
Received healthy response to inference request in 3.132450819015503s
Received healthy response to inference request in 2.3526368141174316s
Received healthy response to inference request in 1.793929100036621s
Received healthy response to inference request in 3.26767635345459s
5 requests
0 failed requests
5th percentile: 1.794058084487915
10th percentile: 1.794187068939209
20th percentile: 1.7944450378417969
30th percentile: 1.906186580657959
40th percentile: 2.1294116973876953
50th percentile: 2.3526368141174316
60th percentile: 2.6645624160766603
70th percentile: 2.9764880180358886
80th percentile: 3.1594959259033204
90th percentile: 3.213586139678955
95th percentile: 3.2406312465667724
99th percentile: 3.2622673320770263
mean time: 2.468253421783447
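Note on the latency summary: the percentile figures in the second batch are consistent with linear interpolation between sorted samples, which is the same rule NumPy's `percentile` uses by default. The sketch below recomputes them from the five logged latencies. The `percentile` function is a hand-written stand-in for illustration, not the StressChecker's own code.

```python
import math


def percentile(samples, q):
    """Percentile by linear interpolation between the two nearest sorted samples."""
    ordered = sorted(samples)
    rank = (len(ordered) - 1) * q / 100.0   # fractional index into the sorted list
    lower = math.floor(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


# Latencies in seconds, copied from the second (fully healthy) batch in the log.
latencies = [
    1.7945740222930908,
    3.132450819015503,
    2.3526368141174316,
    1.793929100036621,
    3.26767635345459,
]

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

The same arithmetic applied to the first batch suggests that the failed request was included in the statistics at roughly 20.1 s, just past the 20 s read timeout, which would explain why that batch's mean and upper percentiles are so much higher.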
Pipeline stage StressChecker completed in 46.54s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.62s
Shutdown handler de-registered
function_rohem_2025-09-08 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2273.19s
Shutdown handler de-registered
function_rohem_2025-09-08 status is now inactive due to auto deactivation removed underperforming models
function_rohem_2025-09-08 status is now torndown due to DeploymentManager action