developer_uid: chai_backend_admin
submission_id: function_higan_2025-05-31
model_name: function_higan_2025-05-31
model_group:
status: torndown
timestamp: 2025-05-31T20:51:02+00:00
num_battles: 5777
num_wins: 2933
celo_rating: 1298.34
family_friendly_score: 0.45940000000000003
family_friendly_standard_error: 0.00704771792852126
submission_type: function
display_name: function_higan_2025-05-31
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-05-31
win_ratio: 0.5077029600138481
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
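The `formatter` entry above is a set of string templates. A minimal sketch of how they could be joined into a single prompt follows. The assembly order (memory, prompt, conversation turns, response header), the `build_prompt` helper, and the sample persona and messages are assumptions for illustration, not taken from the record; truncation to `max_input_tokens` is not modeled.

```python
# Templates copied verbatim from the `formatter` entry in this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name):
    """Join the templates into one prompt string (hypothetical helper).

    `turns` is a list of (speaker, message, is_bot) tuples in chat order.
    The ordering of the sections is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message, is_bot in turns:
        if is_bot:
            parts.append(
                FORMATTER["bot_template"].format(bot_name=speaker, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=speaker, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Hypothetical persona and messages, for illustration only.
example = build_prompt(
    memory="Higan is a friendly guide.",
    prompt="A casual chat.",
    turns=[("User", "Hi there", False), ("Higan", "Hello!", True)],
    bot_name="Higan",
)
print(example)
```

Because `stopping_words` is `['\n']` in `generation_params`, a prompt that ends with the bot's name and a colon would yield a single-line reply.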
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 3.347954034805298s
Received healthy response to inference request in 3.2892696857452393s
Received healthy response to inference request in 3.6880791187286377s
Received healthy response to inference request in 3.338348865509033s
5 requests
1 failed requests
5th percentile: 3.299085521697998
10th percentile: 3.3089013576507567
20th percentile: 3.3285330295562745
30th percentile: 3.340269899368286
40th percentile: 3.344111967086792
50th percentile: 3.347954034805298
60th percentile: 3.4840040683746336
70th percentile: 3.62005410194397
80th percentile: 6.973165798187258
90th percentile: 13.543339157104494
95th percentile: 16.828425836563106
99th percentile: 19.456495180130005
mean time: 6.755432844161987
%s, retrying in %s seconds...
Received healthy response to inference request in 3.40316104888916s
Received healthy response to inference request in 3.2798779010772705s
Received healthy response to inference request in 3.45125412940979s
Received healthy response to inference request in 3.3091349601745605s
Received healthy response to inference request in 3.428107500076294s
5 requests
0 failed requests
5th percentile: 3.2857293128967284
10th percentile: 3.2915807247161863
20th percentile: 3.3032835483551026
30th percentile: 3.3279401779174806
40th percentile: 3.3655506134033204
50th percentile: 3.40316104888916
60th percentile: 3.4131396293640135
70th percentile: 3.4231182098388673
80th percentile: 3.4327368259429933
90th percentile: 3.4419954776763917
95th percentile: 3.4466248035430906
99th percentile: 3.4503282642364503
mean time: 3.374307107925415
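The log does not say how the percentiles are computed. The figures in the second StressChecker run are consistent with linear interpolation between sorted samples, which is also the default method of NumPy's `percentile`. The sketch below reproduces them from the five healthy latencies using only the standard library; the `percentile` helper is an illustration, not the pipeline's own code.

```python
def percentile(values, q):
    """Percentile q (0-100) by linear interpolation between order statistics."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0   # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


# The five healthy response times from the second run, in seconds.
latencies = [
    3.40316104888916,
    3.2798779010772705,
    3.45125412940979,
    3.3091349601745605,
    3.428107500076294,
]

mean_time = sum(latencies) / len(latencies)

for q in (5, 50, 90, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only five samples, every percentile above the 75th is an interpolation between the two slowest requests, which is why a single timeout in the first run pulls the 80th through 99th percentiles toward 20 seconds.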
Pipeline stage StressChecker completed in 52.83s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.65s
Shutdown handler de-registered
function_higan_2025-05-31 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3200.45s
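The record reports `family_friendly_score` together with `family_friendly_standard_error`, but not the number of samples scored. The sketch below checks the two values against the usual standard error of a proportion, sqrt(p(1-p)/n). The sample size of 5000 is an assumption chosen because it reproduces the reported figure; it does not appear anywhere in the log.

```python
import math


def binomial_standard_error(p, n):
    """Standard error of a proportion p estimated from n independent samples."""
    return math.sqrt(p * (1.0 - p) / n)


score = 0.4594                      # family_friendly_score, rounded
reported_se = 0.00704771792852126   # family_friendly_standard_error
assumed_n = 5000                    # assumption: not stated in the record

estimated_se = binomial_standard_error(score, assumed_n)
print(f"estimated: {estimated_se}")
print(f"reported:  {reported_se}")
```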
Shutdown handler de-registered
function_higan_2025-05-31 status is now inactive due to auto deactivation removed underperforming models
function_higan_2025-05-31 status is now torndown due to DeploymentManager action