developer_uid: chai_backend_admin
submission_id: function_hesul_2025-06-29
model_name: function_hesul_2025-06-29
model_group:
status: torndown
timestamp: 2025-06-29T23:42:26+00:00
num_battles: 7926
num_wins: 5372
celo_rating: 1291.35
family_friendly_score: 0.5369999999999999
family_friendly_standard_error: 0.007051680650738517
submission_type: function
display_name: function_hesul_2025-06-29
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-06-29
win_ratio: 0.6777693666414333
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
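The formatter templates above can be read as the pieces of one prompt string. The following is a minimal sketch of how they might be combined, not the platform's actual implementation: the assembly order (memory, prompt, conversation turns, response header), the `build_prompt` helper, and the sample names and messages are all assumptions made for illustration. Only the template strings come from the record.

```python
# Template strings copied from the `formatter` field of the record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues
    # the bot's next line; the '\n' stopping word then ends the reply.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    # Sample values are invented for illustration only.
    print(build_prompt("Be kind.", "A chat.", [("user", "Hi")], "Bot", "Ann"))
```

Because `stopping_words` is `['\n']` in `generation_params`, a prompt that ends with the bot's name and a colon yields a single-line reply, which is consistent with the 64-token output limit.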
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.907118558883667s
Received healthy response to inference request in 3.4303476810455322s
Received healthy response to inference request in 4.413388013839722s
Received healthy response to inference request in 2.233250379562378s
Received healthy response to inference request in 3.735090732574463s
5 requests
0 failed requests
5th percentile: 2.3680240154266357
10th percentile: 2.5027976512908934
20th percentile: 2.7723449230194093
30th percentile: 3.01176438331604
40th percentile: 3.221056032180786
50th percentile: 3.4303476810455322
60th percentile: 3.5522449016571045
70th percentile: 3.6741421222686768
80th percentile: 3.8707501888275146
90th percentile: 4.142069101333618
95th percentile: 4.27772855758667
99th percentile: 4.386256122589112
mean time: 3.3438390731811523
Pipeline stage StressChecker completed in 18.09s
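The StressChecker percentiles can be reproduced from the five logged response times. The sketch below assumes linear interpolation between the sorted samples (the default behaviour of common numerical libraries); the log does not state which method the pipeline uses, so this is an inference. The `percentile` helper is hypothetical.

```python
# The five response times reported by StressChecker, in seconds.
latencies = [
    2.907118558883667,
    3.4303476810455322,
    4.413388013839722,
    2.233250379562378,
    3.735090732574463,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    # Fractional position of the percentile within the sorted samples.
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples, the upper percentiles are interpolated between the two slowest requests, so the 95th and 99th percentile values say little about tail latency under real load.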
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.95s
Shutdown handler de-registered
function_hesul_2025-06-29 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3422.54s
Shutdown handler de-registered
function_hesul_2025-06-29 status is now inactive due to auto deactivation removed underperforming models
function_hesul_2025-06-29 status is now torndown due to DeploymentManager action