developer_uid: chai_backend_admin
submission_id: function_bepor_2025-05-31
model_name: function_bepor_2025-05-31
model_group:
status: torndown
timestamp: 2025-05-31T20:52:16+00:00
num_battles: 5674
num_wins: 2805
celo_rating: 1290.74
family_friendly_score: 0.5798
family_friendly_standard_error: 0.006980429213164474
submission_type: function
display_name: function_bepor_2025-05-31
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-05-31
win_ratio: 0.4943602396898132
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
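The formatter templates above can be read as a recipe for building a single model input. The following is a minimal sketch of that assembly, not the platform's actual code. The function name `build_prompt`, the ordering of sections (memory, then prompt, then chat turns, then the response header), and the sample persona and messages are all assumptions made for illustration. Truncation to `max_input_tokens` is not modelled.

```python
# Templates copied verbatim from the formatter field in this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Assemble one model input (hypothetical helper; section order is assumed)."""
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    # Each turn is a (speaker, message) pair, where speaker is "bot" or "user".
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "<bot_name>:" so the model continues as the bot.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Made-up sample conversation, for illustration only.
example = build_prompt(
    memory="Ava is a cheerful guide.",
    prompt="Ava greets a visitor.",
    turns=[("user", "Hi there"), ("bot", "Welcome!"), ("user", "What is this place?")],
    bot_name="Ava",
    user_name="Sam",
)
print(example)
```

Because `stopping_words` is `['\n']` in the generation parameters, a completion that follows this input would end at the first newline, so each reply is a single line.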
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.4117369651794434s
Received healthy response to inference request in 3.0715713500976562s
Received healthy response to inference request in 2.5537054538726807s
Received healthy response to inference request in 2.314103603363037s
Received healthy response to inference request in 3.314279317855835s
5 requests
0 failed requests
5th percentile: 2.3336302757263185
10th percentile: 2.3531569480895995
20th percentile: 2.392210292816162
30th percentile: 2.440130662918091
40th percentile: 2.496918058395386
50th percentile: 2.5537054538726807
60th percentile: 2.760851812362671
70th percentile: 2.967998170852661
80th percentile: 3.120112943649292
90th percentile: 3.2171961307525634
95th percentile: 3.2657377243041994
99th percentile: 3.3045709991455077
mean time: 2.7330793380737304
Pipeline stage StressChecker completed in 14.82s
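The StressChecker percentiles and mean above can be reproduced from the five logged latencies. The sketch below uses linear interpolation between order statistics, which is also the default behaviour of `numpy.percentile`. The log does not say which routine the pipeline actually uses, so treat this as a consistency check; the helper name `percentile` is hypothetical.

```python
import math

# Latencies in seconds, copied from the five "healthy response" log lines.
latencies = [
    2.4117369651794434,
    3.0715713500976562,
    2.5537054538726807,
    2.314103603363037,
    3.314279317855835,
]


def percentile(values, q):
    """Percentile q (0..100) using linear interpolation between sorted values."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")

mean_time = sum(latencies) / len(latencies)
print(f"mean time: {mean_time}")
```

With only five samples, the tail percentiles are interpolations between the two slowest requests rather than independent measurements.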
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.94s
Shutdown handler de-registered
function_bepor_2025-05-31 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2565.24s
Shutdown handler de-registered
function_bepor_2025-05-31 status is now inactive due to auto deactivation removed underperforming models
function_bepor_2025-05-31 status is now torndown due to DeploymentManager action
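The summary fields at the top of this record can be cross-checked with a few lines of arithmetic. This sketch recomputes `win_ratio` from `num_wins` and `num_battles`. It also shows that the reported `family_friendly_standard_error` matches a binomial standard error with a sample size of 5000. That sample size is inferred from the numbers, not stated anywhere in the record, so it is an assumption.

```python
import math

# Values copied from the metadata fields of this record.
num_battles = 5674
num_wins = 2805
reported_win_ratio = 0.4943602396898132
family_friendly_score = 0.5798
reported_standard_error = 0.006980429213164474

# win_ratio is simply wins divided by battles.
win_ratio = num_wins / num_battles
print(f"win_ratio: {win_ratio}")

# Assumption: the score is a proportion over n independent samples, so its
# standard error is sqrt(p * (1 - p) / n). n = 5000 is inferred, not documented.
assumed_samples = 5000
binomial_se = math.sqrt(
    family_friendly_score * (1 - family_friendly_score) / assumed_samples
)
print(f"binomial standard error (n={assumed_samples}): {binomial_se}")
```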