developer_uid: chai_backend_admin
submission_id: function_pebub_2025-07-09
model_name: function_pebub_2025-07-09
model_group:
status: torndown
timestamp: 2025-07-09T23:40:49+00:00
num_battles: 6772
num_wins: 3479
celo_rating: 1295.72
family_friendly_score: 0.5408
family_friendly_standard_error: 0.007047486927976525
submission_type: function
display_name: function_pebub_2025-07-09
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-07-09
win_ratio: 0.5137330183106911
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
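The formatter templates above can be illustrated with a minimal sketch of how a prompt might be assembled from them. The concatenation order (memory, prompt, chat turns, response header), the `build_prompt` helper, and the example names are assumptions made for illustration; the record only lists the templates. Truncation to `max_input_tokens` is not modelled here.

```python
# Template strings copied from the formatter field of this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues
    # the bot's next line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    memory="A friendly guide.",
    prompt="A chat in a library.",
    turns=[("user", "Hi there"), ("bot", "Welcome!")],
    bot_name="Guide",
    user_name="Ann",
)
print(example)
```

Because `stopping_words` is `['\n']`, a completion generated after the response header would end at the first newline, which is consistent with the single-line `bot_template`.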
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2857561111450195s
Received healthy response to inference request in 2.8243441581726074s
Received healthy response to inference request in 2.8223071098327637s
Received healthy response to inference request in 3.347593307495117s
Received healthy response to inference request in 3.3766279220581055s
5 requests
0 failed requests
5th percentile: 2.3930663108825683
10th percentile: 2.500376510620117
20th percentile: 2.714996910095215
30th percentile: 2.8227145195007326
40th percentile: 2.82352933883667
50th percentile: 2.8243441581726074
60th percentile: 3.0336438179016114
70th percentile: 3.242943477630615
80th percentile: 3.353400230407715
90th percentile: 3.36501407623291
95th percentile: 3.370820999145508
99th percentile: 3.375466537475586
mean time: 2.9313257217407225
Pipeline stage StressChecker completed in 16.09s
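The StressChecker percentiles and mean above are consistent with linear interpolation between the sorted samples. The sketch below recomputes them from the five logged response times; the `percentile` helper is a hypothetical stand-in, since the log does not say which library or method the stage actually uses.

```python
# Response times (seconds) copied from the five healthy responses in the log.
latencies = [
    2.2857561111450195,
    2.8243441581726074,
    2.8223071098327637,
    3.347593307495117,
    3.3766279220581055,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only five samples, every percentile above the 75th is an interpolation between the two slowest requests, so the tail figures say little about true tail latency.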
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.74s
Shutdown handler de-registered
function_pebub_2025-07-09 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4020.40s
Shutdown handler de-registered
function_pebub_2025-07-09 status is now inactive due to auto deactivation removed underperforming models
function_pebub_2025-07-09 status is now torndown due to DeploymentManager action
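The summary fields at the top of this record can be cross-checked with a short calculation. The win ratio is num_wins divided by num_battles. The reported family_friendly_standard_error matches a binomial standard error if the score was estimated from 5000 samples; that sample size is an inference from the numbers and is not stated anywhere in the record.

```python
import math

# Values copied from the metadata fields of this record.
num_battles = 6772
num_wins = 3479
family_friendly_score = 0.5408

win_ratio = num_wins / num_battles


def binomial_standard_error(p, n):
    """Standard error of a proportion p estimated from n independent samples."""
    return math.sqrt(p * (1.0 - p) / n)


# ASSUMPTION: 5000 samples. Inferred from the reported standard error,
# not given in the record.
assumed_sample_size = 5000
standard_error = binomial_standard_error(family_friendly_score, assumed_sample_size)

print(f"win_ratio: {win_ratio}")
print(f"standard error (n={assumed_sample_size}): {standard_error}")
```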