developer_uid: chai_backend_admin
submission_id: function_ruhik_2025-12-05
model_name: function_ruhik_2025-12-05
model_group:
status: torndown
timestamp: 2025-12-12T18:30:05+00:00
num_battles: 19176
num_wins: 11480
celo_rating: 1362.58
family_friendly_score: 0.54
family_friendly_standard_error: 0.0070484040746824385
submission_type: function
display_name: function_ruhik_2025-12-05
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-05
win_ratio: 0.5986649979140592
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
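The win_ratio field above appears to be num_wins divided by num_battles. A minimal check, assuming that definition (the source does not state it explicitly):

```python
# Values copied from the num_battles and num_wins fields above.
num_battles = 19176
num_wins = 11480

# Assumed definition: fraction of battles won.
win_ratio = num_wins / num_battles
print(win_ratio)
```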
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.660231351852417s
Received healthy response to inference request in 3.463216543197632s
Received healthy response to inference request in 3.372649669647217s
Received healthy response to inference request in 3.478891372680664s
Received healthy response to inference request in 3.538130760192871s
Received healthy response to inference request in 0.3364987373352051s
Received healthy response to inference request in 3.120577812194824s
Received healthy response to inference request in 3.748110055923462s
Received healthy response to inference request in 0.5217995643615723s
Received healthy response to inference request in 0.47762274742126465s
10 requests
0 failed requests
5th percentile: 0.40000454187393186
10th percentile: 0.4635103464126587
20th percentile: 0.5129642009735107
30th percentile: 2.340944337844848
40th percentile: 3.2718209266662597
50th percentile: 3.4179331064224243
60th percentile: 3.4694864749908447
70th percentile: 3.4966631889343263
80th percentile: 3.5801266193389893
90th percentile: 3.839322185516357
95th percentile: 4.249776768684386
99th percentile: 4.5781404352188115
mean time: 2.671772861480713
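The StressChecker summary above can be reproduced from the ten logged latencies. The sketch below assumes linearly interpolated percentiles (the same convention as NumPy's default), which matches the reported figures. The helper name `percentile` is illustrative and not taken from the pipeline.

```python
import math

# Latencies in seconds, copied from the ten "Received healthy response" lines.
latencies = [
    4.660231351852417, 3.463216543197632, 3.372649669647217,
    3.478891372680664, 3.538130760192871, 0.3364987373352051,
    3.120577812194824, 3.748110055923462, 0.5217995643615723,
    0.47762274742126465,
]

def percentile(values, q):
    """Linearly interpolated percentile for q in [0, 100] (assumed convention)."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac

mean_time = sum(latencies) / len(latencies)
p5 = percentile(latencies, 5)
p50 = percentile(latencies, 50)
p90 = percentile(latencies, 90)
print(f"mean={mean_time:.4f} p5={p5:.4f} p50={p50:.4f} p90={p90:.4f}")
```

The three sub-second responses pull the mean (about 2.67s) well below the median (about 3.42s), so the percentiles describe the latency distribution better than the mean alone.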
Pipeline stage StressChecker completed in 29.17s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.95s
Shutdown handler de-registered
function_ruhik_2025-12-05 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 3951.81s
Shutdown handler de-registered
function_ruhik_2025-12-05 status is now inactive due to auto deactivation removed underperforming models
function_ruhik_2025-12-05 status is now torndown due to DeploymentManager action