developer_uid: NischayDnk
submission_id: function_buham_2025-08-07
model_name: function_buham_2025-08-07
model_group:
status: torndown
timestamp: 2025-08-07T17:42:36+00:00
num_battles: 5621
num_wins: 2918
celo_rating: 1274.98
family_friendly_score: 0.523
family_friendly_standard_error: 0.007063582660378514
submission_type: function
display_name: function_buham_2025-08-07
is_internal_developer: False
ranking_group: single
us_pacific_date: 2025-08-07
win_ratio: 0.5191247109055328
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.0593924522399902s
Received healthy response to inference request in 0.8688809871673584s
Received healthy response to inference request in 0.7784130573272705s
Received healthy response to inference request in 1.0759320259094238s
Received healthy response to inference request in 0.9868392944335938s
5 requests
0 failed requests
5th percentile: 0.7965066432952881
10th percentile: 0.8146002292633057
20th percentile: 0.8507874011993408
30th percentile: 0.8924726486206055
40th percentile: 0.9396559715270996
50th percentile: 0.9868392944335938
60th percentile: 1.0158605575561523
70th percentile: 1.044881820678711
80th percentile: 1.062700366973877
90th percentile: 1.0693161964416504
95th percentile: 1.072624111175537
99th percentile: 1.0752704429626465
mean time: 0.9538915634155274
Pipeline stage StressChecker completed in 5.90s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.00s
Shutdown handler de-registered
function_buham_2025-08-07 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 1827.99s
Shutdown handler de-registered
function_buham_2025-08-07 status is now inactive due to auto deactivation removed underperforming models
function_buham_2025-08-07 status is now torndown due to DeploymentManager action