developer_uid: NischayDnk
submission_id: function_redut_2025-08-19
model_name: function_redut_2025-08-19
model_group:
status: torndown
timestamp: 2025-08-19T20:18:44+00:00
num_battles: 6316
num_wins: 3347
celo_rating: 1271.49
family_friendly_score: 0.5469999999999999
family_friendly_standard_error: 0.007039758518585705
submission_type: function
display_name: function_redut_2025-08-19
is_internal_developer: False
ranking_group: single
us_pacific_date: 2025-08-19
win_ratio: 0.5299240025332489
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
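The derived fields in the metadata above can be cross-checked against the raw counts. The sketch below recomputes `win_ratio` from `num_wins` and `num_battles`. It then tests an assumption the record does not state: that `family_friendly_standard_error` is the usual binomial standard error sqrt(p(1-p)/n). Under that assumption the reported values imply a sample of about 5000 evaluations. The helper name `binomial_standard_error` is illustrative only and is not part of the platform.

```python
import math

# Values copied from the metadata fields above.
num_battles = 6316
num_wins = 3347
ff_score = 0.5469999999999999
ff_stderr = 0.007039758518585705

# win_ratio is wins divided by battles.
win_ratio = num_wins / num_battles


def binomial_standard_error(p: float, n: int) -> float:
    """Standard error of a proportion p estimated from n samples.

    ASSUMPTION: the record does not say how its standard error was
    computed; the binomial formula is used here as a plausible guess.
    """
    return math.sqrt(p * (1.0 - p) / n)


# Invert the formula to see what sample size the reported values imply.
implied_n = ff_score * (1.0 - ff_score) / ff_stderr ** 2

print(f"win_ratio = {win_ratio:.6f}")
print(f"implied family-friendly sample size = {implied_n:.1f}")
```

If the implied sample size were far from a whole number, the binomial assumption would be suspect. Here it is consistent with the reported figures.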
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.12638258934021s
Received healthy response to inference request in 2.83561110496521s
Received healthy response to inference request in 2.88645076751709s
Received healthy response to inference request in 3.0036118030548096s
Received healthy response to inference request in 0.34192585945129395s
5 requests
0 failed requests
5th percentile: 0.8406629085540771
10th percentile: 1.3393999576568603
20th percentile: 2.336874055862427
30th percentile: 2.845779037475586
40th percentile: 2.8661149024963377
50th percentile: 2.88645076751709
60th percentile: 2.9333151817321776
70th percentile: 2.9801795959472654
80th percentile: 3.0281659603118896
90th percentile: 3.07727427482605
95th percentile: 3.10182843208313
99th percentile: 3.121471757888794
mean time: 2.4387964248657226
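The percentile and mean figures above match linear interpolation between order statistics applied to the five logged response times. The following minimal sketch reproduces them with the standard library only. The function name `percentile` and the interpolation rule are assumptions inferred from the numbers, not taken from the StressChecker source.

```python
from statistics import mean


def percentile(sorted_values, q):
    """Percentile q (0..100) by linear interpolation over pre-sorted values."""
    if not sorted_values:
        raise ValueError("need at least one value")
    # Fractional position of the requested percentile in the sorted list.
    rank = (len(sorted_values) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = rank - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * frac


# Response times (seconds) copied from the five healthy responses logged above.
latencies = [
    3.12638258934021,
    2.83561110496521,
    2.88645076751709,
    3.0036118030548096,
    0.34192585945129395,
]

ordered = sorted(latencies)
summary = {q: percentile(ordered, q) for q in (5, 50, 95, 99)}
mean_latency = mean(latencies)

for q, value in summary.items():
    print(f"{q}th percentile: {value}")
print(f"mean time: {mean_latency}")
```

With only five samples, the low percentiles are dominated by the single fast response (about 0.34 s), which is why the 5th percentile sits well below the median.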
Pipeline stage StressChecker completed in 13.20s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.84s
Shutdown handler de-registered
function_redut_2025-08-19 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3059.69s
Shutdown handler de-registered
function_redut_2025-08-19 status is now inactive due to auto deactivation (removed underperforming models)
function_redut_2025-08-19 status is now torndown due to DeploymentManager action