developer_uid: NischayDnk
submission_id: function_jareb_2025-08-03
model_name: function_jareb_2025-08-03
model_group:
status: torndown
timestamp: 2025-08-03T07:58:12+00:00
num_battles: 5825
num_wins: 3275
celo_rating: 1276.62
family_friendly_score: 0.538
family_friendly_standard_error: 0.007050616994277877
submission_type: function
display_name: function_jareb_2025-08-03
is_internal_developer: False
ranking_group: single
us_pacific_date: 2025-08-03
win_ratio: 0.5622317596566524
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
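A minimal sketch of how the `formatter` templates above might be assembled into a single prompt string. The concatenation order (memory, prompt, chat turns, response header), the `build_prompt` helper, and the example names and messages are assumptions for illustration; the record does not show the actual assembly code. Token truncation (`max_input_tokens`, `truncate_by_message`) is not modeled here.

```python
# Template strings copied from the formatter field of this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(formatter, memory, prompt, turns, bot_name):
    """Join the templates in an assumed order; hypothetical helper.

    turns is a list of (is_bot, speaker_name, message) tuples.
    """
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for is_bot, speaker_name, message in turns:
        if is_bot:
            parts.append(formatter["bot_template"].format(
                bot_name=speaker_name, message=message))
        else:
            parts.append(formatter["user_template"].format(
                user_name=speaker_name, message=message))
    # The response header ends with the bot name and a colon, so the model
    # continues the line until the '\n' stopping word is produced.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Hypothetical conversation, not taken from the record.
example_prompt = build_prompt(
    FORMATTER,
    memory="Aria is a friendly guide.",
    prompt="A chat in a library.",
    turns=[(False, "Sam", "Hello there"), (True, "Aria", "Welcome!")],
    bot_name="Aria",
)
print(example_prompt)
```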
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.434373378753662s
Received healthy response to inference request in 3.317476511001587s
Received healthy response to inference request in 2.628309488296509s
Received healthy response to inference request in 3.0832366943359375s
Received healthy response to inference request in 2.593538284301758s
5 requests
0 failed requests
5th percentile: 2.600492525100708
10th percentile: 2.6074467658996583
20th percentile: 2.6213552474975588
30th percentile: 2.7192949295043944
40th percentile: 2.901265811920166
50th percentile: 3.0832366943359375
60th percentile: 3.176932621002197
70th percentile: 3.270628547668457
80th percentile: 3.3408558845520018
90th percentile: 3.387614631652832
95th percentile: 3.4109940052032472
99th percentile: 3.4296975040435793
mean time: 3.0113868713378906
Pipeline stage StressChecker completed in 17.17s
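The StressChecker percentiles above appear consistent with linear interpolation between the sorted latencies (the same rule `numpy.percentile` uses by default). The sketch below recomputes a few of them from the five logged response times using only the standard library. That the pipeline itself uses this method is an assumption; the `percentile` helper is illustrative.

```python
# Response times (seconds) copied from the five "healthy response" log lines.
latencies = [
    3.434373378753662,
    3.317476511001587,
    2.628309488296509,
    3.0832366943359375,
    2.593538284301758,
]


def percentile(values, q):
    """Linearly interpolated percentile for q in [0, 100]; illustrative helper."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)
p5 = percentile(latencies, 5)
p50 = percentile(latencies, 50)
p99 = percentile(latencies, 99)

print(f"5th percentile: {p5}")
print(f"50th percentile: {p50}")
print(f"99th percentile: {p99}")
print(f"mean time: {mean_time}")
```

With only five samples, the tail percentiles are interpolations between the two slowest requests rather than independent measurements.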
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.66s
Shutdown handler de-registered
function_jareb_2025-08-03 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3251.60s
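The reported `family_friendly_standard_error` matches the binomial standard error sqrt(p(1 - p) / n) for p = 0.538 and n = 5000. The sample size of 5000 is inferred from the numbers and is not stated anywhere in this record, so treat it as an assumption; the helper name is also illustrative.

```python
import math


def binomial_standard_error(p, n):
    """Standard error of a proportion p estimated from n independent samples."""
    return math.sqrt(p * (1.0 - p) / n)


family_friendly_score = 0.538  # from the metadata above
assumed_sample_size = 5000     # assumption: inferred, not logged

standard_error = binomial_standard_error(family_friendly_score, assumed_sample_size)
print(standard_error)
```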
Shutdown handler de-registered
function_jareb_2025-08-03 status is now inactive due to auto deactivation removed underperforming models
function_jareb_2025-08-03 status is now torndown due to DeploymentManager action