developer_uid: NischayDnk
submission_id: function_jutet_2025-06-25
model_name: function_jutet_2025-06-25
model_group:
status: torndown
timestamp: 2025-06-25T22:06:14+00:00
num_battles: 8512
num_wins: 4098
celo_rating: 1274.95
family_friendly_score: 0.6204000000000001
family_friendly_standard_error: 0.006862999927145563
submission_type: function
display_name: function_jutet_2025-06-25
is_internal_developer: False
ranking_group: single
us_pacific_date: 2025-06-25
win_ratio: 0.481437969924812
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.0085108280181885s
Received healthy response to inference request in 2.5681464672088623s
Received healthy response to inference request in 2.49481201171875s
Received healthy response to inference request in 2.37994647026062s
Received healthy response to inference request in 1.9763517379760742s
5 requests
0 failed requests
5th percentile: 1.982783555984497
10th percentile: 1.98921537399292
20th percentile: 2.0020790100097656
30th percentile: 2.082797956466675
40th percentile: 2.2313722133636475
50th percentile: 2.37994647026062
60th percentile: 2.425892686843872
70th percentile: 2.471838903427124
80th percentile: 2.5094789028167725
90th percentile: 2.538812685012817
95th percentile: 2.5534795761108398
99th percentile: 2.5652130889892577
mean time: 2.285553503036499
Pipeline stage StressChecker completed in 12.48s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.64s
Shutdown handler de-registered
function_jutet_2025-06-25 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3498.56s
Shutdown handler de-registered
function_jutet_2025-06-25 status is now inactive due to auto deactivation removed underperforming models
function_jutet_2025-06-25 status is now torndown due to DeploymentManager action