developer_uid: NischayDnk
submission_id: function_pumil_2025-08-25
model_name: function_pumil_2025-08-25
model_group:
status: torndown
timestamp: 2025-08-25T06:34:01+00:00
num_battles: 6045
num_wins: 3350
celo_rating: 1286.19
family_friendly_score: 0.5134000000000001
family_friendly_standard_error: 0.00706852799386124
submission_type: function
display_name: function_pumil_2025-08-25
is_internal_developer: False
ranking_group: single
us_pacific_date: 2025-08-24
win_ratio: 0.554177005789909
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.2434279918670654s
Received healthy response to inference request in 2.552778720855713s
Received healthy response to inference request in 4.312110185623169s
Received healthy response to inference request in 3.2681310176849365s
Received healthy response to inference request in 0.995192289352417s
5 requests
0 failed requests
5th percentile: 1.3067095756530762
10th percentile: 1.6182268619537354
20th percentile: 2.241261434555054
30th percentile: 2.6909085750579833
40th percentile: 2.9671682834625246
50th percentile: 3.2434279918670654
60th percentile: 3.253309202194214
70th percentile: 3.263190412521362
80th percentile: 3.4769268512725833
90th percentile: 3.894518518447876
95th percentile: 4.103314352035523
99th percentile: 4.2703510189056395
mean time: 2.87432804107666
Pipeline stage StressChecker completed in 16.19s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.79s
Shutdown handler de-registered
function_pumil_2025-08-25 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3078.99s
Shutdown handler de-registered
function_pumil_2025-08-25 status is now inactive due to auto deactivation removed underperforming models
function_pumil_2025-08-25 status is now torndown due to DeploymentManager action