developer_uid: NischayDnk
submission_id: function_nefib_2025-08-25
model_name: function_nefib_2025-08-25
model_group:
status: torndown
timestamp: 2025-08-25T04:45:21+00:00
num_battles: 5674
num_wins: 3102
celo_rating: 1279.21
family_friendly_score: 0.512
family_friendly_standard_error: 0.007069031050999847
submission_type: function
display_name: function_nefib_2025-08-25
is_internal_developer: False
ranking_group: single
us_pacific_date: 2025-08-24
win_ratio: 0.5467042650687346
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.6093080043792725s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 3.73846697807312s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 3.18874192237854s
Received healthy response to inference request in 3.1926186084747314s
Received healthy response to inference request in 3.9150657653808594s
5 requests
0 failed requests
5th percentile: 1.925194787979126
10th percentile: 2.2410815715789796
20th percentile: 2.8728551387786867
30th percentile: 3.1895172595977783
40th percentile: 3.191067934036255
50th percentile: 3.1926186084747314
60th percentile: 3.410957956314087
70th percentile: 3.629297304153442
80th percentile: 3.773786735534668
90th percentile: 3.8444262504577638
95th percentile: 3.8797460079193113
99th percentile: 3.90800181388855
mean time: 3.1288402557373045
Pipeline stage StressChecker completed in 16.77s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.02s
Shutdown handler de-registered
function_nefib_2025-08-25 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2959.82s
Shutdown handler de-registered
function_nefib_2025-08-25 status is now inactive due to auto deactivation removed underperforming models
function_nefib_2025-08-25 status is now torndown due to DeploymentManager action