developer_uid: NischayDnk
submission_id: function_nunof_2025-07-01
model_name: function_nunof_2025-07-01
model_group:
status: torndown
timestamp: 2025-07-01T18:26:44+00:00
num_battles: 7932
num_wins: 4143
celo_rating: 1309.52
family_friendly_score: 0.5756
family_friendly_standard_error: 0.00698977310075227
submission_type: function
display_name: function_nunof_2025-07-01
is_internal_developer: False
ranking_group: single
us_pacific_date: 2025-07-01
win_ratio: 0.5223146747352496
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
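
The formatter entry above defines five string templates. As a minimal sketch of how they might combine into a single model input, the snippet below concatenates them in the order memory, prompt, chat turns, response header. That ordering, the `render_prompt` helper, and the `(speaker, message)` turn representation are assumptions made for illustration; the record itself only lists the templates.

```python
# Templates copied from the formatter entry in this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def render_prompt(memory, prompt, bot_name, user_name, turns):
    """Hypothetical helper: join the templates into one prompt string.

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The concatenation order is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues
    # the bot's next line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    print(render_prompt("M", "P", "Bot", "User", [("user", "hi"), ("bot", "hello")]))
```

With stopping_words set to ['\n'] in generation_params, a prompt ending in "{bot_name}:" would yield a single-line reply, which is consistent with the 64-token output cap.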
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.428431272506714s
Received healthy response to inference request in 4.239194393157959s
Received healthy response to inference request in 0.9524755477905273s
Received healthy response to inference request in 3.331843852996826s
Received healthy response to inference request in 0.5513544082641602s
5 requests
0 failed requests
5th percentile: 0.6315786361694335
10th percentile: 0.711802864074707
20th percentile: 0.872251319885254
30th percentile: 1.428349208831787
40th percentile: 2.3800965309143067
50th percentile: 3.331843852996826
60th percentile: 3.694784069061279
70th percentile: 4.057724285125732
80th percentile: 4.27704176902771
90th percentile: 4.352736520767212
95th percentile: 4.390583896636963
99th percentile: 4.420861797332764
mean time: 2.700659894943237
%s, retrying in %s seconds...
Received healthy response to inference request in 4.2877421379089355s
Received healthy response to inference request in 3.119704008102417s
Received healthy response to inference request in 2.7936198711395264s
Received healthy response to inference request in 2.1750965118408203s
Received healthy response to inference request in 1.2096710205078125s
5 requests
0 failed requests
5th percentile: 1.402756118774414
10th percentile: 1.5958412170410157
20th percentile: 1.9820114135742188
30th percentile: 2.2988011837005615
40th percentile: 2.546210527420044
50th percentile: 2.7936198711395264
60th percentile: 2.9240535259246827
70th percentile: 3.0544871807098386
80th percentile: 3.353311634063721
90th percentile: 3.8205268859863284
95th percentile: 4.054134511947631
99th percentile: 4.2410206127166745
mean time: 2.717166709899902
Pipeline stage StressChecker completed in 29.60s
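
The percentile and mean lines reported by StressChecker appear consistent with linear interpolation between sorted samples (the default method in NumPy's `percentile`). The sketch below recomputes them for the first batch of five latencies in pure Python. The `percentile` helper is illustrative, not the pipeline's own code, and the interpolation method is inferred from the logged numbers rather than stated in the log.

```python
import math

# Latencies (seconds) from the first batch of healthy responses in this log.
latencies = [
    4.428431272506714,
    4.239194393157959,
    0.9524755477905273,
    3.331843852996826,
    0.5513544082641602,
]


def percentile(samples, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(samples)
    # Fractional rank into the sorted list, e.g. q=5 with 5 samples -> 0.2.
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples per batch, the upper percentiles are interpolated between the two slowest requests, so they say little about tail latency under real load.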
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.66s
Shutdown handler de-registered
function_nunof_2025-07-01 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4588.82s
Shutdown handler de-registered
function_nunof_2025-07-01 status is now inactive due to auto deactivation removed underperforming models
function_nunof_2025-07-01 status is now torndown due to DeploymentManager action