developer_uid: jxlu90
submission_id: function_ginem_2024-12-18
model_name: nemo_anthropic_moonshine_avg1k
model_group:
status: inactive
timestamp: 2024-12-18T21:20:08+00:00
num_battles: 12868
num_wins: 6729
celo_rating: 1279.95
family_friendly_score: 0.5913999999999999
family_friendly_standard_error: 0.006951921173316049
submission_type: function
display_name: nemo_anthropic_moonshine_avg1k
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-12-18
win_ratio: 0.5229250854833696
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 100, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>', 'You:'], 'max_input_tokens': 2048, 'best_of': 4, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.796147108078003s
Received healthy response to inference request in 2.859071969985962s
Received healthy response to inference request in 2.581662654876709s
Received healthy response to inference request in 2.8008460998535156s
Received healthy response to inference request in 7.1667516231536865s
5 requests
0 failed requests
5th percentile: 2.6254993438720704
10th percentile: 2.669336032867432
20th percentile: 2.757009410858154
30th percentile: 2.812491273880005
40th percentile: 2.835781621932983
50th percentile: 2.859071969985962
60th percentile: 3.233902025222778
70th percentile: 3.6087320804595944
80th percentile: 4.4702680110931405
90th percentile: 5.818509817123413
95th percentile: 6.492630720138549
99th percentile: 7.031927442550659
mean time: 3.840895891189575
Pipeline stage StressChecker completed in 20.31s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.17s
Shutdown handler de-registered
function_ginem_2024-12-18 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3517.81s
Shutdown handler de-registered
function_ginem_2024-12-18 status is now inactive due to auto deactivation removed underperforming models