developer_uid: jxlu90
submission_id: function_gatul_2024-12-13
model_name: nemo70B_600_avgctx1k
model_group:
status: inactive
timestamp: 2024-12-13T01:49:38+00:00
num_battles: 8111
num_wins: 3711
celo_rating: 1234.01
family_friendly_score: 0.6222
family_friendly_standard_error: 0.006856634159702558
submission_type: function
display_name: nemo70B_600_avgctx1k
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-12-12
win_ratio: 0.4575268154358279
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 100, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>', 'You:'], 'max_input_tokens': 1024, 'best_of': 4, 'max_output_tokens': 68}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
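The formatter entry above lists string templates but not how they are combined. A minimal sketch of one plausible assembly follows, assuming the templates are filled with `str.format` and concatenated in the order persona, prompt, chat turns, response prefix. The function name `build_prompt`, the `(speaker, message)` turn format, and that ordering are hypothetical and not confirmed by this record. Truncation to `max_input_tokens` is not modelled.

```python
# Templates copied from the formatter entry in this record.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates into one model input string.

    `turns` is a list of (speaker, message) pairs where speaker is
    "bot" or "user". The concatenation order is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response prefix is left open so the model completes the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    print(build_prompt(
        bot_name="Ava",
        user_name="You",
        memory="A cheerful guide.",
        prompt="Ava greets a traveller.",
        turns=[("user", "Hi"), ("bot", "Hello!")],
    ))
```

Under this reading, the stopping words `'\n'` and `'You:'` in generation_params would end generation at the end of the bot's single line or at the start of the next user turn.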
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
Running pipeline stage StressChecker
Received healthy response to inference request in 6.643751859664917s
Received healthy response to inference request in 2.743422269821167s
Received healthy response to inference request in 3.489100694656372s
Received healthy response to inference request in 2.1200971603393555s
Received healthy response to inference request in 4.165843963623047s
5 requests
0 failed requests
5th percentile: 2.2447621822357178
10th percentile: 2.36942720413208
20th percentile: 2.6187572479248047
30th percentile: 2.892557954788208
40th percentile: 3.19082932472229
50th percentile: 3.489100694656372
60th percentile: 3.759798002243042
70th percentile: 4.030495309829711
80th percentile: 4.661425542831421
90th percentile: 5.6525887012481695
95th percentile: 6.148170280456543
99th percentile: 6.544635543823242
mean time: 3.832443189620972
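The percentile and mean figures above are consistent with linear interpolation between the sorted latency samples. A minimal sketch that recomputes them from the five logged response times of the first StressChecker run is shown below. The helper name `percentile` is hypothetical, and whether the pipeline itself uses this exact method is an assumption based only on the numbers matching.

```python
def percentile(samples, q):
    """Linearly interpolated percentile of `samples` for q in [0, 100]."""
    ordered = sorted(samples)
    if not ordered:
        raise ValueError("need at least one sample")
    # Fractional position of the requested percentile in the sorted list.
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


# Response times (seconds) logged for the first batch of five requests.
latencies = [
    6.643751859664917,
    2.743422269821167,
    3.489100694656372,
    2.1200971603393555,
    4.165843963623047,
]

if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples per batch, the upper percentiles are interpolations between the two slowest requests, so they indicate spread rather than a stable tail latency.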
%s, retrying in %s seconds...
Received healthy response to inference request in 5.113766193389893s
Received healthy response to inference request in 5.157989978790283s
Received healthy response to inference request in 3.084062337875366s
Received healthy response to inference request in 3.206469774246216s
Received healthy response to inference request in 2.732377767562866s
5 requests
0 failed requests
5th percentile: 2.8027146816253663
10th percentile: 2.8730515956878664
20th percentile: 3.013725423812866
30th percentile: 3.1085438251495363
40th percentile: 3.157506799697876
50th percentile: 3.206469774246216
60th percentile: 3.969388341903686
70th percentile: 4.7323069095611565
80th percentile: 5.122610950469971
90th percentile: 5.140300464630127
95th percentile: 5.149145221710205
99th percentile: 5.156221027374268
mean time: 3.858933210372925
%s, retrying in %s seconds...
Received healthy response to inference request in 2.9358973503112793s
Received healthy response to inference request in 3.445775032043457s
Received healthy response to inference request in 3.1125144958496094s
Received healthy response to inference request in 4.733310222625732s
Received healthy response to inference request in 2.421841621398926s
5 requests
0 failed requests
5th percentile: 2.5246527671813963
10th percentile: 2.6274639129638673
20th percentile: 2.8330862045288088
30th percentile: 2.971220779418945
40th percentile: 3.0418676376342773
50th percentile: 3.1125144958496094
60th percentile: 3.2458187103271485
70th percentile: 3.3791229248046872
80th percentile: 3.7032820701599123
90th percentile: 4.218296146392822
95th percentile: 4.475803184509277
99th percentile: 4.681808815002442
mean time: 3.3298677444458007
Pipeline stage StressChecker completed in 58.32s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 3.38s
Shutdown handler de-registered
function_gatul_2024-12-13 status is now deployed due to DeploymentManager action
Shutdown handler registered
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2510.74s
Shutdown handler de-registered
function_gatul_2024-12-13 status is now inactive due to auto deactivation removed underperforming models
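The summary fields in the header of this record can be cross-checked against each other. The sketch below recomputes win_ratio from num_wins and num_battles, and backs out the sample size implied by family_friendly_standard_error under the assumption that it is a binomial standard error, sqrt(p * (1 - p) / n). That formula is an assumption; the record does not state how the standard error was computed.

```python
# Values copied from the header of this record.
num_battles = 8111
num_wins = 3711
family_friendly_score = 0.6222
family_friendly_standard_error = 0.006856634159702558

# Win ratio is wins divided by battles.
win_ratio = num_wins / num_battles

# Assumption: SE = sqrt(p * (1 - p) / n), so n = p * (1 - p) / SE**2.
implied_n = (
    family_friendly_score
    * (1 - family_friendly_score)
    / family_friendly_standard_error ** 2
)

if __name__ == "__main__":
    print(f"win_ratio: {win_ratio}")
    print(f"implied family-friendly sample size: {implied_n:.1f}")
```

If the binomial assumption holds, the implied sample size comes out very close to a round number, which suggests the family-friendly score was estimated from a fixed-size evaluation set.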