developer_uid: jxlu90
submission_id: function_lemer_2024-12-18
model_name: gasol_newsubs_max2k_ctx1k
model_group:
status: inactive
timestamp: 2024-12-18T00:44:42+00:00
num_battles: 10315
num_wins: 5372
celo_rating: 1279.13
family_friendly_score: 0.5858
family_friendly_standard_error: 0.006966180589103328
submission_type: function
display_name: gasol_newsubs_max2k_ctx1k
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-12-17
win_ratio: 0.5207949587978672
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 100, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>', 'You:'], 'max_input_tokens': 2048, 'best_of': 4, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 6.941565752029419s
Received healthy response to inference request in 4.765521287918091s
Received healthy response to inference request in 3.481015682220459s
Received healthy response to inference request in 3.804917812347412s
Received healthy response to inference request in 5.926811933517456s
5 requests
0 failed requests
5th percentile: 3.54579610824585
10th percentile: 3.61057653427124
20th percentile: 3.7401373863220213
30th percentile: 3.9970385074615478
40th percentile: 4.3812798976898195
50th percentile: 4.765521287918091
60th percentile: 5.230037546157837
70th percentile: 5.694553804397583
80th percentile: 6.1297626972198485
90th percentile: 6.535664224624634
95th percentile: 6.7386149883270265
99th percentile: 6.90097559928894
mean time: 4.983966493606568
%s, retrying in %s seconds...
Received healthy response to inference request in 4.117892503738403s
Received healthy response to inference request in 2.884829044342041s
Received healthy response to inference request in 4.349330425262451s
Received healthy response to inference request in 2.1514484882354736s
Received healthy response to inference request in 6.0375049114227295s
5 requests
0 failed requests
5th percentile: 2.298124599456787
10th percentile: 2.4448007106781007
20th percentile: 2.7381529331207277
30th percentile: 3.1314417362213134
40th percentile: 3.6246671199798586
50th percentile: 4.117892503738403
60th percentile: 4.210467672348022
70th percentile: 4.303042840957642
80th percentile: 4.686965322494507
90th percentile: 5.362235116958618
95th percentile: 5.699870014190673
99th percentile: 5.969977931976318
mean time: 3.90820107460022
%s, retrying in %s seconds...
Received healthy response to inference request in 3.2493510246276855s
Received healthy response to inference request in 3.0217015743255615s
Received healthy response to inference request in 2.415072202682495s
Received healthy response to inference request in 3.745222806930542s
Received healthy response to inference request in 2.937136650085449s
5 requests
0 failed requests
5th percentile: 2.519485092163086
10th percentile: 2.623897981643677
20th percentile: 2.8327237606048583
30th percentile: 2.9540496349334715
40th percentile: 2.9878756046295165
50th percentile: 3.0217015743255615
60th percentile: 3.112761354446411
70th percentile: 3.203821134567261
80th percentile: 3.348525381088257
90th percentile: 3.5468740940093992
95th percentile: 3.6460484504699706
99th percentile: 3.7253879356384276
mean time: 3.0736968517303467
Pipeline stage StressChecker completed in 63.13s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.73s
Shutdown handler de-registered
function_lemer_2024-12-18 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2828.07s
Shutdown handler de-registered
function_lemer_2024-12-18 status is now inactive due to auto deactivation removed underperforming models