developer_uid: jxlu90
submission_id: function_kutot_2024-12-14
model_name: llama31_70B
model_group:
status: inactive
timestamp: 2024-12-14T19:54:58+00:00
num_battles: 13930
num_wins: 6845
celo_rating: 1257.45
family_friendly_score: 0.5998
family_friendly_standard_error: 0.006928779979188255
submission_type: function
display_name: llama31_70B
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-12-14
win_ratio: 0.4913854989231874
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 100, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>', 'You:'], 'max_input_tokens': 1024, 'best_of': 4, 'max_output_tokens': 68}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.1189119815826416s
Received healthy response to inference request in 3.1333673000335693s
Received healthy response to inference request in 4.0987389087677s
Received healthy response to inference request in 2.643683671951294s
Received healthy response to inference request in 2.768134117126465s
5 requests
0 failed requests
5th percentile: 2.668573760986328
10th percentile: 2.693463850021362
20th percentile: 2.7432440280914308
30th percentile: 2.8382896900177004
40th percentile: 2.978600835800171
50th percentile: 3.1189119815826416
60th percentile: 3.1246941089630127
70th percentile: 3.130476236343384
80th percentile: 3.3264416217803956
90th percentile: 3.712590265274048
95th percentile: 3.905664587020874
99th percentile: 4.060124044418335
mean time: 3.152567195892334
Pipeline stage StressChecker completed in 16.90s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.44s
Shutdown handler de-registered
function_kutot_2024-12-14 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2905.07s
Shutdown handler de-registered
function_kutot_2024-12-14 status is now inactive due to auto deactivation removed underperforming models