developer_uid: jxlu90
submission_id: function_molom_2024-12-27
model_name: llama33_70B_bo8
model_group:
status: torndown
timestamp: 2024-12-27T18:25:43+00:00
num_battles: 9713
num_wins: 4899
celo_rating: 1264.06
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: llama33_70B_bo8
is_internal_developer: False
ranking_group: single
us_pacific_date: 2024-12-27
win_ratio: 0.504375579120766
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 100, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>', 'You:'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
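The formatter entry above describes how a conversation is turned into a single prompt string. The following is a minimal sketch of one way those templates might be applied; the assembly order (memory, prompt, turns, response prefix) and the `build_prompt` helper are assumptions for illustration, not the platform's confirmed implementation, and token truncation (`max_input_tokens`, `truncate_by_message`) is deliberately left out.

```python
# Templates copied from the formatter entry in the metadata.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: concatenate the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs where speaker is
    either "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template leaves the bot's line open for the model to complete.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    bot_name="Ada",
    user_name="You",
    memory="A curious engineer.",
    prompt="Ada greets a visitor.",
    turns=[("user", "Hi"), ("bot", "Hello!")],
)
print(example)
```

Under this reading, the stopping words in generation_params ('\n', '</s>', 'You:') would end generation at the close of the bot's single line, which is consistent with the one-line-per-message templates.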
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2143430709838867s
Received healthy response to inference request in 3.3382136821746826s
Received healthy response to inference request in 5.067372798919678s
Received healthy response to inference request in 2.962068796157837s
Received healthy response to inference request in 2.625520706176758s
5 requests
0 failed requests
5th percentile: 2.296578598022461
10th percentile: 2.378814125061035
20th percentile: 2.5432851791381834
30th percentile: 2.6928303241729736
40th percentile: 2.8274495601654053
50th percentile: 2.962068796157837
60th percentile: 3.1125267505645753
70th percentile: 3.262984704971313
80th percentile: 3.684045505523682
90th percentile: 4.37570915222168
95th percentile: 4.7215409755706785
99th percentile: 4.9982064342498775
mean time: 3.2415038108825684
Pipeline stage StressChecker completed in 17.34s
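The percentile and mean figures reported by StressChecker can be reproduced from the five logged latencies. The sketch below assumes linear interpolation between order statistics; that choice matches the logged values, but the actual code used by the pipeline is not shown in this log, and the `percentile` helper is hypothetical.

```python
# Latencies (seconds) copied from the five "healthy response" log lines.
latencies = [
    2.2143430709838867,
    3.3382136821746826,
    5.067372798919678,
    2.962068796157837,
    2.625520706176758,
]


def percentile(values, q):
    """Hypothetical helper: q-th percentile with linear interpolation."""
    ordered = sorted(values)
    # Fractional position of the requested percentile in the sorted list.
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


mean_time = sum(latencies) / len(latencies)

for q in (5, 50, 90, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only five samples, the upper percentiles are dominated by the single 5.07 s request, so they are best read as a rough smoke-test signal rather than a stable latency estimate.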
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.61s
Shutdown handler de-registered
function_molom_2024-12-27 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Received signal 15, running shutdown handler
Shutdown handler de-registered
function_molom_2024-12-27 status is now inactive due to admin request
function_molom_2024-12-27 status is now torndown due to DeploymentManager action