developer_uid: jxlu90
submission_id: function_hohar_2024-12-27
model_name: llama_33_70B_bo8
model_group:
status: torndown
timestamp: 2024-12-27T23:58:45+00:00
num_battles: 18321
num_wins: 9171
celo_rating: 1258.67
family_friendly_score: 0.607
family_friendly_standard_error: 0.006907257053273753
submission_type: function
display_name: llama_33_70B_bo8
is_internal_developer: False
ranking_group: single
us_pacific_date: 2024-12-27
win_ratio: 0.5005731128213525
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 100, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', 'You:', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.3587894439697266s
Received healthy response to inference request in 2.9368133544921875s
Received healthy response to inference request in 2.586027145385742s
Received healthy response to inference request in 2.6030194759368896s
Received healthy response to inference request in 4.008300542831421s
5 requests
0 failed requests
5th percentile: 2.5894256114959715
10th percentile: 2.5928240776062013
20th percentile: 2.5996210098266603
30th percentile: 2.669778251647949
40th percentile: 2.8032958030700685
50th percentile: 2.9368133544921875
60th percentile: 3.105603790283203
70th percentile: 3.2743942260742185
80th percentile: 3.4886916637420655
90th percentile: 3.748496103286743
95th percentile: 3.878398323059082
99th percentile: 3.982320098876953
mean time: 3.0985899925231934
Pipeline stage StressChecker completed in 16.60s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.65s
Shutdown handler de-registered
function_hohar_2024-12-27 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2390.72s
Shutdown handler de-registered
function_hohar_2024-12-27 status is now inactive due to auto deactivation removed underperforming models
function_hohar_2024-12-27 status is now torndown due to DeploymentManager action