developer_uid: jxlu90
submission_id: function_kirer_2024-12-31
model_name: llama_405b_bo4_ctx1k
model_group:
status: torndown
timestamp: 2024-12-31T15:51:52+00:00
num_battles: 6059
num_wins: 3048
celo_rating: 1271.4
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: llama_405b_bo4_ctx1k
is_internal_developer: False
ranking_group: single
us_pacific_date: 2024-12-31
win_ratio: 0.5030533091269186
generation_params: {'temperature': 0.9, 'top_p': 0.95, 'min_p': 0.0, 'top_k': 50, 'presence_penalty': 0.0, 'frequency_penalty': 0.1, 'stopping_words': ['You:', '</s>', '\n'], 'max_input_tokens': 1024, 'best_of': 4, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.2598934173583984s
Received healthy response to inference request in 2.175898313522339s
Received healthy response to inference request in 3.585965156555176s
Received healthy response to inference request in 4.607079029083252s
Received healthy response to inference request in 2.8585920333862305s
5 requests
0 failed requests
5th percentile: 2.312437057495117
10th percentile: 2.4489758014678955
20th percentile: 2.722053289413452
30th percentile: 2.938852310180664
40th percentile: 3.0993728637695312
50th percentile: 3.2598934173583984
60th percentile: 3.3903221130371093
70th percentile: 3.52075080871582
80th percentile: 3.790187931060791
90th percentile: 4.198633480072021
95th percentile: 4.402856254577636
99th percentile: 4.566234474182129
mean time: 3.297485589981079
Pipeline stage StressChecker completed in 17.55s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.71s
Shutdown handler de-registered
function_kirer_2024-12-31 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Received signal 15, running shutdown handler
Shutdown handler de-registered
function_kirer_2024-12-31 status is now inactive due to admin request
function_kirer_2024-12-31 status is now torndown due to DeploymentManager action
function_kirer_2024-12-31 status is now torndown due to DeploymentManager action
function_kirer_2024-12-31 status is now torndown due to DeploymentManager action