function_satet_2024-11-27

developer_uid: chai_backend_admin

submission_id: function_satet_2024-11-27

model_name: meta-405b

model_group:

status: inactive

timestamp: 2024-11-27T19:03:17+00:00

num_battles: 5228

num_wins: 2625

celo_rating: 1260.67

family_friendly_score: 0.5938

family_friendly_standard_error: 0.006945524602216884

submission_type: function

display_name: meta-405b

is_internal_developer: True

ranking_group: single

us_pacific_date: 2024-11-27

win_ratio: 0.5021040550879877

generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 100, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>', 'You:'], 'max_input_tokens': 1024, 'best_of': 4, 'max_output_tokens': 68}

formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}

Resubmit model

Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 2.9322519302368164s
Received healthy response to inference request in 2.8691346645355225s
Received healthy response to inference request in 2.71671462059021s
Received healthy response to inference request in 4.878159284591675s
Received healthy response to inference request in 4.929113388061523s
5 requests
0 failed requests
5th percentile: 2.7471986293792723
10th percentile: 2.777682638168335
20th percentile: 2.83865065574646
30th percentile: 2.881758117675781
40th percentile: 2.907005023956299
50th percentile: 2.9322519302368164
60th percentile: 3.7106148719787595
70th percentile: 4.488977813720703
80th percentile: 4.888350105285644
90th percentile: 4.908731746673584
95th percentile: 4.918922567367554
99th percentile: 4.92707522392273
mean time: 3.6650747776031496
%s, retrying in %s seconds...
Received healthy response to inference request in 4.498362302780151s
Received healthy response to inference request in 2.848271131515503s
Received healthy response to inference request in 2.9736876487731934s
Received healthy response to inference request in 3.2980706691741943s
Received healthy response to inference request in 3.398841142654419s
5 requests
0 failed requests
5th percentile: 2.873354434967041
10th percentile: 2.898437738418579
20th percentile: 2.9486043453216553
30th percentile: 3.0385642528533934
40th percentile: 3.168317461013794
50th percentile: 3.2980706691741943
60th percentile: 3.3383788585662844
70th percentile: 3.378687047958374
80th percentile: 3.6187453746795657
90th percentile: 4.058553838729859
95th percentile: 4.278458070755005
99th percentile: 4.454381456375122
mean time: 3.403446578979492
Pipeline stage StressChecker completed in 37.84s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.41s
Shutdown handler de-registered
function_satet_2024-11-27 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4575.33s
Shutdown handler de-registered
function_satet_2024-11-27 status is now inactive due to auto deactivation removed underperforming models