submission_id: function_lebum_2024-09-14
developer_uid: chai_backend_admin
alignment_samples: 32247
alignment_score: 0.5051145214728625
celo_rating: 1212.94
display_name: mixtral_with_ava_reward_175k_v1
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 50, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>', '<|user|>', '###'], 'max_input_tokens': 512, 'best_of': 4, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: mixtral_with_ava_reward_175k_v1
num_battles: 32227
num_wins: 15951
propriety_score: 0.7429054054054054
propriety_total_count: 2960.0
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-09-14T23:58:08+00:00
us_pacific_date: 2024-09-14
win_ratio: 0.49495764421137556
Download Preference Data
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.8258931636810303s
Received healthy response to inference request in 2.8395497798919678s
Received healthy response to inference request in 2.1385912895202637s
Received healthy response to inference request in 2.166435956954956s
Received healthy response to inference request in 2.1485135555267334s
5 requests
0 failed requests
5th percentile: 2.1405757427215577
10th percentile: 2.1425601959228517
20th percentile: 2.1465291023254394
30th percentile: 2.152098035812378
40th percentile: 2.159266996383667
50th percentile: 2.166435956954956
60th percentile: 2.4302188396453857
70th percentile: 2.6940017223358153
80th percentile: 2.8286244869232178
90th percentile: 2.8340871334075928
95th percentile: 2.8368184566497803
99th percentile: 2.8390035152435305
mean time: 2.4237967491149903
Pipeline stage StressChecker completed in 12.62s
Shutdown handler de-registered
function_lebum_2024-09-14 status is now deployed due to DeploymentManager action
function_lebum_2024-09-14 status is now inactive due to auto deactivation removed underperforming models