submission_id: function_tasof_2024-09-14
developer_uid: chai_backend_admin
alignment_samples: 32364
alignment_score: 0.4242196210953898
celo_rating: 1212.59
display_name: mixtral_with_ava_reward_250k_v1
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 50, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>', '<|user|>', '###'], 'max_input_tokens': 512, 'best_of': 4, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: mixtral_with_ava_reward_250k_v1
num_battles: 32348
num_wins: 15994
propriety_score: 0.7366104181951577
propriety_total_count: 2726.0
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-09-14T23:58:55+00:00
us_pacific_date: 2024-09-14
win_ratio: 0.4944355137875603
Download Preference Data
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.847228527069092s
Received healthy response to inference request in 3.3778300285339355s
Received healthy response to inference request in 2.8604626655578613s
Received healthy response to inference request in 3.368922710418701s
Received healthy response to inference request in 2.2481725215911865s
5 requests
0 failed requests
5th percentile: 2.3706305503845213
10th percentile: 2.4930885791778565
20th percentile: 2.7380046367645265
30th percentile: 2.962154674530029
40th percentile: 3.165538692474365
50th percentile: 3.368922710418701
60th percentile: 3.372485637664795
70th percentile: 3.3760485649108887
80th percentile: 3.471709728240967
90th percentile: 3.6594691276550293
95th percentile: 3.7533488273620605
99th percentile: 3.8284525871276855
mean time: 3.1405232906341554
Pipeline stage StressChecker completed in 16.24s
Shutdown handler de-registered
function_tasof_2024-09-14 status is now deployed due to DeploymentManager action
function_tasof_2024-09-14 status is now inactive due to auto deactivation removed underperforming models