submission_id: function_sesel_2024-09-14
developer_uid: chai_backend_admin
alignment_samples: 18255
alignment_score: -0.3047618180901028
celo_rating: 1258.91
display_name: dpo_with_ava_reward_100k_v1
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.95, 'top_p': 1.0, 'min_p': 0.08, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<|eot_id|>'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: dpo_with_ava_reward_100k_v1
num_battles: 18255
num_wins: 9126
propriety_score: 0.7389993972272453
propriety_total_count: 1659.0
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-09-14T21:04:04+00:00
us_pacific_date: 2024-09-14
win_ratio: 0.49991783073130647
Download Preference Data
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.736905574798584s
Received healthy response to inference request in 5.337608814239502s
Received healthy response to inference request in 3.721877336502075s
Received healthy response to inference request in 2.9928760528564453s
Received healthy response to inference request in 4.886986494064331s
5 requests
0 failed requests
5th percentile: 3.138676309585571
10th percentile: 3.284476566314697
20th percentile: 3.5760770797729493
30th percentile: 3.724882984161377
40th percentile: 3.7308942794799806
50th percentile: 3.736905574798584
60th percentile: 4.196937942504883
70th percentile: 4.6569703102111815
80th percentile: 4.977110958099365
90th percentile: 5.157359886169433
95th percentile: 5.247484350204468
99th percentile: 5.319583921432495
mean time: 4.135250854492187
Pipeline stage StressChecker completed in 21.77s
Shutdown handler de-registered
function_sesel_2024-09-14 status is now deployed due to DeploymentManager action
function_sesel_2024-09-14 status is now inactive due to auto deactivation removed underperforming models