submission_id: function_rapan_2024-09-14
developer_uid: chai_backend_admin
alignment_samples: 11697
alignment_score: -1.5201586589869092
celo_rating: 1265.93
display_name: dpo_with_ava_reward_100k_v1
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.95, 'top_p': 1.0, 'min_p': 0.08, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<|eot_id|>'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: dpo_with_ava_reward_100k_v1
num_battles: 11697
num_wins: 6040
propriety_score: 0.7477386934673367
propriety_total_count: 995.0
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-09-14T21:02:39+00:00
us_pacific_date: 2024-09-14
win_ratio: 0.5163717192442506
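The formatter field above describes how a conversation is flattened into a single prompt string. The following is a minimal sketch of one way those templates could be applied, not the platform's actual implementation. The function name `build_prompt`, the `turns` structure, and the sample values are hypothetical, and truncation (`max_input_tokens`, `truncate_by_message`) is not modeled.

```python
# Templates copied from the formatter field of this submission.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Assemble a prompt string (hypothetical helper, assumed ordering).

    turns is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template is left open so the model completes the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    # Sample values are illustrative only.
    print(build_prompt("Ava", "Sam", "friendly", "A casual chat.", [("user", "hi")]))
```

Because the prompt ends with the open `{bot_name}:` line and `'\n'` is listed in `stopping_words`, generation under this layout would be expected to stop after a single bot message.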
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.1109375953674316s
Received healthy response to inference request in 2.809687614440918s
Received healthy response to inference request in 2.848595380783081s
Received healthy response to inference request in 2.371626138687134s
Received healthy response to inference request in 3.687715768814087s
5 requests
0 failed requests
5th percentile: 2.4592384338378905
10th percentile: 2.5468507289886473
20th percentile: 2.722075319290161
30th percentile: 2.8174691677093504
40th percentile: 2.8330322742462157
50th percentile: 2.848595380783081
60th percentile: 2.953532266616821
70th percentile: 3.0584691524505616
80th percentile: 3.2262932300567626
90th percentile: 3.457004499435425
95th percentile: 3.572360134124756
99th percentile: 3.6646446418762206
mean time: 2.9657124996185305
Pipeline stage StressChecker completed in 15.43s
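The percentile and mean figures reported by the StressChecker stage are consistent with linear interpolation between sorted samples (the default method of `numpy.percentile`). The sketch below reproduces them from the five logged latencies in plain Python. The helper name `percentile` is hypothetical, and whether the pipeline itself computes the values this way is an assumption.

```python
# Latencies (seconds) copied from the five "healthy response" log lines.
LATENCIES = [
    3.1109375953674316,
    2.809687614440918,
    2.848595380783081,
    2.371626138687134,
    3.687715768814087,
]


def percentile(values, q):
    """Return the q-th percentile (0 to 100) using linear interpolation."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into ordered
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


def mean(values):
    """Arithmetic mean of the samples."""
    return sum(values) / len(values)


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(LATENCIES, q)}")
    print(f"mean time: {mean(LATENCIES)}")
```

With only five samples, the tail percentiles are interpolated between the two slowest requests, so they indicate a rough range rather than a measured tail latency.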
Shutdown handler de-registered
function_rapan_2024-09-14 status is now deployed due to DeploymentManager action
function_rapan_2024-09-14 status is now inactive due to auto deactivation removed underperforming models