submission_id: function_sumot_2024-08-27
developer_uid: chai_backend_admin
celo_rating: 1256.72
display_name: elo_alignment_corweave_baseline
family_friendly_score: 0.0
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.95, 'top_p': 1.0, 'min_p': 0.08, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<|eot_id|>'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: elo_alignment_corweave_baseline
num_battles: 9771
num_wins: 5153
ranking_group: single
status: torndown
submission_type: function
timestamp: 2024-08-27T00:24:13+00:00
us_pacific_date: 2024-08-26
win_ratio: 0.527376931736772
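The formatter field above defines five string templates. The record does not say how the serving code combines them, so the following is only a minimal sketch under an assumed ordering: memory, then prompt, then the conversation turns, then the response prefix. The function name `build_prompt`, the `turns` argument, and the sample values are hypothetical; truncation (`truncate_by_message`, `max_input_tokens`) is not modelled.

```python
# Templates copied from the formatter field of this record.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Assemble one model input string (hypothetical helper).

    turns is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The ordering of the pieces is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The input ends with the bot's name and a colon, so the model
    # continues with the bot's next message.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    print(build_prompt(
        bot_name="Ava",
        user_name="Sam",
        memory="A calm guide.",
        prompt="Ava greets Sam.",
        turns=[("user", "Hi"), ("bot", "Hello")],
    ))
```

Because `stopping_words` in generation_params contains `'\n'`, a prompt that ends in `{bot_name}:` would yield at most a single-line reply per request.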
Running pipeline stage StressChecker
Received healthy response to inference request in 3.375950574874878s
Received healthy response to inference request in 3.612034559249878s
Received healthy response to inference request in 2.718234062194824s
Received healthy response to inference request in 2.4123027324676514s
Received healthy response to inference request in 2.1559948921203613s
5 requests
0 failed requests
5th percentile: 2.2072564601898192
10th percentile: 2.258518028259277
20th percentile: 2.3610411643981934
30th percentile: 2.473488998413086
40th percentile: 2.595861530303955
50th percentile: 2.718234062194824
60th percentile: 2.9813206672668455
70th percentile: 3.2444072723388673
80th percentile: 3.423167371749878
90th percentile: 3.517600965499878
95th percentile: 3.564817762374878
99th percentile: 3.602591199874878
mean time: 2.8549033641815185
Pipeline stage StressChecker completed in 15.66s
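The percentile and mean figures reported by StressChecker are consistent with linear interpolation between the sorted response times of the five requests. The sketch below reproduces them from the logged latencies using only the standard library. The `percentile` helper is hypothetical, and the log does not state which routine the pipeline actually uses.

```python
import math

# Response times (seconds) copied from the five "healthy response" log lines.
LATENCIES = [
    3.375950574874878,
    3.612034559249878,
    2.718234062194824,
    2.4123027324676514,
    2.1559948921203613,
]


def percentile(values, q):
    """Return the q-th percentile (0..100) using linear interpolation
    between the two nearest sorted samples (hypothetical helper)."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into ordered
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def mean(values):
    """Arithmetic mean of the samples."""
    return sum(values) / len(values)


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(LATENCIES, q)}")
    print(f"mean time: {mean(LATENCIES)}")
```

With only five samples, the upper percentiles are interpolated between the two slowest requests, so they say little about tail latency.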
function_sumot_2024-08-27 status is now deployed due to DeploymentManager action
function_sumot_2024-08-27 status is now inactive due to auto deactivation (removed underperforming models)
function_sumot_2024-08-27 status is now torndown due to DeploymentManager action