submission_id: function_pupin_2024-08-27
developer_uid: chai_backend_admin
alignment_samples: 11185
alignment_score: -0.7647892609531753
celo_rating: 1227.29
display_name: elo_alignment_amd_quartz_fp8
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.95, 'top_p': 1.0, 'min_p': 0.08, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<|eot_id|>'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: elo_alignment_amd_quartz_fp8
num_battles: 11185
num_wins: 5402
propriety_score: 0.6982942430703625
propriety_total_count: 938.0
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-08-27T03:06:44+00:00
us_pacific_date: 2024-08-26
win_ratio: 0.4829682610639249
Download Preference Data
Resubmit model
Running pipeline stage StressChecker
Received healthy response to inference request in 1.3331544399261475s
Received healthy response to inference request in 2.3642704486846924s
Received healthy response to inference request in 2.772045373916626s
Received healthy response to inference request in 2.24149227142334s
Received healthy response to inference request in 1.4521207809448242s
5 requests
0 failed requests
5th percentile: 1.3569477081298829
10th percentile: 1.3807409763336183
20th percentile: 1.4283275127410888
30th percentile: 1.6099950790405273
40th percentile: 1.9257436752319337
50th percentile: 2.24149227142334
60th percentile: 2.2906035423278808
70th percentile: 2.3397148132324217
80th percentile: 2.445825433731079
90th percentile: 2.6089354038238524
95th percentile: 2.690490388870239
99th percentile: 2.755734376907349
mean time: 2.032616662979126
Pipeline stage StressChecker completed in 10.84s
function_pupin_2024-08-27 status is now deployed due to DeploymentManager action
function_pupin_2024-08-27 status is now inactive due to auto deactivation removed underperforming models

Usage Metrics

Latency Metrics