submission_id: function_hudim_2024-08-26
developer_uid: chai_backend_admin
alignment_samples: 10689
alignment_score: -1.1294833286556556
celo_rating: 1258.38
display_name: elo_alignment_tensorwave
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.95, 'top_p': 1.0, 'min_p': 0.08, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<|eot_id|>'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: elo_alignment_tensorwave
num_battles: 10689
num_wins: 5665
propriety_score: 0.7270742358078602
propriety_total_count: 916.0
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-08-26T20:36:49+00:00
us_pacific_date: 2024-08-26
win_ratio: 0.52998409579942
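The formatter entry above is a set of Python format strings. The following is a minimal sketch of how those templates could be combined into one model input. The assembly order (memory, then prompt, then conversation turns, then the response prefix) and the build_prompt helper are assumptions for illustration and are not confirmed by this record. Truncation (truncate_by_message, max_input_tokens) is not modeled.

```python
# Template strings copied from the `formatter` field of this submission.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(formatter, bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        formatter["memory_template"].format(bot_name=bot_name, memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response prefix has no trailing newline, so the model continues
    # the bot's line; generation then stops at the first "\n" stopping word.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    FORMATTER,
    bot_name="Ava",
    user_name="Sam",
    memory="friendly guide",
    prompt="A chat.",
    turns=[("user", "Hi"), ("bot", "Hello!")],
)
```

The names Ava and Sam and the sample conversation are made up for the example. Because '\n' is listed in stopping_words, each generated reply would be limited to a single line under this layout.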
Running pipeline stage StressChecker
Received healthy response to inference request in 1.3560187816619873s
Received healthy response to inference request in 2.1267569065093994s
Received healthy response to inference request in 2.736536741256714s
Received healthy response to inference request in 3.6751887798309326s
Received healthy response to inference request in 4.357394456863403s
5 requests
0 failed requests
5th percentile: 1.5101664066314697
10th percentile: 1.6643140316009521
20th percentile: 1.972609281539917
30th percentile: 2.2487128734588624
40th percentile: 2.492624807357788
50th percentile: 2.736536741256714
60th percentile: 3.111997556686401
70th percentile: 3.487458372116089
80th percentile: 3.811629915237427
90th percentile: 4.084512186050415
95th percentile: 4.220953321456909
99th percentile: 4.330106229782104
mean time: 2.850379133224487
Pipeline stage StressChecker completed in 14.90s
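The percentile and mean figures reported by StressChecker are consistent with linear interpolation between the five sorted response times. The sketch below recomputes them with a small standalone function. The percentile helper is written for illustration and is not the pipeline's own code, which this log does not show.

```python
# Response times in seconds, copied from the five healthy responses above.
latencies = [
    1.3560187816619873,
    2.1267569065093994,
    2.736536741256714,
    3.6751887798309326,
    4.357394456863403,
]


def percentile(samples, q):
    """Percentile q (0 to 100) using linear interpolation between ranks."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    # Fractional rank into the sorted list: 0 for q=0, len-1 for q=100.
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


p5 = percentile(latencies, 5)
p50 = percentile(latencies, 50)
p99 = percentile(latencies, 99)
mean_time = sum(latencies) / len(latencies)
```

With only five samples, every percentile other than 0, 25, 50, 75 and 100 is an interpolated value, so the tail figures (95th, 99th) mostly reflect the single slowest request.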
function_hudim_2024-08-26 status is now deployed due to DeploymentManager action
function_hudim_2024-08-26 status is now inactive due to auto deactivation removed underperforming models
