submission_id: function_haret_2024-08-17
developer_uid: chai_backend_admin
alignment_samples: 9247
alignment_score: 1.2071590372234378
celo_rating: 1214.43
display_name: gpt4-tl
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.1, 'top_k': 100, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', 'You:'], 'max_input_tokens': 512, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: gpt4-tl
num_battles: 9247
num_wins: 4390
propriety_score: 0.8159645232815964
propriety_total_count: 902.0
ranking_group: single
status: torndown
submission_type: function
timestamp: 2024-08-17T05:37:24+00:00
us_pacific_date: 2024-08-16
win_ratio: 0.4747485671028442
Download Preference Data
Resubmit model
Running pipeline stage StressChecker
Received healthy response to inference request in 2.801076889038086s
Received healthy response to inference request in 3.981886148452759s
Received healthy response to inference request in 2.0582222938537598s
Received healthy response to inference request in 2.440504312515259s
Received healthy response to inference request in 2.0806126594543457s
5 requests
0 failed requests
5th percentile: 2.062700366973877
10th percentile: 2.0671784400939943
20th percentile: 2.0761345863342284
30th percentile: 2.1525909900665283
40th percentile: 2.2965476512908936
50th percentile: 2.440504312515259
60th percentile: 2.5847333431243897
70th percentile: 2.7289623737335202
80th percentile: 3.037238740921021
90th percentile: 3.50956244468689
95th percentile: 3.745724296569824
99th percentile: 3.934653778076172
mean time: 2.6724604606628417
Pipeline stage StressChecker completed in 13.95s
function_haret_2024-08-17 status is now deployed due to DeploymentManager action
function_haret_2024-08-17 status is now inactive due to auto deactivation removed underperforming models
function_haret_2024-08-17 status is now torndown due to DeploymentManager action

Usage Metrics

Latency Metrics