submission_id: function_ruhur_2024-08-19
developer_uid: chai_backend_admin
alignment_samples: 4905
alignment_score: 2.8011184110547385
celo_rating: 1195.29
display_name: gpt4-tl
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.1, 'top_k': 100, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', 'You:'], 'max_input_tokens': 512, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: gpt4-tl
num_battles: 4905
num_wins: 2218
propriety_score: 0.8148936170212766
propriety_total_count: 470.0
ranking_group: single
status: torndown
submission_type: function
timestamp: 2024-08-19T04:42:45+00:00
us_pacific_date: 2024-08-18
win_ratio: 0.45219164118246685
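
The formatter entry above defines string templates for the persona, the scenario prompt, each chat turn, and the response cue. The following is a minimal sketch of how such templates could be combined into one model input. The assembly order (memory, prompt, turns, response cue) and the helper name build_prompt are assumptions for illustration; the page does not show the actual backend code, and truncation to max_input_tokens is not modelled here.

```python
# Templates copied from the formatter field of this submission.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, memory, prompt, turns):
    """Hypothetical helper: join the templates in an assumed order.

    turns is a list of (speaker, message) pairs; a speaker equal to
    bot_name is rendered with bot_template, anyone else with user_template.
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == bot_name:
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=speaker, message=message)
            )
    # The response cue ends without a newline so the model continues the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt("Ava", "friendly", "Intro", [("You", "hi")])
```

Because stopping_words contains '\n' and 'You:', generation under this layout would end at the first line break or at the start of a new user turn, which matches the one-line-per-message templates.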
Running pipeline stage StressChecker
Received healthy response to inference request in 3.523791790008545s
Received healthy response to inference request in 2.2121546268463135s
Received healthy response to inference request in 1.3996074199676514s
Received healthy response to inference request in 1.3079898357391357s
Received healthy response to inference request in 1.289036750793457s
5 requests
0 failed requests
5th percentile: 1.2928273677825928
10th percentile: 1.2966179847717285
20th percentile: 1.30419921875
30th percentile: 1.326313352584839
40th percentile: 1.3629603862762452
50th percentile: 1.3996074199676514
60th percentile: 1.7246263027191162
70th percentile: 2.049645185470581
80th percentile: 2.4744820594787598
90th percentile: 2.9991369247436523
95th percentile: 3.2614643573760986
99th percentile: 3.471326303482056
mean time: 1.9465160846710206
Pipeline stage StressChecker completed in 10.27s
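
The percentile and mean figures reported by StressChecker can be reproduced from the five logged response times. The sketch below uses linear interpolation between order statistics, which agrees with the logged values; the log itself does not name the method, so treat that choice, and the helper name percentile, as assumptions.

```python
import math

# Response times in seconds, copied from the five health-check lines above.
latencies = [
    3.523791790008545,
    2.2121546268463135,
    1.3996074199676514,
    1.3079898357391357,
    1.289036750793457,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    # Fractional position of the percentile within the sorted sample.
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for q in (5, 50, 90, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only five samples, the upper percentiles are interpolated almost entirely between the two slowest requests, so they mainly reflect the single 3.52 s response.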
function_ruhur_2024-08-19 status is now deployed due to DeploymentManager action
function_ruhur_2024-08-19 status is now inactive due to admin request
function_ruhur_2024-08-19 status is now torndown due to DeploymentManager action
