submission_id: function_gitit_2024-08-16
developer_uid: chai_backend_admin
celo_rating: 1205.82
display_name: gpt4-tl
family_friendly_score: 0.0
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 1.2, 'top_p': 1.0, 'min_p': 0.1, 'top_k': 100, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', 'You:'], 'max_input_tokens': 512, 'best_of': 12, 'max_output_tokens': 64, 'reward_max_token_input': 256}
is_internal_developer: True
model_group:
model_name: gpt4-tl
num_battles: 9581
num_wins: 4413
ranking_group: single
reward_repo: ChaiML/gpt2_xl_pairwise_89m_step_347634
status: torndown
submission_type: function
timestamp: 2024-08-16T02:16:10+00:00
us_pacific_date: 2024-08-15
win_ratio: 0.4605991023901472
Download Preferencedata
Resubmit model
Running pipeline stage StressChecker
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 2.0542781352996826s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 2.290761709213257s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 3.6793878078460693s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 2.380680799484253s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 2.505622625350952s
5 requests
0 failed requests
5th percentile: 2.1015748500823976
10th percentile: 2.148871564865112
20th percentile: 2.243464994430542
30th percentile: 2.308745527267456
40th percentile: 2.3447131633758547
50th percentile: 2.380680799484253
60th percentile: 2.4306575298309325
70th percentile: 2.480634260177612
80th percentile: 2.740375661849976
90th percentile: 3.2098817348480226
95th percentile: 3.4446347713470455
99th percentile: 3.6324372005462644
mean time: 2.5821462154388426
Pipeline stage StressChecker completed in 16.17s
function_gitit_2024-08-16 status is now deployed due to DeploymentManager action
function_gitit_2024-08-16 status is now inactive due to auto deactivation removed underperforming models
function_gitit_2024-08-16 status is now torndown due to DeploymentManager action