developer_uid: chai_backend_admin
submission_id: function_desof_2024-08-20
model_name: gpt4-tl
status: torndown
timestamp: 2024-08-20T16:45:39+00:00
num_battles: 7671
num_wins: 3718
celo_rating: 1219.22
family_friendly_score: 0.0
submission_type: function
display_name: gpt4-tl
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-08-20
win_ratio: 0.4846825707208969
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.1, 'top_k': 100, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', 'You:'], 'max_input_tokens': 512, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
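The formatter above only lists templates; it does not say how they are joined. A minimal sketch of one plausible assembly order (memory, prompt, conversation turns, then the response prefix) follows. The function `build_prompt`, the `turns` structure, and the ordering are assumptions for illustration, not the pipeline's confirmed behavior. Truncation to `max_input_tokens` is not modeled.

```python
# Templates copied from the formatter entry in the log.
formatter = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs where speaker is
    "bot" or "user". This structure is an assumption for the sketch.
    """
    parts = [
        formatter["memory_template"].format(bot_name=bot_name, memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response prefix is left open so the model continues as the bot.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    bot_name="Ava",
    user_name="You",
    memory="A friendly guide.",
    prompt="Ava greets a traveler.",
    turns=[("user", "Hi"), ("bot", "Hello!"), ("user", "How are you?")],
)
```

With `user_name` set to `You`, each user line begins with `You:`, which is also one of the configured `stopping_words`, so generation would stop before the model writes the user's next turn.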
Running pipeline stage StressChecker
Received healthy response to inference request in 4.765697479248047s
Received healthy response to inference request in 2.7984464168548584s
Received healthy response to inference request in 2.066361904144287s
Received healthy response to inference request in 2.240793228149414s
Received healthy response to inference request in 4.520531892776489s
5 requests
0 failed requests
5th percentile: 2.1012481689453124
10th percentile: 2.1361344337463377
20th percentile: 2.2059069633483888
30th percentile: 2.352323865890503
40th percentile: 2.5753851413726805
50th percentile: 2.7984464168548584
60th percentile: 3.4872806072235107
70th percentile: 4.176114797592163
80th percentile: 4.569565010070801
90th percentile: 4.6676312446594235
95th percentile: 4.716664361953735
99th percentile: 4.755890855789184
mean time: 3.2783661842346192
Pipeline stage StressChecker completed in 16.96s
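The percentile and mean figures reported by StressChecker are consistent with linear interpolation between the sorted samples (the same rule NumPy uses by default). The sketch below recomputes them from the five logged latencies using only the standard library. The helper name `percentile` is hypothetical, and whether the pipeline computes the values this way is an assumption.

```python
# Latencies in seconds, copied from the five healthy responses in the log.
latencies = [
    4.765697479248047,
    2.7984464168548584,
    2.066361904144287,
    2.240793228149414,
    4.520531892776489,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    # Fractional position of the percentile within the sorted samples.
    position = (len(ordered) - 1) * q / 100.0
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = position - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


mean_time = sum(latencies) / len(latencies)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only five samples, every percentile is an interpolation between two neighboring measurements, so the tail values (95th, 99th) mostly reflect the single slowest request.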
function_desof_2024-08-20 status is now deployed due to DeploymentManager action
function_desof_2024-08-20 status is now inactive due to admin request
function_desof_2024-08-20 status is now torndown due to DeploymentManager action