developer_uid: chai_backend_admin
submission_id: function_dorob_2024-08-17
model_name: gpt4-tl
status: torndown
timestamp: 2024-08-17T05:38:08+00:00
num_battles: 9144
num_wins: 4343
celo_rating: 1214.3
family_friendly_score: 0.0
submission_type: function
display_name: gpt4-tl
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-08-16
win_ratio: 0.47495625546806647
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.1, 'top_k': 100, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', 'You:'], 'max_input_tokens': 512, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
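The formatter entry above only lists templates; it does not say how they are combined. As a minimal sketch, assuming the pieces are concatenated in the order memory, prompt, chat turns, response cue, the following shows how such templates might produce the final model input. The function name build_prompt, the turn representation, and the assembly order are assumptions for illustration. Truncation to max_input_tokens is not modelled.

```python
# Templates copied from the formatter entry in this record.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response cue ends without a newline so the model continues the line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    # Placeholder values, not taken from the record.
    print(build_prompt("Ava", "Sam", "friendly", "A chat.", [("user", "hi")]))
```

Because response_template ends with the bot name and a colon, the stopping words '\n' and 'You:' in generation_params would plausibly cut the completion off at the end of a single bot message.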
Running pipeline stage StressChecker
Received healthy response to inference request in 7.035886287689209s
Received healthy response to inference request in 2.423793315887451s
Received healthy response to inference request in 2.0750129222869873s
Received healthy response to inference request in 2.2388839721679688s
Received healthy response to inference request in 2.987109899520874s
5 requests
0 failed requests
5th percentile: 2.1077871322631836
10th percentile: 2.14056134223938
20th percentile: 2.2061097621917725
30th percentile: 2.275865840911865
40th percentile: 2.349829578399658
50th percentile: 2.423793315887451
60th percentile: 2.6491199493408204
70th percentile: 2.874446582794189
80th percentile: 3.7968651771545416
90th percentile: 5.4163757324218755
95th percentile: 6.226131010055541
99th percentile: 6.873935232162475
mean time: 3.3521372795104982
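The percentile and mean figures above are consistent with linear interpolation between the sorted response times. The sketch below recomputes them from the five logged latencies. The helper name percentile is an assumption for illustration; the StressChecker's actual implementation is not shown in this log.

```python
import math

# The five response times logged by the StressChecker stage, in seconds.
latencies = [
    7.035886287689209,
    2.423793315887451,
    2.0750129222869873,
    2.2388839721679688,
    2.987109899520874,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


mean_time = sum(latencies) / len(latencies)

if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {mean_time}")
```

With only five samples, the upper percentiles are dominated by the single 7.04 s response, which is why the mean sits well above the median.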
Pipeline stage StressChecker completed in 17.25s
function_dorob_2024-08-17 status is now deployed due to DeploymentManager action
function_dorob_2024-08-17 status is now inactive due to auto deactivation removed underperforming models
function_dorob_2024-08-17 status is now torndown due to DeploymentManager action