submission_id: function_lisol_2024-10-14
developer_uid: chai_backend_admin
celo_rating: 1231.76
display_name: retune_with_base
family_friendly_score: 0.5892742143225143
family_friendly_standard_error: 0.004797120693145111
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: retune_with_base
num_battles: 11220
num_wins: 5373
ranking_group: single
status: torndown
submission_type: function
timestamp: 2024-10-14T02:36:33+00:00
us_pacific_date: 2024-10-13
win_ratio: 0.4788770053475936
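The `formatter` templates above describe how a conversation is flattened into one model input. The sketch below shows one plausible way to assemble it. The function name `build_prompt`, the `turns` structure, and the ordering of sections (persona, prompt, chat history, response prefix) are assumptions for illustration, not the platform's confirmed implementation. It also ignores truncation to `max_input_tokens`.

```python
# Template strings copied from the `formatter` field above.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Assemble a model input from the templates (hypothetical helper).

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The section order is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response prefix has no trailing newline, so the model continues
    # on the same line as the bot's name.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    print(build_prompt("Ava", "Sam", "friendly", "A chat.", [("user", "hi")]))
```

Because `stopping_words` includes `'\n'`, generation under these settings would stop at the end of a single line, which is consistent with the one-message-per-line templates.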
Shutdown handler not registered because Python interpreter is not running in the main thread
Running pipeline stage StressChecker
Received healthy response to inference request in 6.427759647369385s
Received healthy response to inference request in 4.19994592666626s
Received healthy response to inference request in 6.087379455566406s
Received healthy response to inference request in 4.428776025772095s
Received healthy response to inference request in 5.989637851715088s
5 requests
0 failed requests
5th percentile: 4.245711946487427
10th percentile: 4.291477966308594
20th percentile: 4.383010005950927
30th percentile: 4.7409483909606935
40th percentile: 5.36529312133789
50th percentile: 5.989637851715088
60th percentile: 6.028734493255615
70th percentile: 6.067831134796142
80th percentile: 6.155455493927002
90th percentile: 6.2916075706481935
95th percentile: 6.359683609008789
99th percentile: 6.4141444396972656
mean time: 5.426699781417847
%s, retrying in %s seconds...
Received healthy response to inference request in 5.601219415664673s
Received healthy response to inference request in 5.539386034011841s
Received healthy response to inference request in 3.4578635692596436s
Received healthy response to inference request in 3.172170639038086s
Received healthy response to inference request in 4.894582509994507s
5 requests
0 failed requests
5th percentile: 3.2293092250823974
10th percentile: 3.286447811126709
20th percentile: 3.400724983215332
30th percentile: 3.745207357406616
40th percentile: 4.319894933700562
50th percentile: 4.894582509994507
60th percentile: 5.152503919601441
70th percentile: 5.4104253292083735
80th percentile: 5.551752710342408
90th percentile: 5.57648606300354
95th percentile: 5.588852739334106
99th percentile: 5.598746080398559
mean time: 4.53304443359375
%s, retrying in %s seconds...
Received healthy response to inference request in 1.9844677448272705s
Received healthy response to inference request in 2.199650287628174s
Received healthy response to inference request in 2.641805648803711s
Received healthy response to inference request in 3.2106411457061768s
Received healthy response to inference request in 2.4430885314941406s
5 requests
0 failed requests
5th percentile: 2.027504253387451
10th percentile: 2.070540761947632
20th percentile: 2.1566137790679933
30th percentile: 2.2483379364013674
40th percentile: 2.345713233947754
50th percentile: 2.4430885314941406
60th percentile: 2.5225753784179688
70th percentile: 2.602062225341797
80th percentile: 2.7555727481842043
90th percentile: 2.9831069469451905
95th percentile: 3.0968740463256834
99th percentile: 3.187887725830078
mean time: 2.4959306716918945
Pipeline stage StressChecker completed in 69.55s
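The percentile and mean figures in each StressChecker batch are consistent with linear interpolation between sorted samples (the default behaviour of `numpy.percentile`). The sketch below recomputes them for the third batch using only the standard library. The helper name `percentile` is hypothetical, and that the pipeline uses this exact method is an inference from the numbers, not something the log states.

```python
from statistics import mean


def percentile(samples, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(samples)
    # Fractional position of the percentile within the sorted samples.
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


# Latencies in seconds, copied from the third batch of the log above.
latencies = [
    1.9844677448272705,
    2.199650287628174,
    2.641805648803711,
    3.2106411457061768,
    2.4430885314941406,
]

if __name__ == "__main__":
    for q in (5, 50, 90, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {mean(latencies)}")
```

With only five samples per batch, the upper percentiles are interpolated almost entirely from the two slowest requests, so they are better read as a rough indication than as a stable tail-latency estimate.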
Shutdown handler de-registered
function_lisol_2024-10-14 status is now deployed due to DeploymentManager action
function_lisol_2024-10-14 status is now inactive due to auto deactivation removed underperforming models
function_lisol_2024-10-14 status is now torndown due to DeploymentManager action