function_pudir_2024-09-19

developer_uid: chai_backend_admin

submission_id: function_pudir_2024-09-19

model_name: reward_blend_default_full_bon

model_group:

status: torndown

timestamp: 2024-09-19T22:12:24+00:00

num_battles: 11139

num_wins: 6176

celo_rating: 1285.09

family_friendly_score: 0.0

submission_type: function

display_name: reward_blend_default_full_bon

is_internal_developer: True

ranking_group: single

us_pacific_date: 2024-09-19

win_ratio: 0.5544483346799534

generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 50, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>', '<|user|>', '###'], 'max_input_tokens': 512, 'best_of': 4, 'max_output_tokens': 64}

formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}

Resubmit model

Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.822741985321045s
Received healthy response to inference request in 5.129310131072998s
Received healthy response to inference request in 3.9844374656677246s
Received healthy response to inference request in 3.810523271560669s
Received healthy response to inference request in 5.735930919647217s
5 requests
0 failed requests
5th percentile: 3.8453061103820803
10th percentile: 3.880088949203491
20th percentile: 3.9496546268463133
30th percentile: 4.152098369598389
40th percentile: 4.487420177459716
50th percentile: 4.822741985321045
60th percentile: 4.945369243621826
70th percentile: 5.067996501922607
80th percentile: 5.250634288787842
90th percentile: 5.493282604217529
95th percentile: 5.6146067619323725
99th percentile: 5.711666088104248
mean time: 4.696588754653931
%s, retrying in %s seconds...
Received healthy response to inference request in 3.8311707973480225s
Received healthy response to inference request in 5.194744348526001s
Received healthy response to inference request in 5.332444667816162s
Received healthy response to inference request in 6.245071172714233s
Received healthy response to inference request in 5.969021558761597s
5 requests
0 failed requests
5th percentile: 4.103885507583618
10th percentile: 4.376600217819214
20th percentile: 4.922029638290406
30th percentile: 5.222284412384033
40th percentile: 5.2773645401000975
50th percentile: 5.332444667816162
60th percentile: 5.587075424194336
70th percentile: 5.841706180572509
80th percentile: 6.024231481552124
90th percentile: 6.134651327133179
95th percentile: 6.189861249923706
99th percentile: 6.234029188156128
mean time: 5.314490509033203
%s, retrying in %s seconds...
Received healthy response to inference request in 6.655776262283325s
Received healthy response to inference request in 3.3748879432678223s
Received healthy response to inference request in 4.384046792984009s
Received healthy response to inference request in 3.6142284870147705s
Received healthy response to inference request in 4.205065011978149s
5 requests
0 failed requests
5th percentile: 3.422756052017212
10th percentile: 3.4706241607666017
20th percentile: 3.5663603782653808
30th percentile: 3.732395792007446
40th percentile: 3.968730401992798
50th percentile: 4.205065011978149
60th percentile: 4.2766577243804935
70th percentile: 4.348250436782837
80th percentile: 4.838392686843872
90th percentile: 5.747084474563599
95th percentile: 6.201430368423462
99th percentile: 6.564907083511352
mean time: 4.446800899505615
Pipeline stage StressChecker completed in 73.98s
Shutdown handler de-registered
function_pudir_2024-09-19 status is now deployed due to DeploymentManager action
function_pudir_2024-09-19 status is now inactive due to auto deactivation removed underperforming models
function_pudir_2024-09-19 status is now torndown due to DeploymentManager action