function_dosis_2024-10-18

developer_uid: chai_backend_admin

submission_id: function_dosis_2024-10-18

model_name: reward_blend_default_full_bon

model_group:

status: torndown

timestamp: 2024-10-18T19:44:12+00:00

num_battles: 12764

num_wins: 6445

celo_rating: 1265.2

family_friendly_score: 0.5820331398161587

family_friendly_standard_error: 0.00467243261042945

submission_type: function

display_name: reward_blend_default_full_bon

is_internal_developer: True

ranking_group: single

us_pacific_date: 2024-10-18

win_ratio: 0.5049357568160451

generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 50, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>', '<|user|>', '###'], 'max_input_tokens': 512, 'best_of': 4, 'max_output_tokens': 64}

formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}

Resubmit model

Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.203017234802246s
Received healthy response to inference request in 3.1840174198150635s
Received healthy response to inference request in 3.7317543029785156s
Received healthy response to inference request in 5.116004705429077s
Received healthy response to inference request in 3.7092769145965576s
5 requests
0 failed requests
5th percentile: 2.3992172718048095
10th percentile: 2.595417308807373
20th percentile: 2.9878173828125
30th percentile: 3.289069318771362
40th percentile: 3.49917311668396
50th percentile: 3.7092769145965576
60th percentile: 3.718267869949341
70th percentile: 3.727258825302124
80th percentile: 4.0086043834686285
90th percentile: 4.562304544448852
95th percentile: 4.839154624938964
99th percentile: 5.060634689331055
mean time: 3.588814115524292
%s, retrying in %s seconds...
Received healthy response to inference request in 2.20623779296875s
Received healthy response to inference request in 2.590333938598633s
Received healthy response to inference request in 3.439987897872925s
Received healthy response to inference request in 2.005682945251465s
Received healthy response to inference request in 5.301914930343628s
5 requests
0 failed requests
5th percentile: 2.045793914794922
10th percentile: 2.0859048843383787
20th percentile: 2.166126823425293
30th percentile: 2.2830570220947264
40th percentile: 2.4366954803466796
50th percentile: 2.590333938598633
60th percentile: 2.9301955223083493
70th percentile: 3.2700571060180663
80th percentile: 3.812373304367066
90th percentile: 4.557144117355347
95th percentile: 4.929529523849487
99th percentile: 5.2274378490448
mean time: 3.10883150100708
Pipeline stage StressChecker completed in 36.63s
Shutdown handler de-registered
function_dosis_2024-10-18 status is now deployed due to DeploymentManager action
function_dosis_2024-10-18 status is now inactive due to auto deactivation removed underperforming models
function_dosis_2024-10-18 status is now torndown due to DeploymentManager action