function_tihil_2024-10-05

developer_uid: chai_backend_admin

submission_id: function_tihil_2024-10-05

model_name: reward_blend_default_full_bon

model_group:

status: torndown

timestamp: 2024-10-05T02:29:11+00:00

num_battles: 9056

num_wins: 4744

celo_rating: 1270.98

family_friendly_score: 0.5502283105022832

family_friendly_standard_error: 0.005372113352314137

submission_type: function

display_name: reward_blend_default_full_bon

is_internal_developer: True

ranking_group: single

us_pacific_date: 2024-10-04

win_ratio: 0.523851590106007

generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 50, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>', '<|user|>', '###'], 'max_input_tokens': 512, 'best_of': 4, 'max_output_tokens': 64}

formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
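The formatter field above lists five string templates. As a minimal sketch of how they could combine into a single model input, the snippet below joins them in the order memory, prompt, chat turns, response cue. That ordering, the helper name `build_prompt`, and the `(speaker, message)` turn format are assumptions for illustration, not taken from the platform's code. Truncation to `max_input_tokens` is not modelled.

```python
# Templates copied verbatim from the `formatter` field of this submission.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: assemble one input string from the templates.

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The assembly order is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends without a newline so the model continues the line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    bot_name="Ava",
    user_name="Sam",
    memory="friendly guide",
    prompt="A chat.",
    turns=[("user", "Hi"), ("bot", "Hello")],
)
print(example)
```

Since `'\n'` is among the `stopping_words`, a generation under this layout would presumably stop at the end of the bot's first line, which fits the single-line `bot_template`.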

Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.1828181743621826s
Received healthy response to inference request in 3.753145694732666s
Received healthy response to inference request in 5.489656925201416s
Received healthy response to inference request in 3.555147171020508s
Received healthy response to inference request in 4.636682033538818s
5 requests
0 failed requests
5th percentile: 3.2572839736938475
10th percentile: 3.331749773025513
20th percentile: 3.480681371688843
30th percentile: 3.5947468757629393
40th percentile: 3.6739462852478026
50th percentile: 3.753145694732666
60th percentile: 4.106560230255127
70th percentile: 4.459974765777588
80th percentile: 4.807277011871338
90th percentile: 5.148466968536377
95th percentile: 5.319061946868897
99th percentile: 5.455537929534912
mean time: 4.1234899997711185
%s, retrying in %s seconds...
Received healthy response to inference request in 2.872084140777588s
Received healthy response to inference request in 3.266862154006958s
Received healthy response to inference request in 3.7612109184265137s
Received healthy response to inference request in 3.696599245071411s
Received healthy response to inference request in 3.1888587474823s
5 requests
0 failed requests
5th percentile: 2.9354390621185305
10th percentile: 2.9987939834594726
20th percentile: 3.1255038261413572
30th percentile: 3.2044594287872314
40th percentile: 3.2356607913970947
50th percentile: 3.266862154006958
60th percentile: 3.438756990432739
70th percentile: 3.6106518268585206
80th percentile: 3.7095215797424315
90th percentile: 3.7353662490844726
95th percentile: 3.7482885837554933
99th percentile: 3.75862645149231
mean time: 3.357123041152954
Pipeline stage StressChecker completed in 48.58s
Shutdown handler de-registered
function_tihil_2024-10-05 status is now deployed due to DeploymentManager action
function_tihil_2024-10-05 status is now inactive due to auto deactivation removed underperforming models
function_tihil_2024-10-05 status is now torndown due to DeploymentManager action
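The percentile and mean lines printed by the StressChecker stage can be reproduced from the five latencies of the first run. The values are consistent with linear interpolation between sorted samples, which is also the default method of NumPy's `percentile`. Whether the pipeline actually uses NumPy is an assumption, and the `percentile` helper below is a hypothetical standard-library sketch, not the pipeline's code.

```python
import math

# Latencies (seconds) from the first StressChecker run in the log above.
FIRST_RUN = [
    3.1828181743621826,
    3.753145694732666,
    5.489656925201416,
    3.555147171020508,
    4.636682033538818,
]


def percentile(samples, q):
    """Percentile q (0-100) using linear interpolation between order statistics."""
    ordered = sorted(samples)
    # Fractional index into the sorted samples.
    pos = (len(ordered) - 1) * q / 100.0
    lower = math.floor(pos)
    upper = math.ceil(pos)
    frac = pos - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


for q in (5, 50, 99):
    print(f"{q}th percentile: {percentile(FIRST_RUN, q)}")
print(f"mean time: {sum(FIRST_RUN) / len(FIRST_RUN)}")
```

With only five samples per run, the upper percentiles are interpolated between the two slowest requests, so the 95th and 99th values say little about tail latency.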