developer_uid: chai_backend_admin
submission_id: function_bunar_2024-10-18
model_name: reward_blend_default_full_bon
model_group:
status: torndown
timestamp: 2024-10-18T19:43:25+00:00
num_battles: 13783
num_wins: 6873
celo_rating: 1264.14
family_friendly_score: 0.5861489284176356
family_friendly_standard_error: 0.004881090242032852
submission_type: function
display_name: reward_blend_default_full_bon
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-10-18
win_ratio: 0.4986577668141914
generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 50, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>', '<|user|>', '###'], 'max_input_tokens': 512, 'best_of': 4, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
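The `formatter` field above holds five string templates. The sketch below shows one plausible way they could be combined into a single model input. The assembly order (memory, prompt, conversation turns, response prefix) and the `build_prompt` helper are assumptions for illustration, not the platform's confirmed behavior. Truncation to `max_input_tokens` is not modeled.

```python
# Templates copied from the `formatter` field of this record.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template leaves the bot's line open for the model to complete.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    # Example values are invented for illustration only.
    print(build_prompt("Ava", "Sam", "kind", "Hi there", [("user", "hello")]))
```

Under this assumed layout the input ends with `{bot_name}:`, so generation would halt at the first `'\n'` listed in `stopping_words`, giving one bot line per request.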
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.1677701473236084s
Received healthy response to inference request in 2.3458566665649414s
Received healthy response to inference request in 4.640395164489746s
Received healthy response to inference request in 1.9912965297698975s
Received healthy response to inference request in 3.4100542068481445s
5 requests
0 failed requests
5th percentile: 2.062208557128906
10th percentile: 2.133120584487915
20th percentile: 2.2749446392059327
30th percentile: 2.510239362716675
40th percentile: 2.8390047550201416
50th percentile: 3.1677701473236084
60th percentile: 3.2646837711334227
70th percentile: 3.3615973949432374
80th percentile: 3.656122398376465
90th percentile: 4.148258781433105
95th percentile: 4.394326972961426
99th percentile: 4.591181526184082
mean time: 3.1110745429992677
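The percentile and mean values above can be reproduced from the five logged latencies. The figures are consistent with linear interpolation between order statistics, which is the default method of `numpy.percentile`. Whether StressChecker actually computes them this way is an assumption. The sketch uses only the standard library, and `percentile` is a hypothetical helper.

```python
import math

# Latencies in seconds, copied from the five "healthy response" log lines.
latencies = [
    3.1677701473236084,
    2.3458566665649414,
    4.640395164489746,
    1.9912965297698975,
    3.4100542068481445,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lo = math.floor(rank)
    hi = math.ceil(rank)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (rank - lo)


def mean(values):
    """Return the arithmetic mean."""
    return sum(values) / len(values)


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {mean(latencies)}")
```

With only five samples, every percentile other than the 0th, 25th, 50th, 75th, and 100th is interpolated between two neighboring measurements, so the tail figures (95th, 99th) mostly reflect the single slowest request.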
Pipeline stage StressChecker completed in 17.45s
Shutdown handler de-registered
function_bunar_2024-10-18 status is now deployed due to DeploymentManager action
function_bunar_2024-10-18 status is now inactive due to auto deactivation removed underperforming models
function_bunar_2024-10-18 status is now torndown due to DeploymentManager action