submission_id: function_bunar_2024-10-18
developer_uid: chai_backend_admin
celo_rating: 1264.14
display_name: reward_blend_default_full_bon
family_friendly_score: 0.5861489284176356
family_friendly_standard_error: 0.004881090242032852
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 50, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>', '<|user|>', '###'], 'max_input_tokens': 512, 'best_of': 4, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: reward_blend_default_full_bon
num_battles: 13783
num_wins: 6873
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-10-18T19:43:25+00:00
us_pacific_date: 2024-10-18
win_ratio: 0.4986577668141914
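
The `formatter` templates above can be illustrated with a minimal sketch of how a prompt string might be assembled from them. The `build_prompt` helper, the ordering of the parts, and the sample names and messages are hypothetical and are not taken from the submission. Truncation (`max_input_tokens`, `truncate_by_message`) is not modelled.

```python
# Template strings copied from the `formatter` field of this record.
formatter = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join memory, prompt, chat turns, and the response cue.

    `turns` is a list of (speaker, message) pairs, where speaker is "bot" or "user".
    The concatenation order is an assumption, not confirmed by the record.
    """
    parts = [
        formatter["memory_template"].format(bot_name=bot_name, memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(formatter["bot_template"].format(bot_name=bot_name, message=message))
        else:
            parts.append(formatter["user_template"].format(user_name=user_name, message=message))
    # The response template leaves the bot's turn open for the model to complete.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Made-up sample conversation, for illustration only.
example = build_prompt(
    bot_name="Ava",
    user_name="Sam",
    memory="A friendly guide.",
    prompt="Ava greets a traveler.",
    turns=[("user", "Hi!")],
)
print(example)
```

Because `stopping_words` in `generation_params` includes `'\n'`, a completion that follows a prompt built this way would presumably end at the first newline, that is, after a single bot message.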
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.1677701473236084s
Received healthy response to inference request in 2.3458566665649414s
Received healthy response to inference request in 4.640395164489746s
Received healthy response to inference request in 1.9912965297698975s
Received healthy response to inference request in 3.4100542068481445s
5 requests
0 failed requests
5th percentile: 2.062208557128906
10th percentile: 2.133120584487915
20th percentile: 2.2749446392059327
30th percentile: 2.510239362716675
40th percentile: 2.8390047550201416
50th percentile: 3.1677701473236084
60th percentile: 3.2646837711334227
70th percentile: 3.3615973949432374
80th percentile: 3.656122398376465
90th percentile: 4.148258781433105
95th percentile: 4.394326972961426
99th percentile: 4.591181526184082
mean time: 3.1110745429992677
Pipeline stage StressChecker completed in 17.45s
Shutdown handler de-registered
function_bunar_2024-10-18 status is now deployed due to DeploymentManager action
function_bunar_2024-10-18 status is now inactive due to auto deactivation removed underperforming models
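
The StressChecker percentiles and mean reported above are consistent with linear interpolation over the five logged response times. The sketch below reproduces them under that assumption. The `percentile` helper is hypothetical and is not the pipeline's own code.

```python
# Response times (seconds) copied from the five "Received healthy response" log lines.
latencies = [
    3.1677701473236084,
    2.3458566665649414,
    4.640395164489746,
    1.9912965297698975,
    3.4100542068481445,
]


def percentile(values, q):
    """Return the q-th percentile (0..100) using linear interpolation between ranks."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


# Same percentile levels as the log output.
levels = (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99)
summary = {q: percentile(latencies, q) for q in levels}
mean_time = sum(latencies) / len(latencies)

for q, value in summary.items():
    print(f"{q}th percentile: {value}")
print(f"mean time: {mean_time}")
```

With only five samples, the upper percentiles are interpolated between the two slowest requests, so they say little about tail latency under real load.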