submission_id: function_dohol_2024-10-18
developer_uid: chai_backend_admin
celo_rating: 1271.5
display_name: reward_blend_default_full_bon
family_friendly_score: 0.5596776105250681
family_friendly_standard_error: 0.005371561629434037
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 50, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>', '<|user|>', '###'], 'max_input_tokens': 512, 'best_of': 4, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: reward_blend_default_full_bon
num_battles: 8853
num_wins: 4666
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-10-18T21:13:47+00:00
us_pacific_date: 2024-10-18
win_ratio: 0.5270529763921834
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.16746187210083s
Received healthy response to inference request in 2.606011390686035s
Received healthy response to inference request in 1.4710993766784668s
Received healthy response to inference request in 2.090207576751709s
Received healthy response to inference request in 3.1384336948394775s
5 requests
0 failed requests
5th percentile: 1.5949210166931151
10th percentile: 1.7187426567077637
20th percentile: 1.9663859367370606
30th percentile: 2.105658435821533
40th percentile: 2.136560153961182
50th percentile: 2.16746187210083
60th percentile: 2.342881679534912
70th percentile: 2.518301486968994
80th percentile: 2.712495851516724
90th percentile: 2.9254647731781005
95th percentile: 3.031949234008789
99th percentile: 3.1171368026733397
mean time: 2.294642782211304
Pipeline stage StressChecker completed in 12.90s
Shutdown handler de-registered
function_dohol_2024-10-18 status is now deployed due to DeploymentManager action
function_dohol_2024-10-18 status is now inactive due to auto deactivation removed underperforming models