submission_id: function_tubik_2024-10-18
developer_uid: chai_backend_admin
celo_rating: 1266.93
display_name: reward_blend_default_full_bon
family_friendly_score: 0.5750158052789631
family_friendly_standard_error: 0.0053506721770427155
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 50, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>', '<|user|>', '###'], 'max_input_tokens': 512, 'best_of': 4, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: reward_blend_default_full_bon
num_battles: 8878
num_wins: 4595
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-10-18T20:17:26+00:00
us_pacific_date: 2024-10-18
win_ratio: 0.5175715251182699
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.6194796562194824s
Received healthy response to inference request in 2.621652364730835s
Received healthy response to inference request in 1.8041071891784668s
Received healthy response to inference request in 2.2233331203460693s
Received healthy response to inference request in 1.874459981918335s
5 requests
0 failed requests
5th percentile: 1.8181777477264405
10th percentile: 1.8322483062744142
20th percentile: 1.8603894233703613
30th percentile: 1.9442346096038818
40th percentile: 2.0837838649749756
50th percentile: 2.2233331203460693
60th percentile: 2.3817917346954345
70th percentile: 2.5402503490447996
80th percentile: 2.619914197921753
90th percentile: 2.620783281326294
95th percentile: 2.6212178230285645
99th percentile: 2.621565456390381
mean time: 2.2286064624786377
Pipeline stage StressChecker completed in 12.21s
Shutdown handler de-registered
function_tubik_2024-10-18 status is now deployed due to DeploymentManager action
function_tubik_2024-10-18 status is now inactive due to auto deactivation removed underperforming models