submission_id: function_reput_2024-10-18
developer_uid: chai_backend_admin
celo_rating: 1263.13
display_name: reward_blend_default_full_bon
family_friendly_score: 0.5754103498256747
family_friendly_standard_error: 0.005333692291213816
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 50, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>', '<|user|>', '###'], 'max_input_tokens': 512, 'best_of': 4, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: reward_blend_default_full_bon
num_battles: 8947
num_wins: 4570
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-10-18T20:21:50+00:00
us_pacific_date: 2024-10-18
win_ratio: 0.5107857382362803
Download Preference Data
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.051523447036743s
Received healthy response to inference request in 2.7041614055633545s
Received healthy response to inference request in 2.3540098667144775s
Received healthy response to inference request in 2.335817813873291s
Received healthy response to inference request in 1.9316112995147705s
5 requests
0 failed requests
5th percentile: 2.0124526023864746
10th percentile: 2.0932939052581787
20th percentile: 2.254976511001587
30th percentile: 2.3394562244415282
40th percentile: 2.346733045578003
50th percentile: 2.3540098667144775
60th percentile: 2.4940704822540285
70th percentile: 2.634131097793579
80th percentile: 2.7736338138580323
90th percentile: 2.9125786304473875
95th percentile: 2.9820510387420653
99th percentile: 3.0376289653778077
mean time: 2.4754247665405273
Pipeline stage StressChecker completed in 13.82s
Shutdown handler de-registered
function_reput_2024-10-18 status is now deployed due to DeploymentManager action
function_reput_2024-10-18 status is now inactive due to auto deactivation removed underperforming models