submission_id: function_gupor_2024-09-20
developer_uid: chai_backend_admin
celo_rating: 1216.77
display_name: elo_alignment_randomize_memory
family_friendly_score: 0.0
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.95, 'top_p': 1.0, 'min_p': 0.08, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<|eot_id|>'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: elo_alignment_randomize_memory
num_battles: 12549
num_wins: 5721
ranking_group: single
status: torndown
submission_type: function
timestamp: 2024-09-20T03:00:45+00:00
us_pacific_date: 2024-09-19
win_ratio: 0.45589289983265596
Download Preference Data
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.8862051963806152s
Received healthy response to inference request in 2.4095466136932373s
Received healthy response to inference request in 2.340507745742798s
Received healthy response to inference request in 2.2751786708831787s
Received healthy response to inference request in 2.021674633026123s
5 requests
0 failed requests
5th percentile: 2.072375440597534
10th percentile: 2.123076248168945
20th percentile: 2.2244778633117677
30th percentile: 2.2882444858551025
40th percentile: 2.31437611579895
50th percentile: 2.340507745742798
60th percentile: 2.3681232929229736
70th percentile: 2.3957388401031494
80th percentile: 2.504878330230713
90th percentile: 2.695541763305664
95th percentile: 2.7908734798431394
99th percentile: 2.86713885307312
mean time: 2.3866225719451903
Pipeline stage StressChecker completed in 13.87s
Shutdown handler de-registered
function_gupor_2024-09-20 status is now deployed due to DeploymentManager action
function_gupor_2024-09-20 status is now inactive due to auto deactivation removed underperforming models
function_gupor_2024-09-20 status is now torndown due to DeploymentManager action