submission_id: function_pogub_2024-09-20
developer_uid: chai_backend_admin
celo_rating: 1260.2
display_name: elo_alignment_randomize_memory
family_friendly_score: 0.0
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.95, 'top_p': 1.0, 'min_p': 0.08, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<|eot_id|>'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: elo_alignment_randomize_memory
num_battles: 11958
num_wins: 6227
ranking_group: single
status: torndown
submission_type: function
timestamp: 2024-09-20T03:06:13+00:00
us_pacific_date: 2024-09-19
win_ratio: 0.5207392540558622
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.5409696102142334s
Received healthy response to inference request in 3.3692967891693115s
Received healthy response to inference request in 4.835160970687866s
Received healthy response to inference request in 2.715935230255127s
Received healthy response to inference request in 3.2705397605895996s
5 requests
0 failed requests
5th percentile: 2.5759627342224123
10th percentile: 2.6109558582305907
20th percentile: 2.680942106246948
30th percentile: 2.8268561363220215
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
40th percentile: 3.0486979484558105
50th percentile: 3.2705397605895996
60th percentile: 3.3100425720214846
70th percentile: 3.349545383453369
80th percentile: 3.662469625473023
90th percentile: 4.248815298080444
95th percentile: 4.541988134384155
99th percentile: 4.776526403427124
mean time: 3.3463804721832275
Pipeline stage StressChecker completed in 17.61s
Shutdown handler de-registered
function_pogub_2024-09-20 status is now deployed due to DeploymentManager action
function_pogub_2024-09-20 status is now inactive due to auto deactivation removed underperforming models
function_pogub_2024-09-20 status is now torndown due to DeploymentManager action