submission_id: function_kadef_2024-09-20
developer_uid: chai_backend_admin
celo_rating: 1204.11
display_name: elo_alignment_randomize_memory
family_friendly_score: 0.0
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.95, 'top_p': 1.0, 'min_p': 0.08, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<|eot_id|>'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: elo_alignment_randomize_memory
num_battles: 14859
num_wins: 6436
ranking_group: single
status: torndown
submission_type: function
timestamp: 2024-09-20T03:01:21+00:00
us_pacific_date: 2024-09-19
win_ratio: 0.43313816542163
Download Preference Data
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.2877066135406494s
Received healthy response to inference request in 2.5315423011779785s
Received healthy response to inference request in 2.3442208766937256s
Received healthy response to inference request in 2.142879009246826s
Received healthy response to inference request in 2.1964802742004395s
5 requests
0 failed requests
5th percentile: 2.153599262237549
10th percentile: 2.1643195152282715
20th percentile: 2.185760021209717
30th percentile: 2.226028394699097
40th percentile: 2.285124635696411
50th percentile: 2.3442208766937256
60th percentile: 2.4191494464874266
70th percentile: 2.494078016281128
80th percentile: 2.682775163650513
90th percentile: 2.9852408885955812
95th percentile: 3.136473751068115
99th percentile: 3.2574600410461425
mean time: 2.5005658149719237
Pipeline stage StressChecker completed in 13.20s
Shutdown handler de-registered
function_kadef_2024-09-20 status is now deployed due to DeploymentManager action
function_kadef_2024-09-20 status is now inactive due to auto deactivation removed underperforming models
function_kadef_2024-09-20 status is now torndown due to DeploymentManager action