submission_id: function_sabet_2024-09-10
developer_uid: chai_backend_admin
alignment_samples: 8297
alignment_score: -0.9491930160598026
celo_rating: 1261.26
display_name: elo_alignment_amd_quartz_v2
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.95, 'top_p': 1.0, 'min_p': 0.08, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<|eot_id|>'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: elo_alignment_amd_quartz_v2
num_battles: 8297
num_wins: 4514
propriety_score: 0.7195121951219512
propriety_total_count: 738.0
ranking_group: single
status: deployed
submission_type: function
timestamp: 2024-09-10T04:57:40+00:00
us_pacific_date: 2024-09-09
win_ratio: 0.5440520670121731
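A minimal sketch of how the formatter templates above might be combined into a single prompt string. The assembly order (memory, then prompt, then chat turns, then the response prefix) and the helper name render_prompt are assumptions made for illustration, not the pipeline's confirmed behavior. The bot name, user name, and messages are hypothetical, and truncation to max_input_tokens is not modeled.

```python
# Templates copied from the formatter field; truncate_by_message is omitted
# because this sketch does not model truncation.
formatter = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def render_prompt(formatter, bot_name, user_name, memory, prompt, turns):
    """Assemble a prompt string (hypothetical helper; the order is assumed)."""
    parts = [
        formatter["memory_template"].format(bot_name=bot_name, memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response template leaves the bot's line open for the model to complete.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Hypothetical conversation, used only to show the rendered layout.
example = render_prompt(
    formatter,
    bot_name="Quartz",
    user_name="Sam",
    memory="A patient geology tutor.",
    prompt="Quartz greets a new student.",
    turns=[
        ("user", "Hi!"),
        ("bot", "Hello, Sam."),
        ("user", "What is quartz made of?"),
    ],
)
print(example)
```

Because each chat template ends in a newline and the response template does not, the rendered prompt ends on an open bot line. That fits the '\n' entry in stopping_words, which would end generation after a single line of reply.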
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.1773769855499268s
Received healthy response to inference request in 1.5115923881530762s
Received healthy response to inference request in 2.0096888542175293s
Received healthy response to inference request in 2.0455474853515625s
Received healthy response to inference request in 1.8276739120483398s
5 requests
0 failed requests
5th percentile: 1.574808692932129
10th percentile: 1.6380249977111816
20th percentile: 1.764457607269287
30th percentile: 1.8640769004821778
40th percentile: 1.9368828773498534
50th percentile: 2.0096888542175293
60th percentile: 2.0240323066711428
70th percentile: 2.0383757591247558
80th percentile: 2.0719133853912353
90th percentile: 2.1246451854705812
95th percentile: 2.151011085510254
99th percentile: 2.172103805541992
mean time: 1.9143759250640868
Pipeline stage StressChecker completed in 10.31s
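The percentile and mean figures reported by StressChecker can be reproduced from the five response times logged above. The sketch below uses linear interpolation between order statistics, the same rule as the default "linear" method of numpy.percentile. Whether the pipeline itself uses NumPy is an assumption, and the function name percentile is chosen here for illustration.

```python
import math


def percentile(samples, q):
    """Return the q-th percentile (0 <= q <= 100) using linear interpolation."""
    ordered = sorted(samples)
    if not ordered:
        raise ValueError("samples must be non-empty")
    # Fractional rank into the sorted list: 0 maps to the min, 100 to the max.
    rank = (q / 100) * (len(ordered) - 1)
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


# Response times in seconds, copied from the five inference requests in the log.
latencies = [
    2.1773769855499268,
    1.5115923881530762,
    2.0096888542175293,
    2.0455474853515625,
    1.8276739120483398,
]

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")

mean_time = sum(latencies) / len(latencies)
print(f"mean time: {mean_time}")
```

With only five samples, every percentile other than the 50th is an interpolated value rather than an observed latency, so the tail figures (95th, 99th) mostly reflect the single slowest request.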
Shutdown handler de-registered
function_sabet_2024-09-10 status is now deployed due to DeploymentManager action