submission_id: function_jubes_2024-10-01
developer_uid: chai_backend_admin
celo_rating: 1253.42
display_name: retune_with_base
family_friendly_score: 0.5591054313099042
family_friendly_standard_error: 0.008432730733317989
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>', '###', 'Bot:', 'User:', 'You:', '<|im_end|>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: retune_with_base
num_battles: 3556
num_wins: 1764
ranking_group: single
status: torndown
submission_type: function
timestamp: 2024-10-01T03:03:20+00:00
us_pacific_date: 2024-09-30
win_ratio: 0.49606299212598426
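The `formatter` entry above describes how a conversation is rendered into a single prompt string. A minimal sketch of that assembly follows, assuming the pieces are concatenated in the order memory, prompt, chat turns, response prefix. That ordering, the `build_prompt` helper, and the `(speaker, message)` turn representation are illustrative assumptions, not the platform's confirmed implementation. Truncation to `max_input_tokens` is not modeled.

```python
# Templates copied from the `formatter` field of this submission.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(formatter, bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the formatted pieces into one prompt string.

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The concatenation order is an assumption.
    """
    parts = [
        formatter["memory_template"].format(bot_name=bot_name, memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends without a newline so the model continues the line.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    print(
        build_prompt(
            FORMATTER,
            bot_name="Ava",
            user_name="Sam",
            memory="A cheerful guide.",
            prompt="Ava greets a traveler.",
            turns=[("user", "Hi"), ("bot", "Hello!")],
        )
    )
```

Because `'\n'` appears in `stopping_words`, generation under this layout would plausibly end at the first line break, yielding a single bot message per request.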
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.362647771835327s
Received healthy response to inference request in 4.965871810913086s
Received healthy response to inference request in 2.99088978767395s
Received healthy response to inference request in 3.339735507965088s
Received healthy response to inference request in 3.523533344268799s
5 requests
0 failed requests
5th percentile: 3.0606589317321777
10th percentile: 3.1304280757904053
20th percentile: 3.2699663639068604
30th percentile: 3.3443179607391356
40th percentile: 3.3534828662872314
50th percentile: 3.362647771835327
60th percentile: 3.4270020008087156
70th percentile: 3.4913562297821046
80th percentile: 3.8120010375976565
90th percentile: 4.3889364242553714
95th percentile: 4.677404117584228
99th percentile: 4.908178272247314
mean time: 3.63653564453125
Pipeline stage StressChecker completed in 20.10s
Shutdown handler de-registered
function_jubes_2024-10-01 status is now deployed due to DeploymentManager action
function_jubes_2024-10-01 status is now inactive due to auto deactivation removed underperforming models
function_jubes_2024-10-01 status is now torndown due to DeploymentManager action
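
The percentile and mean figures in the StressChecker output are consistent with linear interpolation between the sorted latencies of the five logged requests. The sketch below recomputes them from the five "healthy response" timings; the `percentile` helper is illustrative, and whether the pipeline itself uses this method is an assumption inferred from the matching numbers.

```python
import math

# The five latencies (in seconds) reported by the StressChecker stage.
LATENCIES = [
    3.362647771835327,
    4.965871810913086,
    2.99088978767395,
    3.339735507965088,
    3.523533344268799,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


if __name__ == "__main__":
    for q in (5, 50, 90, 99):
        print(f"{q}th percentile: {percentile(LATENCIES, q)}")
    print(f"mean time: {sum(LATENCIES) / len(LATENCIES)}")
```

With only five samples, the upper percentiles are dominated by the single slowest request (about 4.97 s), so they are best read as a rough indication rather than a stable tail-latency estimate.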