developer_uid: azuruce
submission_id: function_sumek_2024-10-28
model_name: select-safe
model_group:
status: torndown
timestamp: 2024-10-28T05:36:25+00:00
num_battles: 10896
num_wins: 5111
celo_rating: 1213.52
family_friendly_score: 0.608
family_friendly_standard_error: 0.006904143683325254
submission_type: function
display_name: select-safe
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-10-27
win_ratio: 0.4690712187958884
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
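Note on win_ratio: the reported value matches num_wins divided by num_battles. A minimal sketch of that check, assuming no smoothing or weighting is applied (the variable names simply mirror the metadata keys above):

```python
# Values copied from the metadata fields above.
num_battles = 10896
num_wins = 5111

# Assumption: win_ratio is the plain fraction of battles won.
win_ratio = num_wins / num_battles
print(f"win_ratio: {win_ratio}")
```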
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 5.166102170944214s
Received healthy response to inference request in 4.515929698944092s
Received healthy response to inference request in 2.5928192138671875s
Received healthy response to inference request in 3.2727949619293213s
Received healthy response to inference request in 3.691180944442749s
5 requests
0 failed requests
5th percentile: 2.7288143634796143
10th percentile: 2.864809513092041
20th percentile: 3.1367998123168945
30th percentile: 3.3564721584320067
40th percentile: 3.523826551437378
50th percentile: 3.691180944442749
60th percentile: 4.021080446243286
70th percentile: 4.350979948043823
80th percentile: 4.645964193344116
90th percentile: 4.906033182144165
95th percentile: 5.03606767654419
99th percentile: 5.140095272064209
mean time: 3.847765398025513
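Note on the StressChecker figures: the percentiles and mean above can be reproduced from the five response times by linear interpolation between the sorted samples. The sketch below is an illustration only; the `percentile` helper is hypothetical and is not taken from the pipeline's own code, which may compute these values differently.

```python
import math

# Latencies in seconds, copied from the five "healthy response" lines above.
latencies = [
    5.166102170944214,
    4.515929698944092,
    2.5928192138671875,
    3.2727949619293213,
    3.691180944442749,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) by linear interpolation.

    Hypothetical helper: interpolates between the two sorted samples
    that bracket the fractional rank (n - 1) * q / 100.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples, the upper percentiles are interpolated between the two slowest requests, so they say little about tail latency under real load.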
Pipeline stage StressChecker completed in 21.16s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 13.20s
Shutdown handler de-registered
function_sumek_2024-10-28 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2805.46s
Shutdown handler de-registered
function_sumek_2024-10-28 status is now inactive due to auto deactivation removed underperforming models
function_sumek_2024-10-28 status is now torndown due to DeploymentManager action