submission_id: function_bebab_2024-10-29
developer_uid: valentin
celo_rating: 1214.68
display_name: guard_ensemble_w0pt1_bo1
family_friendly_score: 0.579
family_friendly_standard_error: 0.006982248921371967
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: guard_ensemble_w0pt1_bo1
num_battles: 7799
num_wins: 3599
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-10-29T14:37:59+00:00
us_pacific_date: 2024-10-29
win_ratio: 0.4614694191563021
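The win_ratio above is consistent with num_wins divided by num_battles. The source does not state the formula, so the following is only a minimal check under that assumption, using the two counts reported in the metadata:

```python
# Counts copied from the metadata block above.
num_battles = 7799
num_wins = 3599

# Assumption: win_ratio is the plain fraction of battles won.
win_ratio = num_wins / num_battles

print(f"win_ratio: {win_ratio}")
```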
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.9476871490478516s
Received healthy response to inference request in 1.6276202201843262s
Received healthy response to inference request in 1.617522954940796s
Received healthy response to inference request in 1.552243947982788s
Received healthy response to inference request in 2.085299015045166s
5 requests
0 failed requests
5th percentile: 1.5652997493743896
10th percentile: 1.5783555507659912
20th percentile: 1.6044671535491943
30th percentile: 1.619542407989502
40th percentile: 1.623581314086914
50th percentile: 1.6276202201843262
60th percentile: 1.7556469917297364
70th percentile: 1.8836737632751464
80th percentile: 1.9752095222473145
90th percentile: 2.03025426864624
95th percentile: 2.0577766418457033
99th percentile: 2.0797945404052736
mean time: 1.7660746574401855
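The StressChecker percentiles and mean above can be reproduced from the five logged response times with linear interpolation between sorted samples. This is a sketch under that assumption: the log does not say how StressChecker computes its statistics, and the `percentile` helper below is hypothetical, not part of the pipeline.

```python
import math

# Response times (seconds) copied from the five "healthy response" log lines.
latencies = [
    1.9476871490478516,
    1.6276202201843262,
    1.617522954940796,
    1.552243947982788,
    2.085299015045166,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation
    between the two nearest sorted samples (hypothetical helper)."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


mean_time = sum(latencies) / len(latencies)

for q in (5, 50, 90, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only five samples, the upper percentiles are interpolated between the two slowest requests, so they say little about tail latency under real load.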
Pipeline stage StressChecker completed in 12.75s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 8.89s
Shutdown handler de-registered
function_bebab_2024-10-29 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2194.43s
Shutdown handler de-registered
function_bebab_2024-10-29 status is now inactive due to auto deactivation removed underperforming models