submission_id: function_dolik_2024-10-29
developer_uid: valentin
celo_rating: 1242.0
display_name: guard_ensemble_w0pt1_bo1
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: guard_ensemble_w0pt1_bo1
num_battles: 6114
num_wins: 3096
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-10-29T14:04:17+00:00
us_pacific_date: 2024-10-29
win_ratio: 0.5063788027477919
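The win_ratio field above appears to be num_wins divided by num_battles. The sketch below checks that reading against the reported values. The variable names mirror the metadata keys, and the formula is an assumption inferred from the numbers, not a documented definition.

```python
# Values copied from the submission metadata above.
num_battles = 6114
num_wins = 3096

# Assumed definition: fraction of battles won.
win_ratio = num_wins / num_battles

print(f"win_ratio = {win_ratio}")
```

With the reported counts this agrees with the listed win_ratio, so the submission won slightly more than half of its battles.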
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.310801029205322s
Received healthy response to inference request in 5.147119760513306s
Received healthy response to inference request in 3.980621099472046s
Received healthy response to inference request in 3.9752650260925293s
Received healthy response to inference request in 5.072280168533325s
5 requests
0 failed requests
5th percentile: 3.9763362407684326
10th percentile: 3.977407455444336
20th percentile: 3.9795498847961426
30th percentile: 4.046657085418701
40th percentile: 4.178729057312012
50th percentile: 4.310801029205322
60th percentile: 4.615392684936523
70th percentile: 4.919984340667725
80th percentile: 5.087248086929321
90th percentile: 5.1171839237213135
95th percentile: 5.13215184211731
99th percentile: 5.1441261768341064
mean time: 4.497217416763306
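The percentile and mean figures above are consistent with linear interpolation between the sorted response times of the five requests. The following is a minimal sketch of that calculation. The percentile helper is hypothetical and written for illustration; it is not taken from the StressChecker code, whose actual implementation is not shown in the log.

```python
import math

# Response times (seconds) copied from the five "healthy response" log lines.
latencies = [
    4.310801029205322,
    5.147119760513306,
    3.980621099472046,
    3.9752650260925293,
    5.072280168533325,
]


def percentile(values, q):
    """Return the q-th percentile (0 <= q <= 100) using linear interpolation.

    Hypothetical helper: interpolates between the two nearest ranks of the
    sorted sample, which reproduces the figures reported in the log.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")

mean_time = sum(latencies) / len(latencies)
print(f"mean time: {mean_time}")
```

With only five samples, every percentile is an interpolation between two neighbouring measurements, so the tail values (95th, 99th) sit just below the slowest request and should not be read as a robust latency estimate.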
Pipeline stage StressChecker completed in 25.91s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 8.57s
Shutdown handler de-registered
function_dolik_2024-10-29 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Received signal 2, running shutdown handler
Shutdown handler de-registered
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3065.55s
Shutdown handler de-registered