submission_id: function_tilem_2024-10-29
developer_uid: valentin
celo_rating: 1271.39
display_name: guard_ensemble_w0pt1_bo4
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: guard_ensemble_w0pt1_bo4
num_battles: 7359
num_wins: 4024
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-10-29T14:16:25+00:00
us_pacific_date: 2024-10-29
win_ratio: 0.5468134257371926
Download Preference Data
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 9.410011768341064s
Received healthy response to inference request in 11.180140256881714s
Received healthy response to inference request in 9.73344087600708s
Received healthy response to inference request in 8.671180725097656s
Received healthy response to inference request in 9.190306901931763s
5 requests
0 failed requests
5th percentile: 8.775005960464478
10th percentile: 8.8788311958313
20th percentile: 9.086481666564941
30th percentile: 9.234247875213622
40th percentile: 9.322129821777343
50th percentile: 9.410011768341064
60th percentile: 9.53938341140747
70th percentile: 9.668755054473877
80th percentile: 10.022780752182006
90th percentile: 10.601460504531861
95th percentile: 10.890800380706787
99th percentile: 11.122272281646728
mean time: 9.637016105651856
Pipeline stage StressChecker completed in 51.82s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 5.36s
Shutdown handler de-registered
function_tilem_2024-10-29 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Received signal 2, running shutdown handler
Shutdown handler de-registered
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Received signal 2, running shutdown handler
Shutdown handler de-registered
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Received signal 2, running shutdown handler
Shutdown handler de-registered