submission_id: function_rasab_2024-11-06
developer_uid: chai_backend_admin
celo_rating: 1131.97
display_name: retune_with_base
family_friendly_score: 0.5913999999999999
family_friendly_standard_error: 0.006951921173316049
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: retune_with_base
num_battles: 19043
num_wins: 6951
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-11-06T22:27:55+00:00
us_pacific_date: 2024-11-06
win_ratio: 0.36501601638397313
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.6547656059265137s
Received healthy response to inference request in 3.6823513507843018s
Received healthy response to inference request in 2.7014825344085693s
Received healthy response to inference request in 1.5854012966156006s
Received healthy response to inference request in 2.8735978603363037s
5 requests
0 failed requests
5th percentile: 1.8086175441741943
10th percentile: 2.031833791732788
20th percentile: 2.4782662868499754
30th percentile: 2.7359055995941164
40th percentile: 2.80475172996521
50th percentile: 2.8735978603363037
60th percentile: 3.1860649585723877
70th percentile: 3.4985320568084717
80th percentile: 3.660282754898071
90th percentile: 3.6713170528411867
95th percentile: 3.6768342018127442
99th percentile: 3.6812479209899904
mean time: 2.8995197296142576
Pipeline stage StressChecker completed in 15.86s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.71s
Shutdown handler de-registered
function_rasab_2024-11-06 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3130.77s
Shutdown handler de-registered
function_rasab_2024-11-06 status is now inactive due to auto deactivation removed underperforming models