submission_id: function_keset_2024-11-12
developer_uid: chai_backend_admin
celo_rating: 1255.54
display_name: retune_with_base
family_friendly_score: 0.581
family_friendly_standard_error: 0.006977664365674233
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: retune_with_base
num_battles: 12934
num_wins: 6543
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-11-12T13:28:29+00:00
us_pacific_date: 2024-11-12
win_ratio: 0.5058759857739292
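Note on the formatter field above: a minimal sketch of how the templates might be combined into a single model input. The assembly order (memory, then prompt, then conversation turns, then the response stub) and the helper name build_prompt are assumptions for illustration, not the platform's confirmed implementation. Truncation to max_input_tokens is not modeled.

```python
# Template strings copied from the formatter field of this submission.
formatter = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates in an assumed order.

    turns is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        formatter["memory_template"].format(bot_name=bot_name, memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response stub ends with "{bot_name}:" so the model continues as the bot.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    print(build_prompt("Ava", "Sam", "kind", "Hi", [("user", "hello")]))
```

Because stopping_words contains '\n', generation under this layout would end at the first newline, which keeps each reply to a single bot line.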
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.333787679672241s
Received healthy response to inference request in 2.2053680419921875s
Received healthy response to inference request in 2.3473875522613525s
Received healthy response to inference request in 1.9604465961456299s
Received healthy response to inference request in 1.415926456451416s
5 requests
0 failed requests
5th percentile: 1.5248304843902587
10th percentile: 1.6337345123291016
20th percentile: 1.8515425682067872
30th percentile: 2.0094308853149414
40th percentile: 2.1073994636535645
50th percentile: 2.2053680419921875
60th percentile: 2.256735897064209
70th percentile: 2.3081037521362306
80th percentile: 2.3365076541900636
90th percentile: 2.341947603225708
95th percentile: 2.34466757774353
99th percentile: 2.346843557357788
mean time: 2.0525832653045653
Pipeline stage StressChecker completed in 11.60s
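Note on the StressChecker figures above: the reported percentiles are consistent with linear interpolation between the sorted response times of the five requests. The sketch below reproduces them from the logged latencies; the function name percentile is illustrative, and the assumption is only that the checker uses this common interpolation rule, not that this is its actual code.

```python
import math

# Response times (seconds) copied from the five "healthy response" log lines.
latencies = [
    2.333787679672241,
    2.2053680419921875,
    2.3473875522613525,
    1.9604465961456299,
    1.415926456451416,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100  # fractional index into the sorted list
    lo = math.floor(pos)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


if __name__ == "__main__":
    for q in (5, 50, 95):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples, the tail percentiles (95th, 99th) are interpolated between the two slowest requests, so they say little about real tail latency.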
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.42s
Shutdown handler de-registered
function_keset_2024-11-12 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3073.12s
Shutdown handler de-registered
function_keset_2024-11-12 status is now inactive due to auto deactivation removed underperforming models