submission_id: function_luhul_2024-11-06
developer_uid: chai_backend_admin
celo_rating: 1202.1
display_name: retune_with_base
family_friendly_score: 0.5978
family_friendly_standard_error: 0.0069344813793102075
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: retune_with_base
num_battles: 22348
num_wins: 10441
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-11-06T22:09:27+00:00
us_pacific_date: 2024-11-06
win_ratio: 0.4672006443529622
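The win_ratio field appears to be num_wins divided by num_battles. A minimal sketch of that check, assuming no smoothing or weighting is applied (the helper name win_ratio is hypothetical and not taken from the pipeline):

```python
# Values copied from the metadata fields above.
num_battles = 22348
num_wins = 10441


def win_ratio(wins: int, battles: int) -> float:
    """Return the fraction of battles won (hypothetical helper)."""
    if battles <= 0:
        raise ValueError("battles must be positive")
    return wins / battles


ratio = win_ratio(num_wins, num_battles)
print(f"win_ratio: {ratio}")
```

A ratio below 0.5 means the submission lost more battles than it won.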
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.4724221229553223s
Received healthy response to inference request in 3.4829421043395996s
Received healthy response to inference request in 3.5883870124816895s
Received healthy response to inference request in 6.042686939239502s
Received healthy response to inference request in 2.2561991214752197s
5 requests
0 failed requests
5th percentile: 2.2994437217712402
10th percentile: 2.3426883220672607
20th percentile: 2.4291775226593018
30th percentile: 2.6745261192321776
40th percentile: 3.078734111785889
50th percentile: 3.4829421043395996
60th percentile: 3.5251200675964354
70th percentile: 3.5672980308532716
80th percentile: 4.079246997833252
90th percentile: 5.060966968536377
95th percentile: 5.5518269538879395
99th percentile: 5.94451494216919
mean time: 3.5685274600982666
Pipeline stage StressChecker completed in 19.20s
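The StressChecker percentiles and mean can be recomputed from the five logged response times. The sketch below assumes linear interpolation between sorted samples (the default behaviour of NumPy's percentile function); the function name percentile is illustrative and is not the pipeline's own code.

```python
import math
from statistics import fmean

# Response times in seconds, copied from the five "healthy response" log lines.
latencies = [
    2.4724221229553223,
    3.4829421043395996,
    3.5883870124816895,
    6.042686939239502,
    2.2561991214752197,
]


def percentile(values, q):
    """q-th percentile (0-100) using linear interpolation between sorted samples."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100  # fractional index into the sorted list
    lo, hi = math.floor(rank), math.ceil(rank)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (rank - lo)


for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {fmean(latencies)}")
```

With only five samples, the upper percentiles are interpolated between the two slowest requests, so they are dominated by the single 6.04 s response.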
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 3.13s
Shutdown handler de-registered
function_luhul_2024-11-06 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4092.88s
Shutdown handler de-registered
function_luhul_2024-11-06 status is now inactive due to auto deactivation removed underperforming models