submission_id: function_tares_2024-11-12
developer_uid: chai_backend_admin
celo_rating: 1274.84
display_name: retune_with_base
family_friendly_score: 0.5853999999999999
family_friendly_standard_error: 0.006967163554847841
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: retune_with_base
num_battles: 16210
num_wins: 8841
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-11-12T17:53:59+00:00
us_pacific_date: 2024-11-12
win_ratio: 0.545404071560765
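The formatter entry above is a set of Python format strings. The following is a minimal sketch of how such templates could be combined into one prompt. The function name `build_prompt`, the `turns` structure, and the assembly order (memory, prompt, chat turns, response prefix) are assumptions for illustration, not the platform's confirmed behaviour. Truncation to `max_input_tokens` is not modelled.

```python
# Templates copied from the formatter field of this submission.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response prefix has no trailing newline, so the model continues
    # the bot's line until it emits a stopping word such as "\n".
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)
```

Under these assumptions the prompt ends with `{bot_name}:`, which would fit the `'\n'` entry in `stopping_words`: generation stops at the end of a single bot line.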
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.6204512119293213s
Received healthy response to inference request in 2.5181281566619873s
Received healthy response to inference request in 1.7351820468902588s
Received healthy response to inference request in 3.1707277297973633s
Received healthy response to inference request in 2.2118570804595947s
5 requests
0 failed requests
5th percentile: 1.830517053604126
10th percentile: 1.9258520603179932
20th percentile: 2.1165220737457275
30th percentile: 2.2731112957000734
40th percentile: 2.3956197261810304
50th percentile: 2.5181281566619873
60th percentile: 2.559057378768921
70th percentile: 2.5999866008758543
80th percentile: 2.7305065155029298
90th percentile: 2.9506171226501463
95th percentile: 3.060672426223755
99th percentile: 3.1487166690826416
mean time: 2.451269245147705
Pipeline stage StressChecker completed in 13.47s
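The StressChecker percentiles above match linear interpolation between the sorted latencies, the same rule NumPy's `percentile` uses by default. That the pipeline actually computes them this way is an assumption. The sketch below recomputes the figures from the five logged response times using only the standard library. The helper name `percentile` is illustrative.

```python
# Latencies (seconds) copied from the five "healthy response" log lines.
LATENCIES = [
    2.6204512119293213,
    2.5181281566619873,
    1.7351820468902588,
    3.1707277297973633,
    2.2118570804595947,
]


def percentile(values, q):
    """Linear-interpolation percentile, q in [0, 100]."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0  # fractional index into sorted data
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    frac = pos - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


mean_time = sum(LATENCIES) / len(LATENCIES)

for q in (5, 50, 90, 99):
    print(f"{q}th percentile: {percentile(LATENCIES, q)}")
print(f"mean time: {mean_time}")
```

With only five samples, every percentile other than the 0th, 25th, 50th, 75th, and 100th is an interpolated value rather than an observed latency, so the tail figures (95th, 99th) mostly reflect the single slowest request.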
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.61s
Shutdown handler de-registered
function_tares_2024-11-12 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 5371.37s
Shutdown handler de-registered
function_tares_2024-11-12 status is now inactive due to auto deactivation removed underperforming models