submission_id: function_nulus_2024-11-18
developer_uid: chai_backend_admin
celo_rating: 1279.97
display_name: retune_with_base
family_friendly_score: 0.575
family_friendly_standard_error: 0.006991065727054781
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: retune_with_base
num_battles: 12622
num_wins: 6787
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-11-18T18:27:35+00:00
us_pacific_date: 2024-11-18
win_ratio: 0.5377119315480906
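The derived fields above can be checked against the raw counts. The sketch below recomputes win_ratio as num_wins / num_battles. It also shows that the reported family_friendly_standard_error is consistent with a binomial standard error, sqrt(p * (1 - p) / n), under the assumption that n = 5000 samples were scored. That sample size is an inference from the numbers, not something the log states, and `assumed_sample_size` is a hypothetical name.

```python
import math

# Values copied from the metadata fields above.
num_battles = 12622
num_wins = 6787
family_friendly_score = 0.575

# win_ratio is the fraction of battles won.
win_ratio = num_wins / num_battles

# ASSUMPTION: the score is a proportion over n independent samples and
# n = 5000. Neither the formula nor n appears in the log; both are
# inferred because they reproduce the reported standard error.
assumed_sample_size = 5000
standard_error = math.sqrt(
    family_friendly_score * (1 - family_friendly_score) / assumed_sample_size
)

print(f"win_ratio: {win_ratio}")
print(f"family_friendly_standard_error (assumed n=5000): {standard_error}")
```

If the scorer used a different sample size or estimator, only the second calculation changes. The win ratio follows directly from the two counts.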
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.463149070739746s
Received healthy response to inference request in 3.316628932952881s
Received healthy response to inference request in 4.419047594070435s
Received healthy response to inference request in 3.4705910682678223s
Received healthy response to inference request in 2.7546536922454834s
5 requests
0 failed requests
5th percentile: 2.867048740386963
10th percentile: 2.9794437885284424
20th percentile: 3.2042338848114014
30th percentile: 3.345932960510254
40th percentile: 3.404541015625
50th percentile: 3.463149070739746
60th percentile: 3.4661258697509765
70th percentile: 3.469102668762207
80th percentile: 3.6602823734283447
90th percentile: 4.03966498374939
95th percentile: 4.229356288909912
99th percentile: 4.38110933303833
mean time: 3.4848140716552733
Pipeline stage StressChecker completed in 19.72s
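The StressChecker percentiles above match linear interpolation between order statistics of the five response times (the same rule as NumPy's default percentile method). The log does not say which library produced them, so the sketch below is a plain-Python reconstruction, and the helper name `percentile` is hypothetical.

```python
import math

# Response times in seconds, copied from the five
# "Received healthy response" lines above.
latencies = [
    3.463149070739746,
    3.316628932952881,
    4.419047594070435,
    3.4705910682678223,
    2.7546536922454834,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    if not values:
        raise ValueError("values must not be empty")
    ordered = sorted(values)
    # Fractional position of the percentile within the sorted sample.
    rank = (len(ordered) - 1) * q / 100
    lower = math.floor(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")

mean_time = sum(latencies) / len(latencies)
print(f"mean time: {mean_time}")
```

With only five requests, the upper percentiles are interpolated almost entirely from the single slowest response, so they describe this small sample and not a tail-latency estimate.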
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.80s
Shutdown handler de-registered
function_nulus_2024-11-18 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3042.05s
Shutdown handler de-registered
function_nulus_2024-11-18 status is now inactive due to auto deactivation removed underperforming models