developer_uid: chai_backend_admin
submission_id: function_nulus_2024-11-18
model_name: retune_with_base
model_group:
status: inactive
timestamp: 2024-11-18T18:27:35+00:00
num_battles: 12622
num_wins: 6787
celo_rating: 1279.97
family_friendly_score: 0.575
family_friendly_standard_error: 0.006991065727054781
submission_type: function
display_name: retune_with_base
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-11-18
win_ratio: 0.5377119315480906
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.463149070739746s
Received healthy response to inference request in 3.316628932952881s
Received healthy response to inference request in 4.419047594070435s
Received healthy response to inference request in 3.4705910682678223s
Received healthy response to inference request in 2.7546536922454834s
5 requests
0 failed requests
5th percentile: 2.867048740386963
10th percentile: 2.9794437885284424
20th percentile: 3.2042338848114014
30th percentile: 3.345932960510254
40th percentile: 3.404541015625
50th percentile: 3.463149070739746
60th percentile: 3.4661258697509765
70th percentile: 3.469102668762207
80th percentile: 3.6602823734283447
90th percentile: 4.03966498374939
95th percentile: 4.229356288909912
99th percentile: 4.38110933303833
mean time: 3.4848140716552733
Pipeline stage StressChecker completed in 19.72s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.80s
Shutdown handler de-registered
function_nulus_2024-11-18 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3042.05s
Shutdown handler de-registered
function_nulus_2024-11-18 status is now inactive due to auto deactivation removed underperforming models