submission_id: function_sihaf_2024-11-19
developer_uid: chai_backend_admin
celo_rating: 1259.88
display_name: retune_with_base
family_friendly_score: 0.6106
family_friendly_standard_error: 0.006895906611896655
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: retune_with_base
num_battles: 6873
num_wins: 3528
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-11-19T17:19:05+00:00
us_pacific_date: 2024-11-19
win_ratio: 0.5133129637712789
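The derived fields above can be cross-checked from the raw counts. The sketch below recomputes win_ratio as num_wins / num_battles. It also shows that family_friendly_standard_error is consistent with a binomial standard error, sqrt(p * (1 - p) / n), under an assumed sample size. The sample size of 5000 is not stated anywhere in the record; it is an assumption chosen because it reproduces the logged value, and the variable name is hypothetical.

```python
import math

# Values copied from the submission record.
num_battles = 6873
num_wins = 3528
family_friendly_score = 0.6106

# win_ratio is the fraction of battles won.
win_ratio = num_wins / num_battles

# ASSUMPTION: the scorer evaluated 5000 samples. This number does not
# appear in the record; it is inferred because it matches the logged
# standard error under a binomial model.
assumed_sample_size = 5000
standard_error = math.sqrt(
    family_friendly_score * (1.0 - family_friendly_score) / assumed_sample_size
)

print(f"win_ratio: {win_ratio}")
print(f"binomial standard error (n={assumed_sample_size}): {standard_error}")
```

If the recomputed values diverge from the record, the most likely cause is that family_friendly_score is rounded in the record or that the real sample size differs from the assumed one.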
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.088979721069336s
Received healthy response to inference request in 4.0345728397369385s
Received healthy response to inference request in 2.856363296508789s
Received healthy response to inference request in 3.1056361198425293s
Received healthy response to inference request in 2.610539197921753s
5 requests
0 failed requests
5th percentile: 2.65970401763916
10th percentile: 2.7088688373565675
20th percentile: 2.807198476791382
30th percentile: 2.9028865814208986
40th percentile: 2.9959331512451173
50th percentile: 3.088979721069336
60th percentile: 3.0956422805786135
70th percentile: 3.1023048400878905
80th percentile: 3.2914234638214115
90th percentile: 3.662998151779175
95th percentile: 3.8487854957580563
99th percentile: 3.997415370941162
mean time: 3.139218235015869
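The StressChecker summary above can be reproduced from the five logged response times. The sketch below assumes the percentiles use linear interpolation between sorted samples (the default behaviour of common numeric libraries); the log does not say which method the pipeline uses, and the helper name percentile is hypothetical.

```python
import math

# Response times in seconds, copied from the five healthy responses in the log.
latencies = [
    3.088979721069336,
    4.0345728397369385,
    2.856363296508789,
    3.1056361198425293,
    2.610539197921753,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation.

    ASSUMPTION: the pipeline interpolates linearly between the two
    nearest sorted samples; this is inferred, not documented in the log.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only five samples, the upper percentiles are dominated by the single slowest request (about 4.03 s), so the 95th and 99th percentile figures say little about tail latency under real load.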
Pipeline stage StressChecker completed in 16.93s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.66s
Shutdown handler de-registered
function_sihaf_2024-11-19 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3665.55s
Shutdown handler de-registered
function_sihaf_2024-11-19 status is now inactive due to auto deactivation removed underperforming models