developer_uid: rirv938
submission_id: function_pimob_2024-12-17
model_name: retune_with_base
model_group:
status: inactive
timestamp: 2024-12-17T17:15:23+00:00
num_battles: 11244
num_wins: 4962
celo_rating: 1211.2
family_friendly_score: 0.5866
family_friendly_standard_error: 0.0069642004566209895
submission_type: function
display_name: retune_with_base
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-12-17
win_ratio: 0.4413020277481323
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
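The formatter templates above can be read as the pieces of one prompt string. The following is a minimal sketch of how they might be combined, assuming the order persona, prompt, conversation turns, then the response prefix. The `build_prompt` helper, the sample names, and the sample messages are hypothetical and do not come from the submission. Truncation to `max_input_tokens` and the `truncate_by_message` flag are not modeled.

```python
# Template strings copied from the formatter entry above.
formatter = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        formatter["memory_template"].format(bot_name=bot_name, memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response prefix has no trailing newline, so the model continues the bot's line.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Illustrative values only.
example = build_prompt(
    bot_name="Ava",
    user_name="Sam",
    memory="A cheerful guide.",
    prompt="Ava greets a visitor.",
    turns=[("user", "Hi!"), ("bot", "Hello there."), ("user", "How are you?")],
)
print(example)
```

Because `stopping_words` contains `'\n'`, generation under this layout would end at the first newline, which fits the one-message-per-line structure of the templates.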
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.5249273777008057s
Received healthy response to inference request in 3.4347403049468994s
Received healthy response to inference request in 3.003145933151245s
Received healthy response to inference request in 2.0238096714019775s
Received healthy response to inference request in 2.7110812664031982s
5 requests
0 failed requests
5th percentile: 2.124033212661743
10th percentile: 2.224256753921509
20th percentile: 2.42470383644104
30th percentile: 2.562158155441284
40th percentile: 2.636619710922241
50th percentile: 2.7110812664031982
60th percentile: 2.827907133102417
70th percentile: 2.944732999801636
80th percentile: 3.089464807510376
90th percentile: 3.262102556228638
95th percentile: 3.3484214305877686
99th percentile: 3.4174765300750733
mean time: 2.7395409107208253
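The percentile and mean figures above are consistent with linear interpolation between the sorted latencies of the five requests. The sketch below recomputes them with the standard library only. The `percentile` helper is hypothetical; the StressChecker's actual implementation is not shown in this log and may differ.

```python
import math

# Latencies (seconds) copied from the five healthy responses above.
latencies = [
    2.5249273777008057,
    3.4347403049468994,
    3.003145933151245,
    2.0238096714019775,
    2.7110812664031982,
]


def percentile(samples, q):
    """Return the q-th percentile (0 <= q <= 100) using linear interpolation."""
    ordered = sorted(samples)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


summary = {q: percentile(latencies, q) for q in (5, 50, 95)}
mean_time = sum(latencies) / len(latencies)

for q, value in summary.items():
    print(f"{q}th percentile: {value}")
print(f"mean time: {mean_time}")
```

With only five samples, the tail percentiles are interpolated between the two slowest requests, so they indicate little beyond the range of the observed latencies.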
Pipeline stage StressChecker completed in 15.50s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.32s
Shutdown handler de-registered
function_pimob_2024-12-17 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3525.10s
Shutdown handler de-registered
function_pimob_2024-12-17 status is now inactive due to auto deactivation (removed underperforming models)