developer_uid: chai_backend_admin
submission_id: function_nejol_2024-12-08
model_name: retune_with_base
model_group:
status: inactive
timestamp: 2024-12-08T23:23:46+00:00
num_battles: 12670
num_wins: 7043
celo_rating: 1291.29
family_friendly_score: 0.5864
family_friendly_standard_error: 0.006964697265495466
submission_type: function
display_name: retune_with_base
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-12-08
win_ratio: 0.5558800315706393
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
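
The formatter templates above can be read as pieces of a single prompt string. The sketch below is an assumption about how they combine: memory first, then the prompt, then each chat turn, then the response prefix. The record does not show the actual assembly order. `build_prompt`, the sample persona, and the sample messages are hypothetical, and truncation to `max_input_tokens` is not modelled.

```python
# Templates copied from the formatter entry in this record.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response prefix leaves the final line open for the model to complete.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Illustrative inputs only; none of these values come from the record.
example = build_prompt(
    bot_name="Ava",
    user_name="Sam",
    memory="A friendly guide.",
    prompt="Ava greets the traveller.",
    turns=[("user", "Hello"), ("bot", "Hi there"), ("user", "How are you?")],
)
print(example)
```

Under this reading, the prompt always ends with the bare `{bot_name}:` prefix, so the model's output continues that line.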
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.870960474014282s
Received healthy response to inference request in 3.2189347743988037s
Received healthy response to inference request in 3.6287477016448975s
Received healthy response to inference request in 4.04752516746521s
Received healthy response to inference request in 3.0472636222839355s
5 requests
0 failed requests
5th percentile: 3.081597852706909
10th percentile: 3.115932083129883
20th percentile: 3.1846005439758303
30th percentile: 3.3008973598480225
40th percentile: 3.46482253074646
50th percentile: 3.6287477016448975
60th percentile: 3.7962586879730225
70th percentile: 3.9637696743011475
80th percentile: 4.212212228775025
90th percentile: 4.5415863513946535
95th percentile: 4.706273412704467
99th percentile: 4.838023061752319
mean time: 3.762686347961426
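
The percentile and mean figures above are consistent with linear interpolation between the sorted latencies, which is the default behaviour of `numpy.percentile`. The log does not say which library produced them, so the following is only a minimal standard-library sketch under that assumption. The `percentile` helper is hypothetical, and the latencies are the five values logged for this batch.

```python
import math

# Latencies (seconds) from the five healthy responses logged for this batch.
latencies = [
    4.870960474014282,
    3.2189347743988037,
    3.6287477016448975,
    4.04752516746521,
    3.0472636222839355,
]


def percentile(values, q):
    """Linear-interpolation percentile for q in [0, 100]."""
    ordered = sorted(values)
    # Fractional position of the requested percentile in the sorted list.
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


for q in (5, 50, 95):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples, every percentile other than the 0th, 25th, 50th, 75th, and 100th is an interpolated value rather than an observed latency, so the tail figures (95th, 99th) mostly reflect the single slowest request.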
%s, retrying in %s seconds...
Received healthy response to inference request in 2.68723464012146s
Received healthy response to inference request in 2.8382952213287354s
Received healthy response to inference request in 3.0050301551818848s
Received healthy response to inference request in 3.5908851623535156s
Received healthy response to inference request in 3.5496582984924316s
5 requests
0 failed requests
5th percentile: 2.717446756362915
10th percentile: 2.74765887260437
20th percentile: 2.80808310508728
30th percentile: 2.8716422080993653
40th percentile: 2.938336181640625
50th percentile: 3.0050301551818848
60th percentile: 3.2228814125061036
70th percentile: 3.440732669830322
80th percentile: 3.5579036712646483
90th percentile: 3.574394416809082
95th percentile: 3.582639789581299
99th percentile: 3.589236087799072
mean time: 3.1342206954956056
Pipeline stage StressChecker completed in 36.82s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.17s
Shutdown handler de-registered
function_nejol_2024-12-08 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3385.23s
Shutdown handler de-registered
function_nejol_2024-12-08 status is now inactive due to auto deactivation removed underperforming models