function_nupus_2024-12-12

developer_uid: chai_backend_admin

submission_id: function_nupus_2024-12-12

model_name: retune_with_base

model_group:

status: inactive

timestamp: 2024-12-12T23:25:43+00:00

num_battles: 6247

num_wins: 3127

celo_rating: 1256.74

family_friendly_score: 0.5838

family_friendly_standard_error: 0.006971048127792549

submission_type: function

display_name: retune_with_base

is_internal_developer: True

ranking_group: single

us_pacific_date: 2024-12-12

win_ratio: 0.500560268929086

generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}

formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}

Resubmit model

Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 12.557207345962524s
Received healthy response to inference request in 12.807865619659424s
Received healthy response to inference request in 13.201616764068604s
Received healthy response to inference request in 8.544782638549805s
Received healthy response to inference request in 2.1345720291137695s
5 requests
0 failed requests
5th percentile: 3.4166141510009767
10th percentile: 4.698656272888184
20th percentile: 7.262740516662598
30th percentile: 9.347267580032348
40th percentile: 10.952237462997436
50th percentile: 12.557207345962524
60th percentile: 12.657470655441283
70th percentile: 12.757733964920044
80th percentile: 12.88661584854126
90th percentile: 13.044116306304932
95th percentile: 13.122866535186768
99th percentile: 13.185866718292237
mean time: 9.849208879470826
%s, retrying in %s seconds...
Received healthy response to inference request in 3.5018155574798584s
Received healthy response to inference request in 3.007240056991577s
Received healthy response to inference request in 2.2216808795928955s
Received healthy response to inference request in 2.5710031986236572s
Received healthy response to inference request in 3.1112303733825684s
5 requests
0 failed requests
5th percentile: 2.291545343399048
10th percentile: 2.3614098072052
20th percentile: 2.5011387348175047
30th percentile: 2.658250570297241
40th percentile: 2.832745313644409
50th percentile: 3.007240056991577
60th percentile: 3.048836183547974
70th percentile: 3.09043231010437
80th percentile: 3.1893474102020263
90th percentile: 3.3455814838409426
95th percentile: 3.4236985206604005
99th percentile: 3.4861921501159667
mean time: 2.8825940132141112
Pipeline stage StressChecker completed in 65.86s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.54s
Shutdown handler de-registered
function_nupus_2024-12-12 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2809.72s
Shutdown handler de-registered
function_nupus_2024-12-12 status is now inactive due to auto deactivation removed underperforming models