submission_id: function_bupik_2024-11-18
developer_uid: chai_backend_admin
celo_rating: 1249.5
display_name: retune_with_base
family_friendly_score: 0.6064
family_friendly_standard_error: 0.006909110507149239
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: retune_with_base
num_battles: 12382
num_wins: 6048
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-11-18T18:17:39+00:00
us_pacific_date: 2024-11-18
win_ratio: 0.488450977225004
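The `formatter` templates above can be read as a recipe for assembling the text sent to the model. The sketch below is a minimal illustration: the template strings are copied from the record, but the assembly order (persona, then scenario prompt, then chat turns, then the response prefix), the `build_prompt` helper, and the sample names are assumptions made for illustration, not the platform's confirmed implementation. Truncation to `max_input_tokens` is not modeled.

```python
# Template strings copied from the `formatter` field of the record above.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Assemble one model input (hypothetical helper).

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The ordering of the parts is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response prefix has no trailing newline, so generation continues
    # on the same line and stops at the first "\n" (see `stopping_words`).
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    bot_name="Ava",
    user_name="Sam",
    memory="friendly",
    prompt="A chat.",
    turns=[("user", "Hi"), ("bot", "Hello")],
)
print(example)
```

Because `'\n'` is listed in `stopping_words`, each generation under this reading would end after a single chat line from the bot.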
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.808981418609619s
Received healthy response to inference request in 2.760331869125366s
Received healthy response to inference request in 3.9776463508605957s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 4.204990863800049s
Received healthy response to inference request in 4.351502418518066s
5 requests
0 failed requests
5th percentile: 2.7700617790222166
10th percentile: 2.7797916889190675
20th percentile: 2.7992515087127687
30th percentile: 3.0427144050598143
40th percentile: 3.510180377960205
50th percentile: 3.9776463508605957
60th percentile: 4.068584156036377
70th percentile: 4.159521961212159
80th percentile: 4.2342931747436525
90th percentile: 4.292897796630859
95th percentile: 4.322200107574463
99th percentile: 4.345641956329346
mean time: 3.6206905841827393
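The percentile and mean figures in this batch are consistent with linear interpolation between the sorted latencies (the default behaviour of `numpy.percentile`). The sketch below reproduces them from the five logged response times; the `percentile` helper is a hypothetical stand-in written for illustration, since the log does not show how StressChecker computes its summary.

```python
import math

# The five response times (seconds) logged in the first StressChecker batch.
latencies = [
    2.808981418609619,
    2.760331869125366,
    3.9776463508605957,
    4.204990863800049,
    4.351502418518066,
]


def percentile(values, q):
    """Percentile q (0-100) using linear interpolation between ranks."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0   # fractional index into the sorted list
    lower = math.floor(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


mean_time = sum(latencies) / len(latencies)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only five samples, every reported percentile is an interpolation between two adjacent measurements, so the tail values (95th, 99th) mostly reflect the single slowest request.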
%s, retrying in %s seconds...
Received healthy response to inference request in 4.467391729354858s
Received healthy response to inference request in 4.476415395736694s
Received healthy response to inference request in 4.433755874633789s
Received healthy response to inference request in 3.8332760334014893s
Received healthy response to inference request in 3.0823488235473633s
5 requests
0 failed requests
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
5th percentile: 3.2325342655181886
Connection pool is full, discarding connection: %s. Connection pool size: %s
10th percentile: 3.382719707489014
20th percentile: 3.683090591430664
30th percentile: 3.953372001647949
40th percentile: 4.193563938140869
50th percentile: 4.433755874633789
60th percentile: 4.447210216522217
70th percentile: 4.460664558410644
80th percentile: 4.469196462631226
90th percentile: 4.47280592918396
95th percentile: 4.474610662460327
99th percentile: 4.4760544490814205
mean time: 4.058637571334839
%s, retrying in %s seconds...
Received healthy response to inference request in 2.980198383331299s
Received healthy response to inference request in 3.112154722213745s
Received healthy response to inference request in 3.7046406269073486s
Received healthy response to inference request in 3.2655577659606934s
Received healthy response to inference request in 3.0906009674072266s
5 requests
0 failed requests
5th percentile: 3.0022789001464845
10th percentile: 3.02435941696167
20th percentile: 3.068520450592041
30th percentile: 3.09491171836853
40th percentile: 3.103533220291138
50th percentile: 3.112154722213745
60th percentile: 3.1735159397125243
70th percentile: 3.2348771572113035
80th percentile: 3.3533743381500245
90th percentile: 3.5290074825286863
95th percentile: 3.6168240547180175
99th percentile: 3.6870773124694822
mean time: 3.2306304931640626
Pipeline stage StressChecker completed in 58.90s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.63s
Shutdown handler de-registered
function_bupik_2024-11-18 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3487.70s
Shutdown handler de-registered
function_bupik_2024-11-18 status is now inactive due to auto deactivation removed underperforming models