submission_id: function_mumuk_2024-10-01
developer_uid: chai_backend_admin
celo_rating: 1262.06
display_name: retune_with_base
family_friendly_score: 0.5680415719389412
family_friendly_standard_error: 0.008901833246912688
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>', '###', 'Bot:', 'User:', 'You:', '<|im_end|>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: retune_with_base
num_battles: 3168
num_wins: 1643
ranking_group: single
status: torndown
submission_type: function
timestamp: 2024-10-01T03:27:36+00:00
us_pacific_date: 2024-09-30
win_ratio: 0.5186237373737373
Download Preference Data
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.038555860519409s
HTTPSConnectionPool(host='guanaco-submitter.chai-research.com', port=443): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 5.707759857177734s
Received healthy response to inference request in 4.635817766189575s
Received healthy response to inference request in 3.702934980392456s
5 requests
1 failed requests
5th percentile: 3.1714316844940185
10th percentile: 3.3043075084686278
20th percentile: 3.5700591564178468
30th percentile: 3.88951153755188
40th percentile: 4.262664651870727
50th percentile: 4.635817766189575
60th percentile: 5.0645946025848385
70th percentile: 5.493371438980103
80th percentile: 8.615024662017825
90th percentile: 14.429554271697999
95th percentile: 17.336819076538085
99th percentile: 19.662630920410155
mean time: 7.46583046913147
%s, retrying in %s seconds...
Received healthy response to inference request in 2.614380359649658s
Received healthy response to inference request in 2.5374343395233154s
Received healthy response to inference request in 3.289886236190796s
Received healthy response to inference request in 5.642785310745239s
Received healthy response to inference request in 3.190676212310791s
5 requests
0 failed requests
5th percentile: 2.552823543548584
10th percentile: 2.5682127475738525
20th percentile: 2.5989911556243896
30th percentile: 2.7296395301818848
40th percentile: 2.960157871246338
50th percentile: 3.190676212310791
60th percentile: 3.230360221862793
70th percentile: 3.270044231414795
80th percentile: 3.7604660511016847
90th percentile: 4.7016256809234624
95th percentile: 5.17220549583435
99th percentile: 5.548669347763061
mean time: 3.45503249168396
Pipeline stage StressChecker completed in 71.92s
Shutdown handler de-registered
function_mumuk_2024-10-01 status is now deployed due to DeploymentManager action
function_mumuk_2024-10-01 status is now inactive due to auto deactivation removed underperforming models
function_mumuk_2024-10-01 status is now torndown due to DeploymentManager action