developer_uid: rirv938
submission_id: function_fonot_2024-12-09
model_name: retune_with_base
model_group:
status: inactive
timestamp: 2024-12-09T20:37:31+00:00
num_battles: 13601
num_wins: 6977
celo_rating: 1244.41
family_friendly_score: 0.5906
family_friendly_standard_error: 0.006954015243008891
submission_type: function
display_name: retune_with_base
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-12-09
win_ratio: 0.512976986986251
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
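A minimal sketch of how the formatter templates above could be combined into a single prompt string. The template strings are copied from the `formatter` entry; the `build_prompt` helper, its argument names, and the assembly order (memory, prompt, conversation turns, response prefix) are assumptions for illustration only. Truncation to `max_input_tokens` is not modelled.

```python
# Template strings copied from the formatter entry in the record.
formatter = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        formatter["memory_template"].format(bot_name=bot_name, memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response template leaves the bot's line open for the model to complete.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    bot_name="Ava",
    user_name="Sam",
    memory="kind",
    prompt="A chat.",
    turns=[("user", "Hi"), ("bot", "Hello")],
)
```

Because the response template ends with the bot's name and a colon, and `stopping_words` includes a newline, generation under this assumed layout would stop at the end of a single bot line.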
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.939824342727661s
Received healthy response to inference request in 2.676985025405884s
Received healthy response to inference request in 2.6808016300201416s
Received healthy response to inference request in 4.163667678833008s
5 requests
1 failed requests
5th percentile: 2.6777483463287353
10th percentile: 2.6785116672515867
20th percentile: 2.68003830909729
30th percentile: 2.7326061725616455
40th percentile: 2.8362152576446533
50th percentile: 2.939824342727661
60th percentile: 3.4293616771697994
70th percentile: 3.918899011611938
80th percentile: 7.353888034820559
90th percentile: 13.734328746795656
95th percentile: 16.9245491027832
99th percentile: 19.476725387573243
mean time: 6.515209627151489
%s, retrying in %s seconds...
Received healthy response to inference request in 2.870347738265991s
Received healthy response to inference request in 2.750638961791992s
Received healthy response to inference request in 2.5002105236053467s
Received healthy response to inference request in 4.232475519180298s
Received healthy response to inference request in 3.469371795654297s
5 requests
0 failed requests
5th percentile: 2.5502962112426757
10th percentile: 2.6003818988800047
20th percentile: 2.700553274154663
30th percentile: 2.774580717086792
40th percentile: 2.8224642276763916
50th percentile: 2.870347738265991
60th percentile: 3.1099573612213134
70th percentile: 3.3495669841766356
80th percentile: 3.6219925403594972
90th percentile: 3.9272340297698975
95th percentile: 4.0798547744750975
99th percentile: 4.201951370239258
mean time: 3.164608907699585
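The percentile and mean figures for the five healthy requests listed immediately above are consistent with linear interpolation between sorted latencies. The sketch below reproduces them under that assumption; the `percentile` helper is hypothetical, and the log does not show how StressChecker actually computes these values.

```python
import math

# Latencies (seconds) copied from the five healthy responses in the second run.
latencies = [
    2.870347738265991,
    2.750638961791992,
    2.5002105236053467,
    4.232475519180298,
    3.469371795654297,
]


def percentile(values, q):
    """Hypothetical helper: q-th percentile (0-100), linear interpolation."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile of an empty sequence is undefined")
    # Fractional rank into the sorted list, then interpolate between neighbours.
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for q in (5, 50, 80, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only five samples, the upper percentiles are dominated by the single slowest request, which is why the 99th percentile sits close to the 4.23 s maximum.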
Pipeline stage StressChecker completed in 50.65s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.26s
Shutdown handler de-registered
function_fonot_2024-12-09 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3099.94s
Shutdown handler de-registered
function_fonot_2024-12-09 status is now inactive due to auto deactivation removed underperforming models