submission_id: function_jeduf_2024-11-18
developer_uid: chai_backend_admin
celo_rating: 1281.17
display_name: retune_with_base
family_friendly_score: 0.5920000000000001
family_friendly_standard_error: 0.006950338121271511
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: retune_with_base
num_battles: 12873
num_wins: 6911
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-11-18T18:33:00+00:00
us_pacific_date: 2024-11-18
win_ratio: 0.5368600947720035
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.4291765689849854s
Received healthy response to inference request in 4.151886463165283s
Received healthy response to inference request in 3.645716428756714s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 3.9577300548553467s
Received healthy response to inference request in 6.18381404876709s
5 requests
0 failed requests
5th percentile: 3.4724845409393312
10th percentile: 3.5157925128936767
20th percentile: 3.602408456802368
30th percentile: 3.7081191539764404
40th percentile: 3.8329246044158936
50th percentile: 3.9577300548553467
60th percentile: 4.035392618179321
70th percentile: 4.113055181503296
80th percentile: 4.558271980285645
90th percentile: 5.371043014526367
95th percentile: 5.777428531646728
99th percentile: 6.102536945343018
mean time: 4.273664712905884
%s, retrying in %s seconds...
Received healthy response to inference request in 4.296272039413452s
Received healthy response to inference request in 3.0765442848205566s
Received healthy response to inference request in 4.058679819107056s
Received healthy response to inference request in 4.985209226608276s
Received healthy response to inference request in 4.085729360580444s
5 requests
0 failed requests
5th percentile: 3.2729713916778564
10th percentile: 3.4693984985351562
20th percentile: 3.862252712249756
30th percentile: 4.064089727401734
40th percentile: 4.0749095439910885
50th percentile: 4.085729360580444
60th percentile: 4.1699464321136475
70th percentile: 4.254163503646851
80th percentile: 4.4340594768524175
90th percentile: 4.7096343517303465
95th percentile: 4.847421789169311
99th percentile: 4.957651739120483
mean time: 4.100486946105957
%s, retrying in %s seconds...
Received healthy response to inference request in 3.26663875579834s
Received healthy response to inference request in 3.3647077083587646s
Received healthy response to inference request in 2.891350269317627s
Received healthy response to inference request in 3.240828037261963s
Received healthy response to inference request in 6.317065954208374s
5 requests
0 failed requests
5th percentile: 2.961245822906494
10th percentile: 3.0311413764953614
20th percentile: 3.170932483673096
30th percentile: 3.245990180969238
40th percentile: 3.2563144683837892
50th percentile: 3.26663875579834
60th percentile: 3.30586633682251
70th percentile: 3.34509391784668
80th percentile: 3.955179357528687
90th percentile: 5.1361226558685305
95th percentile: 5.726594305038452
99th percentile: 6.19897162437439
mean time: 3.8161181449890136
Pipeline stage StressChecker completed in 65.32s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.60s
Shutdown handler de-registered
function_jeduf_2024-11-18 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3496.06s
Shutdown handler de-registered
function_jeduf_2024-11-18 status is now inactive due to auto deactivation removed underperforming models