developer_uid: chai_backend_admin
submission_id: function_nedab_2024-11-27
model_name: retune_with_base
model_group:
status: inactive
timestamp: 2024-11-27T22:17:03+00:00
num_battles: 16417
num_wins: 8365
celo_rating: 1265.55
family_friendly_score: 0.5928
family_friendly_standard_error: 0.006948210704922527
submission_type: function
display_name: retune_with_base
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-11-27
win_ratio: 0.5095328013644393
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.115128993988037s
Received healthy response to inference request in 2.3662991523742676s
Received healthy response to inference request in 3.804119825363159s
Received healthy response to inference request in 4.2481606006622314s
Received healthy response to inference request in 4.6517109870910645s
5 requests
0 failed requests
5th percentile: 2.5160651206970215
10th percentile: 2.6658310890197754
20th percentile: 2.965363025665283
30th percentile: 3.2529271602630616
40th percentile: 3.52852349281311
50th percentile: 3.804119825363159
60th percentile: 3.9817361354827883
70th percentile: 4.159352445602417
80th percentile: 4.328870677947998
90th percentile: 4.490290832519531
95th percentile: 4.5710009098052975
99th percentile: 4.635568971633911
mean time: 3.637083911895752
%s, retrying in %s seconds...
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 2.6412839889526367s
Received healthy response to inference request in 3.228670835494995s
Received healthy response to inference request in 3.48099422454834s
Received healthy response to inference request in 3.786224365234375s
Received healthy response to inference request in 3.7845072746276855s
5 requests
0 failed requests
5th percentile: 2.7587613582611086
10th percentile: 2.87623872756958
20th percentile: 3.1111934661865233
30th percentile: 3.279135513305664
40th percentile: 3.380064868927002
50th percentile: 3.48099422454834
60th percentile: 3.602399444580078
70th percentile: 3.723804664611816
80th percentile: 3.7848506927490235
90th percentile: 3.785537528991699
95th percentile: 3.785880947113037
99th percentile: 3.7861556816101074
mean time: 3.3843361377716064
%s, retrying in %s seconds...
Received healthy response to inference request in 3.6445188522338867s
Received healthy response to inference request in 3.524757146835327s
Received healthy response to inference request in 3.7767415046691895s
Received healthy response to inference request in 3.1908724308013916s
Received healthy response to inference request in 3.3106794357299805s
5 requests
0 failed requests
5th percentile: 3.2148338317871095
10th percentile: 3.2387952327728273
20th percentile: 3.2867180347442626
30th percentile: 3.35349497795105
40th percentile: 3.4391260623931883
50th percentile: 3.524757146835327
60th percentile: 3.572661828994751
70th percentile: 3.6205665111541747
80th percentile: 3.6709633827209474
90th percentile: 3.723852443695068
95th percentile: 3.750296974182129
99th percentile: 3.7714525985717775
mean time: 3.489513874053955
Pipeline stage StressChecker completed in 55.94s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.04s
Shutdown handler de-registered
function_nedab_2024-11-27 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4507.11s
Shutdown handler de-registered
function_nedab_2024-11-27 status is now inactive due to auto deactivation removed underperforming models