function_fomal_2024-11-19

submission_id: function_fomal_2024-11-19

developer_uid: chai_backend_admin

celo_rating: 1277.93

display_name: retune_with_base

family_friendly_score: 0.5953999999999999

family_friendly_standard_error: 0.006941164743758788

formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}

generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}

is_internal_developer: True

model_group:

model_name: retune_with_base

num_battles: 11835

num_wins: 6314

ranking_group: single

status: inactive

submission_type: function

timestamp: 2024-11-19T21:41:36+00:00

us_pacific_date: 2024-11-19

win_ratio: 0.533502323616392

Resubmit model

Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 5.217072248458862s
Received healthy response to inference request in 3.501116991043091s
Received healthy response to inference request in 5.953108787536621s
Received healthy response to inference request in 6.176409721374512s
Received healthy response to inference request in 3.6058332920074463s
5 requests
0 failed requests
5th percentile: 3.522060251235962
10th percentile: 3.543003511428833
20th percentile: 3.584890031814575
30th percentile: 3.9280810832977293
40th percentile: 4.572576665878296
50th percentile: 5.217072248458862
60th percentile: 5.5114868640899655
70th percentile: 5.8059014797210695
80th percentile: 5.997768974304199
90th percentile: 6.087089347839355
95th percentile: 6.131749534606934
99th percentile: 6.167477684020996
mean time: 4.8907082080841064
%s, retrying in %s seconds...
Received healthy response to inference request in 5.8859944343566895s
Received healthy response to inference request in 10.56333327293396s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 13.15406608581543s
Received healthy response to inference request in 12.438954830169678s
Received healthy response to inference request in 13.127009868621826s
5 requests
0 failed requests
5th percentile: 6.821462202072143
10th percentile: 7.756929969787597
20th percentile: 9.627865505218505
30th percentile: 10.938457584381103
40th percentile: 11.688706207275391
50th percentile: 12.438954830169678
60th percentile: 12.714176845550536
70th percentile: 12.989398860931397
80th percentile: 13.132421112060547
90th percentile: 13.143243598937989
95th percentile: 13.14865484237671
99th percentile: 13.152983837127685
mean time: 11.033871698379517
%s, retrying in %s seconds...
Received healthy response to inference request in 12.458518981933594s
Received healthy response to inference request in 3.083099365234375s
Received healthy response to inference request in 2.4693763256073s
Received healthy response to inference request in 3.6339235305786133s
Received healthy response to inference request in 2.9605002403259277s
5 requests
0 failed requests
5th percentile: 2.5676011085510253
10th percentile: 2.665825891494751
20th percentile: 2.8622754573822022
30th percentile: 2.985020065307617
40th percentile: 3.0340597152709963
50th percentile: 3.083099365234375
60th percentile: 3.3034290313720702
70th percentile: 3.5237586975097654
80th percentile: 5.398842620849611
90th percentile: 8.928680801391602
95th percentile: 10.693599891662597
99th percentile: 12.105535163879395
mean time: 4.921083688735962
Pipeline stage StressChecker completed in 108.12s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 5.58s
Shutdown handler de-registered
function_fomal_2024-11-19 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3705.14s
Shutdown handler de-registered
function_fomal_2024-11-19 status is now inactive due to auto deactivation removed underperforming models