function_mamam_2024-11-19

developer_uid: chai_backend_admin

submission_id: function_mamam_2024-11-19

model_name: retune_with_base

model_group:

status: inactive

timestamp: 2024-11-19T21:50:44+00:00

num_battles: 12231

num_wins: 6349

celo_rating: 1268.2

family_friendly_score: 0.594

family_friendly_standard_error: 0.006944983801277006

submission_type: function

display_name: retune_with_base

is_internal_developer: True

ranking_group: single

us_pacific_date: 2024-11-19

win_ratio: 0.519090834764124

generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}

formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}

Resubmit model

Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 3.9249210357666016s
Received healthy response to inference request in 2.592864513397217s
Received healthy response to inference request in 4.309762001037598s
Received healthy response to inference request in 4.80255651473999s
Received healthy response to inference request in 3.3423476219177246s
5 requests
0 failed requests
5th percentile: 2.7427611351013184
10th percentile: 2.89265775680542
20th percentile: 3.192451000213623
30th percentile: 3.4588623046875
40th percentile: 3.691891670227051
50th percentile: 3.9249210357666016
60th percentile: 4.078857421875
70th percentile: 4.232793807983398
80th percentile: 4.408320903778076
90th percentile: 4.605438709259033
95th percentile: 4.703997611999512
99th percentile: 4.782844734191895
mean time: 3.794490337371826
%s, retrying in %s seconds...
Received healthy response to inference request in 5.604747772216797s
Received healthy response to inference request in 2.976043462753296s
Received healthy response to inference request in 3.6511621475219727s
Received healthy response to inference request in 3.4218599796295166s
Received healthy response to inference request in 3.5321319103240967s
5 requests
0 failed requests
5th percentile: 3.06520676612854
10th percentile: 3.154370069503784
20th percentile: 3.3326966762542725
30th percentile: 3.443914365768433
40th percentile: 3.4880231380462647
50th percentile: 3.5321319103240967
60th percentile: 3.579744005203247
70th percentile: 3.6273561000823973
80th percentile: 4.041879272460938
90th percentile: 4.823313522338867
95th percentile: 5.2140306472778315
99th percentile: 5.526604347229004
mean time: 3.837189054489136
Pipeline stage StressChecker completed in 41.89s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.79s
Shutdown handler de-registered
function_mamam_2024-11-19 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3995.85s
Shutdown handler de-registered
function_mamam_2024-11-19 status is now inactive due to auto deactivation removed underperforming models