function_mejan_2024-11-19

developer_uid: chai_backend_admin

submission_id: function_mejan_2024-11-19

model_name: retune_with_base

model_group:

status: inactive

timestamp: 2024-11-19T17:24:45+00:00

num_battles: 7132

num_wins: 3635

celo_rating: 1257.36

family_friendly_score: 0.5768

family_friendly_standard_error: 0.006987156216945489

submission_type: function

display_name: retune_with_base

is_internal_developer: True

ranking_group: single

us_pacific_date: 2024-11-19

win_ratio: 0.5096747055524397

generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}

formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}

Resubmit model

Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.089129686355591s
Received healthy response to inference request in 2.3435537815093994s
Received healthy response to inference request in 4.08213996887207s
Received healthy response to inference request in 2.9149255752563477s
Received healthy response to inference request in 3.85732102394104s
5 requests
0 failed requests
5th percentile: 2.457828140258789
10th percentile: 2.572102499008179
20th percentile: 2.800651216506958
30th percentile: 3.103404664993286
40th percentile: 3.4803628444671633
50th percentile: 3.85732102394104
60th percentile: 3.947248601913452
70th percentile: 4.037176179885864
80th percentile: 4.083537912368774
90th percentile: 4.086333799362182
95th percentile: 4.087731742858887
99th percentile: 4.08885009765625
mean time: 3.4574140071868897
%s, retrying in %s seconds...
Received healthy response to inference request in 2.481147527694702s
Received healthy response to inference request in 2.577596426010132s
Received healthy response to inference request in 2.5598201751708984s
Received healthy response to inference request in 2.171515941619873s
Received healthy response to inference request in 3.847773313522339s
5 requests
0 failed requests
5th percentile: 2.233442258834839
10th percentile: 2.2953685760498046
20th percentile: 2.419221210479736
30th percentile: 2.4968820571899415
40th percentile: 2.5283511161804197
50th percentile: 2.5598201751708984
60th percentile: 2.5669306755065917
70th percentile: 2.574041175842285
80th percentile: 2.8316318035125736
90th percentile: 3.3397025585174562
95th percentile: 3.593737936019897
99th percentile: 3.7969662380218505
mean time: 2.727570676803589
Pipeline stage StressChecker completed in 33.75s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.65s
Shutdown handler de-registered
function_mejan_2024-11-19 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2991.93s
Shutdown handler de-registered
function_mejan_2024-11-19 status is now inactive due to auto deactivation removed underperforming models