function_mimok_2024-11-15

submission_id: function_mimok_2024-11-15

developer_uid: chai_backend_admin

celo_rating: 1252.06

display_name: retune_with_base

family_friendly_score: 0.583

family_friendly_standard_error: 0.006972962067873307

formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}

generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}

is_internal_developer: True

model_group:

model_name: retune_with_base

num_battles: 15153

num_wins: 7728

ranking_group: single

status: inactive

submission_type: function

timestamp: 2024-11-15T22:02:28+00:00

us_pacific_date: 2024-11-15

win_ratio: 0.509998020194021


Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.767237663269043s
Received healthy response to inference request in 4.521410226821899s
Received healthy response to inference request in 3.3941540718078613s
Received healthy response to inference request in 3.663597583770752s
Received healthy response to inference request in 3.739039182662964s
5 requests
0 failed requests
5th percentile: 3.4480427742004394
10th percentile: 3.5019314765930174
20th percentile: 3.609708881378174
30th percentile: 3.6786859035491943
40th percentile: 3.708862543106079
50th percentile: 3.739039182662964
60th percentile: 3.7503185749053953
70th percentile: 3.7615979671478272
80th percentile: 3.9180721759796144
90th percentile: 4.2197412014007565
95th percentile: 4.370575714111328
99th percentile: 4.491243324279785
mean time: 3.817087745666504
%s, retrying in %s seconds...
Received healthy response to inference request in 3.5588228702545166s
Received healthy response to inference request in 4.367923259735107s
Received healthy response to inference request in 2.8239054679870605s
Received healthy response to inference request in 3.5969254970550537s
Received healthy response to inference request in 3.4134202003479004s
5 requests
0 failed requests
5th percentile: 2.9418084144592287
10th percentile: 3.0597113609313964
20th percentile: 3.2955172538757322
30th percentile: 3.4425007343292235
40th percentile: 3.5006618022918703
50th percentile: 3.5588228702545166
60th percentile: 3.5740639209747314
70th percentile: 3.589304971694946
80th percentile: 3.7511250495910646
90th percentile: 4.059524154663086
95th percentile: 4.2137237071990965
99th percentile: 4.337083349227905
mean time: 3.552199459075928
Pipeline stage StressChecker completed in 39.49s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 3.13s
Shutdown handler de-registered
function_mimok_2024-11-15 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3124.13s
Shutdown handler de-registered
function_mimok_2024-11-15 status is now inactive due to auto deactivation removed underperforming models
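
The StressChecker summaries above report percentiles and a mean over five latency samples per run. The logged figures appear consistent with linear interpolation between the sorted samples, which is an inference from the numbers and not something the log states. The sketch below recomputes the first run's summary under that assumption; the `percentile` helper and the variable names are hypothetical and not taken from the pipeline's code.

```python
from typing import Sequence


def percentile(samples: Sequence[float], q: float) -> float:
    """Return the q-th percentile (0-100) by linear interpolation
    between the two closest ranks of the sorted samples."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into ordered
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


# Latencies (seconds) copied from the first StressChecker run in the log.
latencies = [
    3.767237663269043,
    4.521410226821899,
    3.3941540718078613,
    3.663597583770752,
    3.739039182662964,
]

quantiles = (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99)
summary = {q: percentile(latencies, q) for q in quantiles}
mean_time = sum(latencies) / len(latencies)

print(f"{len(latencies)} requests")
for q, value in summary.items():
    print(f"{q}th percentile: {value}")
print(f"mean time: {mean_time}")
```

With only five samples, the upper percentiles are dominated by the single slowest request, so the 95th and 99th percentile figures are better read as interpolations toward the maximum than as tail estimates.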