function_kefas_2024-11-18

developer_uid: chai_backend_admin

submission_id: function_kefas_2024-11-18

model_name: retune_with_base

model_group:

status: inactive

timestamp: 2024-11-18T18:36:21+00:00

num_battles: 15006

num_wins: 7670

celo_rating: 1264.57

family_friendly_score: 0.607

family_friendly_standard_error: 0.006907257053273753

submission_type: function

display_name: retune_with_base

is_internal_developer: True

ranking_group: single

us_pacific_date: 2024-11-18

win_ratio: 0.5111288817806211

generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}

formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
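As a rough illustration of how the formatter templates above could combine into a single prompt, here is a minimal sketch. The `build_prompt` helper, the assembly order (memory, prompt, chat turns, response cue), and the sample names and messages are all assumptions made for illustration; only the template strings come from this record. `truncate_by_message` and the `max_input_tokens` limit are not modeled.

```python
# Template strings copied from the formatter field of this record.
formatter = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        formatter["memory_template"].format(bot_name=bot_name, memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response cue ends with "{bot_name}:" so the model continues that line.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Illustrative values only; none of these appear in the record.
example = build_prompt(
    bot_name="Ava",
    user_name="Sam",
    memory="friendly guide",
    prompt="Ava greets Sam.",
    turns=[("user", "Hi"), ("bot", "Hello!"), ("user", "How are you?")],
)
print(example)
```

Because `stopping_words` in `generation_params` includes `'\n'`, a completion that continues from the trailing `{bot_name}:` cue would presumably end at the first line break, giving one single-line reply per generation.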


Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.195505857467651s
Received healthy response to inference request in 4.552473783493042s
Received healthy response to inference request in 3.3755857944488525s
Received healthy response to inference request in 3.8636021614074707s
Received healthy response to inference request in 4.190983057022095s
5 requests
0 failed requests
5th percentile: 3.4731890678405763
10th percentile: 3.5707923412322997
20th percentile: 3.765998888015747
30th percentile: 3.9290783405303955
40th percentile: 4.060030698776245
50th percentile: 4.190983057022095
60th percentile: 4.192792177200317
70th percentile: 4.19460129737854
80th percentile: 4.266899442672729
90th percentile: 4.409686613082886
95th percentile: 4.481080198287964
99th percentile: 4.538195066452026
mean time: 4.035630130767823
%s, retrying in %s seconds...
Received healthy response to inference request in 4.021906614303589s
Received healthy response to inference request in 3.3611786365509033s
Received healthy response to inference request in 3.356321334838867s
Received healthy response to inference request in 3.216768980026245s
Received healthy response to inference request in 3.2888760566711426s
5 requests
0 failed requests
5th percentile: 3.231190395355225
10th percentile: 3.245611810684204
20th percentile: 3.274454641342163
30th percentile: 3.3023651123046873
40th percentile: 3.3293432235717773
50th percentile: 3.356321334838867
60th percentile: 3.3582642555236815
70th percentile: 3.360207176208496
80th percentile: 3.4933242321014406
90th percentile: 3.7576154232025147
95th percentile: 3.8897610187530516
99th percentile: 3.9954774951934815
mean time: 3.4490103244781496
Pipeline stage StressChecker completed in 40.43s
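The percentile and mean lines reported by StressChecker above can be reproduced from the five latencies in each batch using linear interpolation between sorted samples. The sketch below is a reconstruction that happens to match the logged numbers; it is not taken from the StressChecker source, and the `percentile` helper is a hypothetical name.

```python
import math


def percentile(values, q):
    """Percentile q (0-100) using linear interpolation between order statistics."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    frac = pos - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


# Latencies in seconds, copied from the first batch of log lines.
first_batch = [
    4.195505857467651,
    4.552473783493042,
    3.3755857944488525,
    3.8636021614074707,
    4.190983057022095,
]

# Latencies in seconds, copied from the second batch (after the retry line).
second_batch = [
    4.021906614303589,
    3.3611786365509033,
    3.356321334838867,
    3.216768980026245,
    3.2888760566711426,
]

for name, batch in (("first", first_batch), ("second", second_batch)):
    mean = sum(batch) / len(batch)
    print(name, "p5:", percentile(batch, 5), "p50:", percentile(batch, 50),
          "p99:", percentile(batch, 99), "mean:", mean)
```

With only five samples per batch, the high percentiles are interpolated between the two slowest requests, so they say little about tail latency.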
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.94s
Shutdown handler de-registered
function_kefas_2024-11-18 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3467.92s
Shutdown handler de-registered
function_kefas_2024-11-18 status is now inactive due to auto deactivation removed underperforming models
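The summary fields at the top of this record can be cross-checked with a little arithmetic. `win_ratio` equals `num_wins / num_battles`. The reported `family_friendly_standard_error` is also consistent with a binomial standard error, sqrt(p(1 - p) / n), over roughly 5000 scored samples. That sample count is inferred here and is not stated anywhere in the record, so treat it as an assumption; the `binomial_se` helper is likewise a hypothetical name.

```python
import math

# Values copied from the record.
num_battles = 15006
num_wins = 7670
family_friendly_score = 0.607
reported_se = 0.006907257053273753

# Win ratio as a plain fraction of battles won.
win_ratio = num_wins / num_battles


def binomial_se(p, n):
    """Standard error of a proportion p estimated from n independent samples."""
    return math.sqrt(p * (1 - p) / n)


# Invert the formula to see which sample size the reported error implies.
# This is an inference, not a value taken from the record.
implied_n = family_friendly_score * (1 - family_friendly_score) / reported_se ** 2

print("win_ratio:", win_ratio)
print("implied sample size:", round(implied_n))
print("binomial SE at that size:", binomial_se(family_friendly_score, round(implied_n)))
```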