function_sabil_2024-11-20

developer_uid: chai_backend_admin

submission_id: function_sabil_2024-11-20

model_name: retune_with_base

model_group:

status: inactive

timestamp: 2024-11-20T01:59:42+00:00

num_battles: 11894

num_wins: 6398

celo_rating: 1282.68

family_friendly_score: 0.587

family_friendly_standard_error: 0.0069632032858448125

submission_type: function

display_name: retune_with_base

is_internal_developer: True

ranking_group: single

us_pacific_date: 2024-11-19

win_ratio: 0.5379182781234235

generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}

formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}

Resubmit model

Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.580859661102295s
Received healthy response to inference request in 3.1314399242401123s
Received healthy response to inference request in 3.791964292526245s
Received healthy response to inference request in 2.691246509552002s
Received healthy response to inference request in 4.3207080364227295s
5 requests
0 failed requests
5th percentile: 2.779285192489624
10th percentile: 2.867323875427246
20th percentile: 3.0434012413024902
30th percentile: 3.263544797897339
40th percentile: 3.527754545211792
50th percentile: 3.791964292526245
60th percentile: 4.003461790084839
70th percentile: 4.214959287643433
80th percentile: 4.372738361358643
90th percentile: 4.476799011230469
95th percentile: 4.528829336166382
99th percentile: 4.5704535961151125
mean time: 3.703243684768677
%s, retrying in %s seconds...
Received healthy response to inference request in 3.5115277767181396s
Received healthy response to inference request in 2.501044750213623s
Received healthy response to inference request in 3.243112564086914s
Received healthy response to inference request in 3.2400031089782715s
Received healthy response to inference request in 2.4836971759796143s
5 requests
0 failed requests
5th percentile: 2.487166690826416
10th percentile: 2.490636205673218
20th percentile: 2.4975752353668215
30th percentile: 2.6488364219665526
40th percentile: 2.944419765472412
50th percentile: 3.2400031089782715
60th percentile: 3.2412468910217287
70th percentile: 3.2424906730651855
80th percentile: 3.296795606613159
90th percentile: 3.4041616916656494
95th percentile: 3.4578447341918945
99th percentile: 3.5007911682128907
mean time: 2.9958770751953123
Pipeline stage StressChecker completed in 36.19s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 5.64s
Shutdown handler de-registered
function_sabil_2024-11-20 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3388.93s
Shutdown handler de-registered
function_sabil_2024-11-20 status is now inactive due to auto deactivation removed underperforming models