function_dohif_2024-12-18

developer_uid: jxlu90

submission_id: function_dohif_2024-12-18

model_name: run3_v65_1k

model_group:

status: torndown

timestamp: 2024-12-18T18:16:49+00:00

num_battles: 6498

num_wins: 3378

celo_rating: 1271.8

family_friendly_score: 0.0

family_friendly_standard_error: 0.0

submission_type: function

display_name: run3_v65_1k

is_internal_developer: False

ranking_group: single

us_pacific_date: 2024-12-18

win_ratio: 0.5198522622345337

generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 100, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', 'You:', '</s>'], 'max_input_tokens': 2048, 'best_of': 4, 'max_output_tokens': 64}

formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}

Resubmit model

Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.741074800491333s
Received healthy response to inference request in 4.5900022983551025s
Received healthy response to inference request in 1.8564424514770508s
Received healthy response to inference request in 6.667695045471191s
Received healthy response to inference request in 4.4397876262664795s
5 requests
0 failed requests
5th percentile: 2.233368921279907
10th percentile: 2.6102953910827638
20th percentile: 3.3641483306884767
30th percentile: 3.8808173656463625
40th percentile: 4.1603024959564205
50th percentile: 4.4397876262664795
60th percentile: 4.499873495101928
70th percentile: 4.559959363937378
80th percentile: 5.00554084777832
90th percentile: 5.836617946624756
95th percentile: 6.252156496047974
99th percentile: 6.584587335586548
mean time: 4.259000444412232
%s, retrying in %s seconds...
Received healthy response to inference request in 3.7430479526519775s
Received healthy response to inference request in 3.731870412826538s
Received healthy response to inference request in 3.33500599861145s
Received healthy response to inference request in 3.4705302715301514s
Received healthy response to inference request in 3.3465590476989746s
5 requests
0 failed requests
5th percentile: 3.337316608428955
10th percentile: 3.33962721824646
20th percentile: 3.3442484378814696
30th percentile: 3.37135329246521
40th percentile: 3.420941781997681
50th percentile: 3.4705302715301514
60th percentile: 3.575066328048706
70th percentile: 3.6796023845672607
80th percentile: 3.734105920791626
90th percentile: 3.7385769367218016
95th percentile: 3.7408124446868896
99th percentile: 3.74260085105896
mean time: 3.5254027366638185
Pipeline stage StressChecker completed in 41.28s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.40s
Shutdown handler de-registered
function_dohif_2024-12-18 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Received signal 15, running shutdown handler
Shutdown handler de-registered
function_dohif_2024-12-18 status is now inactive due to auto deactivation removed underperforming models
function_dohif_2024-12-18 status is now torndown due to DeploymentManager action
function_dohif_2024-12-18 status is now torndown due to DeploymentManager action
function_dohif_2024-12-18 status is now torndown due to DeploymentManager action