developer_uid: jxlu90
submission_id: function_lider_2024-12-14
model_name: avian
model_group:
status: inactive
timestamp: 2024-12-14T03:15:47+00:00
num_battles: 7159
num_wins: 3368
celo_rating: 1241.13
family_friendly_score: 0.5791999999999999
family_friendly_standard_error: 0.006981795757539746
submission_type: function
display_name: avian
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-12-13
win_ratio: 0.47045676770498673
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 100, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>', '####', 'Bot:', 'User:', 'You:', '<|im_end|>', '<|eot_id|>'], 'max_input_tokens': 1024, 'best_of': 4, 'max_output_tokens': 68}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
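The `formatter` templates above suggest how a conversation is flattened into a single prompt string. The following is a minimal sketch of that assembly, not the platform's actual code: the function name `build_prompt`, the `turns` structure, and the order in which the sections are concatenated are all assumptions, and truncation to `max_input_tokens` is deliberately left out.

```python
# Template strings copied from the `formatter` field of this record.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates into one prompt string.

    turns is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The section ordering is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The prompt ends with the bot's name so the model continues as the bot.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt("Ava", "Sam", "kind", "A quiet cafe.", [("user", "hi")])
print(example)
```

Under this reading, generation would stop at the first entry of `stopping_words` (for example `'\n'` or `'User:'`), which keeps each reply to a single bot turn.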
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.3997180461883545s
Received healthy response to inference request in 3.452566385269165s
Received healthy response to inference request in 5.595942974090576s
Received healthy response to inference request in 3.5103461742401123s
Received healthy response to inference request in 3.6747913360595703s
5 requests
0 failed requests
5th percentile: 3.4641223430633543
10th percentile: 3.475678300857544
20th percentile: 3.498790216445923
30th percentile: 3.543235206604004
40th percentile: 3.6090132713317873
50th percentile: 3.6747913360595703
60th percentile: 3.964762020111084
70th percentile: 4.254732704162597
80th percentile: 4.638963031768799
90th percentile: 5.117453002929688
95th percentile: 5.356697988510132
99th percentile: 5.548093976974488
mean time: 4.126672983169556
%s, retrying in %s seconds...
Received healthy response to inference request in 3.0314433574676514s
Received healthy response to inference request in 3.4272923469543457s
Received healthy response to inference request in 3.7739334106445312s
Received healthy response to inference request in 6.888041019439697s
Received healthy response to inference request in 3.7238569259643555s
5 requests
0 failed requests
5th percentile: 3.11061315536499
10th percentile: 3.189782953262329
20th percentile: 3.348122549057007
30th percentile: 3.4866052627563477
40th percentile: 3.6052310943603514
50th percentile: 3.7238569259643555
60th percentile: 3.743887519836426
70th percentile: 3.7639181137084963
80th percentile: 4.396754932403565
90th percentile: 5.642397975921631
95th percentile: 6.265219497680664
99th percentile: 6.763476715087891
mean time: 4.168913412094116
%s, retrying in %s seconds...
Received healthy response to inference request in 3.181544542312622s
Received healthy response to inference request in 2.905632734298706s
Received healthy response to inference request in 3.1809868812561035s
Received healthy response to inference request in 3.0497400760650635s
Received healthy response to inference request in 2.392549514770508s
5 requests
0 failed requests
5th percentile: 2.4951661586761475
10th percentile: 2.5977828025817873
20th percentile: 2.8030160903930663
30th percentile: 2.9344542026519775
40th percentile: 2.9920971393585205
50th percentile: 3.0497400760650635
60th percentile: 3.1022387981414794
70th percentile: 3.1547375202178953
80th percentile: 3.1810984134674074
90th percentile: 3.1813214778900147
95th percentile: 3.181433010101318
99th percentile: 3.181522235870361
mean time: 2.9420907497406006
Pipeline stage StressChecker completed in 59.89s
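The percentile figures reported by StressChecker are consistent with linear interpolation between the sorted latency samples, which is also the default behaviour of `numpy.percentile`. That the pipeline computes them this way is an inference from the numbers, not something the log states. The sketch below reproduces the first batch of five requests in plain Python; the helper name `percentile` is hypothetical.

```python
import math


def percentile(values, q):
    """Percentile q (0-100) using linear interpolation between sorted samples."""
    xs = sorted(values)
    pos = (len(xs) - 1) * q / 100.0  # fractional index into the sorted list
    lo = math.floor(pos)
    hi = math.ceil(pos)
    return xs[lo] + (xs[hi] - xs[lo]) * (pos - lo)


# Response times (seconds) of the first five stress-check requests in the log.
latencies = [
    4.3997180461883545,
    3.452566385269165,
    5.595942974090576,
    3.5103461742401123,
    3.6747913360595703,
]

for q in (5, 50, 90):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples per batch, the upper percentiles are dominated by the single slowest request, so the 95th and 99th percentile values say little about tail latency.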
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.26s
Shutdown handler de-registered
function_lider_2024-12-14 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 1624.34s
Shutdown handler de-registered
function_lider_2024-12-14 status is now inactive due to auto deactivation removed underperforming models
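Two of the metadata fields at the top of this record can be re-derived from others, which is a quick sanity check on the record. `win_ratio` is `num_wins / num_battles`. The `family_friendly_standard_error` matches a binomial standard error, sqrt(p(1 - p) / n), if the score was estimated from n = 5000 samples; that sample size is an assumption inferred from the numbers and does not appear anywhere in the log.

```python
import math

# Values copied from the metadata fields of this record.
num_battles = 7159
num_wins = 3368
family_friendly_score = 0.5791999999999999

# win_ratio is the fraction of battles won.
win_ratio = num_wins / num_battles

# ASSUMPTION: the score is a proportion estimated from 5000 samples.
# The log does not state the sample size.
assumed_samples = 5000
standard_error = math.sqrt(
    family_friendly_score * (1 - family_friendly_score) / assumed_samples
)

print(f"win_ratio: {win_ratio}")
print(f"family_friendly_standard_error: {standard_error}")
```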