developer_uid: azuruce
submission_id: function_nenim_2024-10-28
model_name: select-by-reward
model_group:
status: torndown
timestamp: 2024-10-28T07:08:54+00:00
num_battles: 9586
num_wins: 5261
celo_rating: 1262.87
family_friendly_score: 0.5873999999999999
family_friendly_standard_error: 0.006962201376001702
submission_type: function
display_name: select-by-reward
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-10-28
win_ratio: 0.5488211975798039
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.4605612754821777s
Received healthy response to inference request in 1.981567144393921s
Received healthy response to inference request in 1.856384038925171s
Received healthy response to inference request in 2.0816268920898438s
Received healthy response to inference request in 1.92765212059021s
5 requests
0 failed requests
5th percentile: 1.8706376552581787
10th percentile: 1.8848912715911865
20th percentile: 1.9133985042572021
30th percentile: 1.9384351253509522
40th percentile: 1.9600011348724364
50th percentile: 1.981567144393921
60th percentile: 2.02159104347229
70th percentile: 2.0616149425506594
80th percentile: 2.157413768768311
90th percentile: 2.308987522125244
95th percentile: 2.3847743988037107
99th percentile: 2.4454039001464842
mean time: 2.0615582942962645
Pipeline stage StressChecker completed in 12.26s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 6.92s
Shutdown handler de-registered
function_nenim_2024-10-28 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2328.73s
Shutdown handler de-registered
function_nenim_2024-10-28 status is now inactive due to auto deactivation removed underperforming models
function_nenim_2024-10-28 status is now torndown due to DeploymentManager action