developer_uid: valentin
submission_id: function_mutef_2024-11-26
model_name: test_function
model_group:
status: torndown
timestamp: 2024-11-26T15:19:45+00:00
num_battles: 21023
num_wins: 666
celo_rating: 669.72
family_friendly_score: 0.6546000000000001
family_friendly_standard_error: 0.006724564521216225
submission_type: function
display_name: test_function
is_internal_developer: False
ranking_group: single
us_pacific_date: 2024-11-26
win_ratio: 0.03167958902154783
generation_params: {'temperature': 1.5, 'top_p': 0.99, 'min_p': 0.1, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<|end▁of▁sentence|>'], 'max_input_tokens': 512, 'best_of': 1, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 0.19095468521118164s
Received healthy response to inference request in 0.2472398281097412s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 1.1031718254089355s
Received healthy response to inference request in 0.313723087310791s
Received healthy response to inference request in 0.3033609390258789s
5 requests
0 failed requests
5th percentile: 0.20221171379089356
10th percentile: 0.21346874237060548
20th percentile: 0.2359827995300293
30th percentile: 0.25846405029296876
40th percentile: 0.2809124946594238
50th percentile: 0.3033609390258789
60th percentile: 0.30750579833984376
70th percentile: 0.3116506576538086
80th percentile: 0.47161283493042006
90th percentile: 0.7873923301696778
95th percentile: 0.9452820777893065
99th percentile: 1.0715938758850097
mean time: 0.43169007301330564
Pipeline stage StressChecker completed in 3.36s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.71s
Shutdown handler de-registered
function_mutef_2024-11-26 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 1111.64s
Shutdown handler de-registered
function_mutef_2024-11-26 status is now inactive due to auto deactivation removed underperforming models
function_mutef_2024-11-26 status is now torndown due to DeploymentManager action
function_mutef_2024-11-26 status is now torndown due to DeploymentManager action
function_mutef_2024-11-26 status is now torndown due to DeploymentManager action