developer_uid: jxlu90
submission_id: function_sekib_2024-12-27
model_name: llama31_70B
model_group:
status: torndown
timestamp: 2024-12-27T17:20:32+00:00
num_battles: 5764
num_wins: 2813
celo_rating: 1256.34
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: llama31_70B
is_internal_developer: False
ranking_group: single
us_pacific_date: 2024-12-27
win_ratio: 0.48802914642609296
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 100, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['You:', '</s>', '\n'], 'max_input_tokens': 1024, 'best_of': 4, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
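The formatter entry above lists templates but not how they are combined. A minimal sketch of one plausible assembly order (persona, then scenario prompt, then chat turns, then the response prefix) is below. The `build_prompt` function, its arguments, and the example names are hypothetical and are not taken from the platform's code; the ordering itself is an assumption.

```python
# Hypothetical sketch: template strings copied from the formatter entry above.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Assemble a prompt string; turns is a list of (speaker, message)
    pairs where speaker is either "bot" or "user" (assumed convention)."""
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response prefix has no trailing newline, so the model continues the line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    bot_name="Ava",
    user_name="You",
    memory="A friendly guide.",
    prompt="Ava greets a visitor.",
    turns=[("user", "Hi"), ("bot", "Hello!"), ("user", "How are you?")],
)
print(example)
```

With '\n' among the stopping_words, a generation would end at the first line break, which fits one short reply per turn. Using "You" as the user name here is only a guess based on the 'You:' stopping word.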
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.220247268676758s
Received healthy response to inference request in 2.836103916168213s
Received healthy response to inference request in 1.9546465873718262s
Received healthy response to inference request in 4.082041501998901s
Received healthy response to inference request in 3.0751078128814697s
5 requests
0 failed requests
5th percentile: 2.1309380531311035
10th percentile: 2.307229518890381
20th percentile: 2.6598124504089355
30th percentile: 2.8839046955108643
40th percentile: 2.979506254196167
50th percentile: 3.0751078128814697
60th percentile: 3.133163595199585
70th percentile: 3.1912193775177
80th percentile: 3.3926061153411866
90th percentile: 3.737323808670044
95th percentile: 3.9096826553344726
99th percentile: 4.047569732666015
mean time: 3.0336294174194336
Pipeline stage StressChecker completed in 16.32s
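The StressChecker percentiles above are consistent with linear interpolation between the sorted response times. The sketch below reproduces them from the five logged latencies using only the standard library. The `percentile` helper is illustrative; the log does not say which routine the pipeline actually uses.

```python
import math

# Response times in seconds, copied from the five healthy responses logged above.
latencies = [
    3.220247268676758,
    2.836103916168213,
    1.9546465873718262,
    4.082041501998901,
    3.0751078128814697,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation
    between the two nearest ranks of the sorted values."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


mean_time = sum(latencies) / len(latencies)

for q in (5, 50, 90, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only five samples, the high percentiles are dominated by the single slowest request (about 4.08 s), so they are a rough indication rather than a stable tail-latency estimate.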
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.64s
Shutdown handler de-registered
function_sekib_2024-12-27 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Received signal 15, running shutdown handler
Shutdown handler de-registered
function_sekib_2024-12-27 status is now torndown due to DeploymentManager action