submission_id: nousresearch-meta-llama_4939_v76
developer_uid: chai_backend_admin
best_of: 8
celo_rating: 1220.84
display_name: nousresearch-meta-llama_4939_v76
family_friendly_score: 0.557388777107087
family_friendly_standard_error: 0.005718155367458714
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
language_model: NousResearch/Meta-Llama-3.1-8B-Instruct
max_input_tokens: 1024
max_output_tokens: 64
model_architecture: LlamaForCausalLM
model_group: NousResearch/Meta-Llama-
model_name: nousresearch-meta-llama_4939_v76
model_num_parameters: 8030261248.0
model_repo: NousResearch/Meta-Llama-3.1-8B-Instruct
model_size: 8B
num_battles: 7916
num_wins: 4022
ranking_group: single
status: inactive
submission_type: basic
timestamp: 2024-10-16T06:23:54+00:00
us_pacific_date: 2024-10-15
win_ratio: 0.5080848913592724
Download Preference Data
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name nousresearch-meta-llama-4939-v76-mkmlizer
Waiting for job on nousresearch-meta-llama-4939-v76-mkmlizer to finish
nousresearch-meta-llama-4939-v76-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
nousresearch-meta-llama-4939-v76-mkmlizer: ║ _____ __ __ ║
nousresearch-meta-llama-4939-v76-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
nousresearch-meta-llama-4939-v76-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
nousresearch-meta-llama-4939-v76-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
nousresearch-meta-llama-4939-v76-mkmlizer: ║ /___/ ║
nousresearch-meta-llama-4939-v76-mkmlizer: ║ ║
nousresearch-meta-llama-4939-v76-mkmlizer: ║ Version: 0.11.12 ║
nousresearch-meta-llama-4939-v76-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
nousresearch-meta-llama-4939-v76-mkmlizer: ║ https://mk1.ai ║
nousresearch-meta-llama-4939-v76-mkmlizer: ║ ║
nousresearch-meta-llama-4939-v76-mkmlizer: ║ The license key for the current software has been verified as ║
nousresearch-meta-llama-4939-v76-mkmlizer: ║ belonging to: ║
nousresearch-meta-llama-4939-v76-mkmlizer: ║ ║
nousresearch-meta-llama-4939-v76-mkmlizer: ║ Chai Research Corp. ║
nousresearch-meta-llama-4939-v76-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
nousresearch-meta-llama-4939-v76-mkmlizer: ║ Expiration: 2025-01-15 23:59:59 ║
nousresearch-meta-llama-4939-v76-mkmlizer: ║ ║
nousresearch-meta-llama-4939-v76-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
nousresearch-meta-llama-4939-v76-mkmlizer: Downloaded to shared memory in 40.743s
nousresearch-meta-llama-4939-v76-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp2hxw7_di, device:0
nousresearch-meta-llama-4939-v76-mkmlizer: Saving flywheel model at /dev/shm/model_cache
nousresearch-meta-llama-4939-v76-mkmlizer: quantized model in 26.357s
nousresearch-meta-llama-4939-v76-mkmlizer: Processed model NousResearch/Meta-Llama-3.1-8B-Instruct in 67.100s
nousresearch-meta-llama-4939-v76-mkmlizer: creating bucket guanaco-mkml-models
nousresearch-meta-llama-4939-v76-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
nousresearch-meta-llama-4939-v76-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/nousresearch-meta-llama-4939-v76
nousresearch-meta-llama-4939-v76-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/nousresearch-meta-llama-4939-v76/config.json
nousresearch-meta-llama-4939-v76-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/nousresearch-meta-llama-4939-v76/special_tokens_map.json
nousresearch-meta-llama-4939-v76-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/nousresearch-meta-llama-4939-v76/tokenizer_config.json
nousresearch-meta-llama-4939-v76-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/nousresearch-meta-llama-4939-v76/tokenizer.json
nousresearch-meta-llama-4939-v76-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/nousresearch-meta-llama-4939-v76/flywheel_model.0.safetensors
nousresearch-meta-llama-4939-v76-mkmlizer: Loading 0: 0%| | 0/291 [00:00<?, ?it/s] Loading 0: 2%|▏ | 5/291 [00:00<00:08, 34.37it/s] Loading 0: 4%|▍ | 13/291 [00:00<00:04, 56.45it/s] Loading 0: 7%|▋ | 20/291 [00:00<00:05, 52.95it/s] Loading 0: 9%|▉ | 26/291 [00:00<00:05, 50.34it/s] Loading 0: 11%|█ | 32/291 [00:00<00:06, 42.81it/s] Loading 0: 14%|█▎ | 40/291 [00:00<00:04, 51.40it/s] Loading 0: 16%|█▌ | 46/291 [00:00<00:05, 47.35it/s] Loading 0: 18%|█▊ | 52/291 [00:01<00:04, 49.21it/s] Loading 0: 20%|█▉ | 58/291 [00:01<00:04, 51.61it/s] Loading 0: 22%|██▏ | 64/291 [00:01<00:04, 47.34it/s] Loading 0: 24%|██▎ | 69/291 [00:01<00:04, 46.59it/s] Loading 0: 26%|██▌ | 76/291 [00:01<00:04, 50.67it/s] Loading 0: 28%|██▊ | 82/291 [00:01<00:04, 46.75it/s] Loading 0: 30%|██▉ | 87/291 [00:02<00:06, 31.06it/s] Loading 0: 32%|███▏ | 94/291 [00:02<00:05, 37.71it/s] Loading 0: 34%|███▍ | 100/291 [00:02<00:04, 38.61it/s] Loading 0: 36%|███▌ | 105/291 [00:02<00:04, 39.93it/s] Loading 0: 38%|███▊ | 112/291 [00:02<00:03, 45.61it/s] Loading 0: 41%|████ | 118/291 [00:02<00:03, 43.80it/s] Loading 0: 42%|████▏ | 123/291 [00:02<00:03, 43.76it/s] Loading 0: 45%|████▍ | 130/291 [00:02<00:03, 48.89it/s] Loading 0: 47%|████▋ | 136/291 [00:03<00:03, 45.41it/s] Loading 0: 48%|████▊ | 141/291 [00:03<00:03, 43.71it/s] Loading 0: 51%|█████ | 148/291 [00:03<00:02, 49.09it/s] Loading 0: 53%|█████▎ | 154/291 [00:03<00:02, 46.38it/s] Loading 0: 55%|█████▍ | 159/291 [00:03<00:02, 45.79it/s] Loading 0: 57%|█████▋ | 166/291 [00:03<00:02, 50.19it/s] Loading 0: 59%|█████▉ | 172/291 [00:03<00:02, 46.41it/s] Loading 0: 62%|██████▏ | 179/291 [00:03<00:02, 50.02it/s] Loading 0: 64%|██████▎ | 185/291 [00:04<00:02, 51.60it/s] Loading 0: 66%|██████▌ | 191/291 [00:04<00:03, 32.75it/s] Loading 0: 67%|██████▋ | 196/291 [00:04<00:02, 34.95it/s] Loading 0: 69%|██████▉ | 202/291 [00:04<00:02, 39.68it/s] Loading 0: 71%|███████▏ | 208/291 [00:04<00:02, 40.12it/s] Loading 0: 73%|███████▎ | 213/291 [00:04<00:01, 41.50it/s] Loading 0: 76%|███████▌ | 220/291 [00:04<00:01, 47.45it/s] Loading 0: 78%|███████▊ | 226/291 [00:05<00:01, 45.54it/s] Loading 0: 79%|███████▉ | 231/291 [00:05<00:01, 45.16it/s] Loading 0: 82%|████████▏ | 238/291 [00:05<00:01, 50.13it/s] Loading 0: 84%|████████▍ | 244/291 [00:05<00:00, 47.13it/s] Loading 0: 86%|████████▌ | 249/291 [00:05<00:00, 45.04it/s] Loading 0: 88%|████████▊ | 255/291 [00:05<00:00, 47.16it/s] Loading 0: 89%|████████▉ | 260/291 [00:05<00:00, 45.78it/s] Loading 0: 91%|█████████ | 265/291 [00:05<00:00, 45.63it/s] Loading 0: 93%|█████████▎| 271/291 [00:06<00:00, 43.61it/s] Loading 0: 95%|█████████▍| 276/291 [00:06<00:00, 43.20it/s] Loading 0: 97%|█████████▋| 281/291 [00:06<00:00, 44.70it/s] Loading 0: 98%|█████████▊| 286/291 [00:06<00:00, 38.88it/s] Loading 0: 100%|██████████| 291/291 [00:11<00:00, 3.02it/s]
Job nousresearch-meta-llama-4939-v76-mkmlizer completed after 94.08s with status: succeeded
Stopping job with name nousresearch-meta-llama-4939-v76-mkmlizer
Pipeline stage MKMLizer completed in 94.59s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service nousresearch-meta-llama-4939-v76
Waiting for inference service nousresearch-meta-llama-4939-v76 to be ready
Inference service nousresearch-meta-llama-4939-v76 ready after 170.60794258117676s
Pipeline stage MKMLDeployer completed in 171.18s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.4102578163146973s
Received healthy response to inference request in 1.4341247081756592s
Received healthy response to inference request in 1.2105457782745361s
Received healthy response to inference request in 1.3075251579284668s
Received healthy response to inference request in 1.2061290740966797s
5 requests
0 failed requests
5th percentile: 1.207012414932251
10th percentile: 1.2078957557678223
20th percentile: 1.2096624374389648
30th percentile: 1.2299416542053223
40th percentile: 1.2687334060668944
50th percentile: 1.3075251579284668
60th percentile: 1.348618221282959
70th percentile: 1.389711284637451
80th percentile: 1.4150311946868896
90th percentile: 1.4245779514312744
95th percentile: 1.4293513298034668
99th percentile: 1.4331700325012207
mean time: 1.313716506958008
Pipeline stage StressChecker completed in 8.26s
Shutdown handler de-registered
nousresearch-meta-llama_4939_v76 status is now deployed due to DeploymentManager action
nousresearch-meta-llama_4939_v76 status is now inactive due to auto deactivation removed underperforming models