submission_id: nousresearch-meta-llama_4939_v36
developer_uid: end_to_end_test
best_of: 4
display_name: nousresearch-meta-llama_4939_v36
family_friendly_score: 0.0
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 1.0, 'top_p': 0.99, 'min_p': 0.1, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 512, 'best_of': 4, 'max_output_tokens': 64}
ineligible_reason: model is only for e2e test
is_internal_developer: True
language_model: NousResearch/Meta-Llama-3.1-8B-Instruct
max_input_tokens: 512
max_output_tokens: 64
model_architecture: LlamaForCausalLM
model_group: NousResearch/Meta-Llama-
model_name: nousresearch-meta-llama_4939_v36
model_num_parameters: 8030261248.0
model_repo: NousResearch/Meta-Llama-3.1-8B-Instruct
model_size: 8B
num_battles: 23
num_wins: 8
ranking_group: single
status: torndown
submission_type: basic
timestamp: 2024-08-30T18:25:58+00:00
us_pacific_date: 2024-08-30
win_ratio: 0.34782608695652173
Download Preference Data
Resubmit model
Running pipeline stage MKMLizer
Starting job with name nousresearch-meta-llama-4939-v36-mkmlizer
Waiting for job on nousresearch-meta-llama-4939-v36-mkmlizer to finish
nousresearch-meta-llama-4939-v36-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
nousresearch-meta-llama-4939-v36-mkmlizer: ║ _____ __ __ ║
nousresearch-meta-llama-4939-v36-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
nousresearch-meta-llama-4939-v36-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
nousresearch-meta-llama-4939-v36-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
nousresearch-meta-llama-4939-v36-mkmlizer: ║ /___/ ║
nousresearch-meta-llama-4939-v36-mkmlizer: ║ ║
nousresearch-meta-llama-4939-v36-mkmlizer: ║ Version: 0.10.1 ║
nousresearch-meta-llama-4939-v36-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
nousresearch-meta-llama-4939-v36-mkmlizer: ║ https://mk1.ai ║
nousresearch-meta-llama-4939-v36-mkmlizer: ║ ║
nousresearch-meta-llama-4939-v36-mkmlizer: ║ The license key for the current software has been verified as ║
nousresearch-meta-llama-4939-v36-mkmlizer: ║ belonging to: ║
nousresearch-meta-llama-4939-v36-mkmlizer: ║ ║
nousresearch-meta-llama-4939-v36-mkmlizer: ║ Chai Research Corp. ║
nousresearch-meta-llama-4939-v36-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
nousresearch-meta-llama-4939-v36-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
nousresearch-meta-llama-4939-v36-mkmlizer: ║ ║
nousresearch-meta-llama-4939-v36-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
nousresearch-meta-llama-4939-v36-mkmlizer: Downloaded to shared memory in 50.921s
nousresearch-meta-llama-4939-v36-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpndep6t4_, device:0
nousresearch-meta-llama-4939-v36-mkmlizer: Saving flywheel model at /dev/shm/model_cache
nousresearch-meta-llama-4939-v36-mkmlizer: quantized model in 26.393s
nousresearch-meta-llama-4939-v36-mkmlizer: Processed model NousResearch/Meta-Llama-3.1-8B-Instruct in 77.315s
nousresearch-meta-llama-4939-v36-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
nousresearch-meta-llama-4939-v36-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/nousresearch-meta-llama-4939-v36
nousresearch-meta-llama-4939-v36-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/nousresearch-meta-llama-4939-v36/config.json
nousresearch-meta-llama-4939-v36-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/nousresearch-meta-llama-4939-v36/special_tokens_map.json
nousresearch-meta-llama-4939-v36-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/nousresearch-meta-llama-4939-v36/tokenizer_config.json
nousresearch-meta-llama-4939-v36-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/nousresearch-meta-llama-4939-v36/tokenizer.json
nousresearch-meta-llama-4939-v36-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/nousresearch-meta-llama-4939-v36/flywheel_model.0.safetensors
nousresearch-meta-llama-4939-v36-mkmlizer: Loading 0: 0%| | 0/291 [00:00<?, ?it/s] Loading 0: 2%|▏ | 5/291 [00:00<00:08, 34.25it/s] Loading 0: 4%|▍ | 13/291 [00:00<00:04, 55.83it/s] Loading 0: 7%|▋ | 19/291 [00:00<00:05, 46.49it/s] Loading 0: 8%|▊ | 24/291 [00:00<00:05, 44.89it/s] Loading 0: 11%|█ | 31/291 [00:00<00:04, 52.20it/s] Loading 0: 13%|█▎ | 37/291 [00:00<00:05, 47.04it/s] Loading 0: 14%|█▍ | 42/291 [00:00<00:05, 44.86it/s] Loading 0: 17%|█▋ | 49/291 [00:01<00:04, 49.83it/s] Loading 0: 19%|█▉ | 55/291 [00:01<00:05, 46.46it/s] Loading 0: 21%|██ | 60/291 [00:01<00:05, 45.94it/s] Loading 0: 23%|██▎ | 67/291 [00:01<00:04, 51.31it/s] Loading 0: 25%|██▌ | 73/291 [00:01<00:04, 47.25it/s] Loading 0: 27%|██▋ | 78/291 [00:01<00:04, 46.01it/s] Loading 0: 29%|██▊ | 83/291 [00:01<00:06, 33.02it/s] Loading 0: 30%|██▉ | 87/291 [00:02<00:06, 32.68it/s] Loading 0: 33%|███▎ | 95/291 [00:02<00:05, 37.62it/s] Loading 0: 36%|███▌ | 104/291 [00:02<00:04, 43.05it/s] Loading 0: 38%|███▊ | 112/291 [00:02<00:03, 50.14it/s] Loading 0: 41%|████ | 118/291 [00:02<00:03, 49.48it/s] Loading 0: 43%|████▎ | 124/291 [00:02<00:03, 50.91it/s] Loading 0: 45%|████▍ | 130/291 [00:02<00:03, 52.93it/s] Loading 0: 47%|████▋ | 136/291 [00:02<00:03, 50.71it/s] Loading 0: 49%|████▉ | 142/291 [00:03<00:03, 48.56it/s] Loading 0: 51%|█████ | 147/291 [00:03<00:03, 47.44it/s] Loading 0: 52%|█████▏ | 152/291 [00:03<00:03, 45.10it/s] Loading 0: 54%|█████▍ | 157/291 [00:03<00:02, 45.28it/s] Loading 0: 56%|█████▌ | 163/291 [00:03<00:02, 44.39it/s] Loading 0: 58%|█████▊ | 168/291 [00:03<00:02, 44.08it/s] Loading 0: 60%|██████ | 175/291 [00:03<00:02, 50.51it/s] Loading 0: 62%|██████▏ | 181/291 [00:03<00:02, 42.93it/s] Loading 0: 64%|██████▍ | 187/291 [00:04<00:02, 35.92it/s] Loading 0: 66%|██████▌ | 191/291 [00:04<00:02, 36.65it/s] Loading 0: 67%|██████▋ | 195/291 [00:04<00:02, 36.09it/s] Loading 0: 69%|██████▉ | 202/291 [00:04<00:02, 43.84it/s] Loading 0: 71%|███████▏ | 208/291 [00:04<00:01, 44.99it/s] Loading 0: 73%|███████▎ | 213/291 [00:04<00:01, 44.27it/s] Loading 0: 76%|███████▌ | 220/291 [00:04<00:01, 49.29it/s] Loading 0: 78%|███████▊ | 226/291 [00:05<00:01, 46.09it/s] Loading 0: 79%|███████▉ | 231/291 [00:05<00:01, 46.34it/s] Loading 0: 82%|████████▏ | 238/291 [00:05<00:01, 50.74it/s] Loading 0: 84%|████████▍ | 244/291 [00:05<00:00, 49.81it/s] Loading 0: 86%|████████▌ | 250/291 [00:05<00:00, 51.32it/s] Loading 0: 88%|████████▊ | 257/291 [00:05<00:00, 48.24it/s] Loading 0: 91%|█████████ | 265/291 [00:05<00:00, 54.46it/s] Loading 0: 93%|█████████▎| 271/291 [00:05<00:00, 51.12it/s] Loading 0: 95%|█████████▌| 277/291 [00:06<00:00, 52.14it/s] Loading 0: 97%|█████████▋| 283/291 [00:06<00:00, 46.49it/s] Loading 0: 99%|█████████▉| 288/291 [00:11<00:00, 3.40it/s]
Job nousresearch-meta-llama-4939-v36-mkmlizer completed after 98.94s with status: succeeded
Stopping job with name nousresearch-meta-llama-4939-v36-mkmlizer
Pipeline stage MKMLizer completed in 100.01s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.28s
Running pipeline stage MKMLDeployer
Creating inference service nousresearch-meta-llama-4939-v36
Waiting for inference service nousresearch-meta-llama-4939-v36 to be ready
Inference service nousresearch-meta-llama-4939-v36 ready after 191.80632615089417s
Pipeline stage MKMLDeployer completed in 192.62s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.0911319255828857s
Received healthy response to inference request in 1.6290159225463867s
Received healthy response to inference request in 2.0026350021362305s
Received healthy response to inference request in 1.681640863418579s
Received healthy response to inference request in 1.3866620063781738s
5 requests
0 failed requests
5th percentile: 1.4351327896118165
10th percentile: 1.483603572845459
20th percentile: 1.580545139312744
30th percentile: 1.6395409107208252
40th percentile: 1.6605908870697021
50th percentile: 1.681640863418579
60th percentile: 1.8100385189056396
70th percentile: 1.9384361743927
80th percentile: 2.0203343868255614
90th percentile: 2.055733156204224
95th percentile: 2.0734325408935548
99th percentile: 2.0875920486450195
mean time: 1.7582171440124512
Pipeline stage StressChecker completed in 10.89s
Running pipeline stage TriggerMKMLProfilingPipeline
starting trigger_guanaco_pipeline %s
triggered trigger_guanaco_pipeline %s
Pipeline stage TriggerMKMLProfilingPipeline completed in 2.43s
nousresearch-meta-llama_4939_v36 status is now deployed due to DeploymentManager action
Running pipeline stage MKMLProfilerTemplater
Pipeline stage MKMLProfilerTemplater completed in 0.53s
Running pipeline stage MKMLProfilerDeployer
Pipeline stage %s skipped, reason=%s
Pipeline stage MKMLProfilerDeployer completed in 0.22s
Running pipeline stage MKMLProfilerRunner
script pods %s
Pipeline stage MKMLProfilerRunner completed in 0.93s
Running pipeline stage MKMLProfilerDeleter
Checking if service nousresearch-meta-llama-4939-v36-profiler is running
Tearing down inference service nousresearch-meta-llama-4939-v36-profiler
Service nousresearch-meta-llama-4939-v36-profiler has been torndown
Pipeline stage MKMLProfilerDeleter completed in 3.40s
nousresearch-meta-llama_4939_v36 status is now inactive due to admin request
nousresearch-meta-llama_4939_v36 status is now torndown due to DeploymentManager action