developer_uid: zonemercy
submission_id: nousresearch-meta-llama_4941_v94
model_name: nousresearch-meta-llama_4941_v94
model_group: NousResearch/Meta-Llama-
status: torndown
timestamp: 2024-07-29T19:42:42+00:00
num_battles: 10998
num_wins: 4301
celo_rating: 1103.69
family_friendly_score: 0.0
submission_type: basic
model_repo: NousResearch/Meta-Llama-3-8B-Instruct
model_architecture: LlamaForCausalLM
reward_repo: ChaiML/gpt2_xl_pairwise_89m_step_347634
model_num_parameters: 8030261248.0
best_of: 1
max_input_tokens: 512
max_output_tokens: 64
display_name: nousresearch-meta-llama_4941_v94
is_internal_developer: True
language_model: NousResearch/Meta-Llama-3-8B-Instruct
model_size: 8B
ranking_group: single
us_pacific_date: 2024-07-29
win_ratio: 0.3910711038370613
generation_params: {'temperature': 0.95, 'top_p': 1.0, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<|eot_id|>'], 'max_input_tokens': 512, 'best_of': 1, 'max_output_tokens': 64, 'reward_max_token_input': 256}
formatter: {'memory_template': "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n{bot_name}'s Persona: {memory}\n\n", 'prompt_template': '{prompt}<|eot_id|>', 'bot_template': '<|start_header_id|>assistant<|end_header_id|>\n\n{bot_name}: {message}<|eot_id|>', 'user_template': '<|start_header_id|>user<|end_header_id|>\n\n{user_name}: {message}<|eot_id|>', 'response_template': '<|start_header_id|>assistant<|end_header_id|>\n\n{bot_name}:', 'truncate_by_message': False}
reward_formatter: {'bot_template': '{bot_name}: {message}\n', 'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'response_template': '{bot_name}:', 'truncate_by_message': False, 'user_template': '{user_name}: {message}\n'}
Resubmit model
Running pipeline stage MKMLizer
Starting job with name nousresearch-meta-llama-4941-v94-mkmlizer
Waiting for job on nousresearch-meta-llama-4941-v94-mkmlizer to finish
nousresearch-meta-llama-4941-v94-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
nousresearch-meta-llama-4941-v94-mkmlizer: ║ _____ __ __ ║
nousresearch-meta-llama-4941-v94-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
nousresearch-meta-llama-4941-v94-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
nousresearch-meta-llama-4941-v94-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
nousresearch-meta-llama-4941-v94-mkmlizer: ║ /___/ ║
nousresearch-meta-llama-4941-v94-mkmlizer: ║ ║
nousresearch-meta-llama-4941-v94-mkmlizer: ║ Version: 0.9.7 ║
nousresearch-meta-llama-4941-v94-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
nousresearch-meta-llama-4941-v94-mkmlizer: ║ https://mk1.ai ║
nousresearch-meta-llama-4941-v94-mkmlizer: ║ ║
nousresearch-meta-llama-4941-v94-mkmlizer: ║ The license key for the current software has been verified as ║
nousresearch-meta-llama-4941-v94-mkmlizer: ║ belonging to: ║
nousresearch-meta-llama-4941-v94-mkmlizer: ║ ║
nousresearch-meta-llama-4941-v94-mkmlizer: ║ Chai Research Corp. ║
nousresearch-meta-llama-4941-v94-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
nousresearch-meta-llama-4941-v94-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
nousresearch-meta-llama-4941-v94-mkmlizer: ║ ║
nousresearch-meta-llama-4941-v94-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
nousresearch-meta-llama-4941-v94-mkmlizer: Downloaded to shared memory in 22.656s
nousresearch-meta-llama-4941-v94-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpf0c9bi7q, device:0
nousresearch-meta-llama-4941-v94-mkmlizer: Saving flywheel model at /dev/shm/model_cache
nousresearch-meta-llama-4941-v94-mkmlizer: quantized model in 26.548s
nousresearch-meta-llama-4941-v94-mkmlizer: Processed model NousResearch/Meta-Llama-3-8B-Instruct in 49.204s
nousresearch-meta-llama-4941-v94-mkmlizer: creating bucket guanaco-mkml-models
nousresearch-meta-llama-4941-v94-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
nousresearch-meta-llama-4941-v94-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/nousresearch-meta-llama-4941-v94
nousresearch-meta-llama-4941-v94-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/nousresearch-meta-llama-4941-v94/config.json
nousresearch-meta-llama-4941-v94-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/nousresearch-meta-llama-4941-v94/special_tokens_map.json
nousresearch-meta-llama-4941-v94-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/nousresearch-meta-llama-4941-v94/tokenizer_config.json
nousresearch-meta-llama-4941-v94-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/nousresearch-meta-llama-4941-v94/tokenizer.json
nousresearch-meta-llama-4941-v94-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/nousresearch-meta-llama-4941-v94/flywheel_model.0.safetensors
nousresearch-meta-llama-4941-v94-mkmlizer: loading reward model from ChaiML/gpt2_xl_pairwise_89m_step_347634
nousresearch-meta-llama-4941-v94-mkmlizer: Loading 0: 0%| | 0/291 [00:00<?, ?it/s] Loading 0: 2%|▏ | 5/291 [00:00<00:08, 35.23it/s] Loading 0: 4%|▍ | 13/291 [00:00<00:04, 55.91it/s] Loading 0: 7%|▋ | 19/291 [00:00<00:05, 48.84it/s] Loading 0: 9%|▊ | 25/291 [00:00<00:05, 50.58it/s] Loading 0: 11%|█ | 31/291 [00:00<00:05, 51.96it/s] Loading 0: 13%|█▎ | 37/291 [00:00<00:05, 47.52it/s] Loading 0: 14%|█▍ | 42/291 [00:00<00:05, 46.53it/s] Loading 0: 17%|█▋ | 49/291 [00:00<00:04, 51.33it/s] Loading 0: 19%|█▉ | 55/291 [00:01<00:04, 47.82it/s] Loading 0: 21%|██ | 60/291 [00:01<00:04, 46.96it/s] Loading 0: 23%|██▎ | 67/291 [00:01<00:04, 51.18it/s] Loading 0: 25%|██▌ | 73/291 [00:01<00:04, 46.63it/s] Loading 0: 27%|██▋ | 78/291 [00:01<00:04, 45.76it/s] Loading 0: 29%|██▊ | 83/291 [00:01<00:06, 31.50it/s] Loading 0: 30%|██▉ | 87/291 [00:02<00:06, 31.14it/s] Loading 0: 32%|███▏ | 94/291 [00:02<00:05, 38.36it/s] Loading 0: 34%|███▍ | 100/291 [00:02<00:04, 39.21it/s] Loading 0: 36%|███▌ | 105/291 [00:02<00:04, 40.17it/s] Loading 0: 38%|███▊ | 112/291 [00:02<00:03, 46.30it/s] Loading 0: 41%|████ | 118/291 [00:02<00:03, 44.83it/s] Loading 0: 42%|████▏ | 123/291 [00:02<00:03, 44.81it/s] Loading 0: 45%|████▍ | 130/291 [00:02<00:03, 49.84it/s] Loading 0: 47%|████▋ | 136/291 [00:03<00:03, 47.35it/s] Loading 0: 48%|████▊ | 141/291 [00:03<00:03, 46.99it/s] Loading 0: 51%|█████ | 147/291 [00:03<00:02, 50.18it/s] Loading 0: 53%|█████▎ | 153/291 [00:03<00:02, 52.04it/s] Loading 0: 55%|█████▍ | 159/291 [00:03<00:02, 46.27it/s] Loading 0: 57%|█████▋ | 166/291 [00:03<00:02, 50.68it/s] Loading 0: 59%|█████▉ | 172/291 [00:03<00:02, 46.00it/s] Loading 0: 62%|██████▏ | 179/291 [00:03<00:02, 50.15it/s] Loading 0: 64%|██████▎ | 185/291 [00:04<00:02, 51.89it/s] Loading 0: 66%|██████▌ | 191/291 [00:04<00:03, 32.43it/s] Loading 0: 67%|██████▋ | 196/291 [00:04<00:02, 34.49it/s] Loading 0: 69%|██████▉ | 202/291 [00:04<00:02, 39.23it/s] Loading 0: 71%|███████▏ | 208/291 [00:04<00:02, 38.54it/s] Loading 0: 73%|███████▎ | 213/291 [00:04<00:01, 39.26it/s] Loading 0: 75%|███████▌ | 219/291 [00:04<00:01, 43.44it/s] Loading 0: 77%|███████▋ | 224/291 [00:05<00:01, 43.48it/s] Loading 0: 79%|███████▊ | 229/291 [00:05<00:01, 43.01it/s] Loading 0: 81%|████████ | 235/291 [00:05<00:01, 42.19it/s] Loading 0: 82%|████████▏ | 240/291 [00:05<00:01, 42.37it/s] Loading 0: 85%|████████▍ | 247/291 [00:05<00:00, 47.41it/s] Loading 0: 87%|████████▋ | 253/291 [00:05<00:00, 44.17it/s] Loading 0: 89%|████████▊ | 258/291 [00:05<00:00, 42.88it/s] Loading 0: 91%|█████████ | 264/291 [00:05<00:00, 46.85it/s] Loading 0: 92%|█████████▏| 269/291 [00:06<00:00, 46.07it/s] Loading 0: 94%|█████████▍| 274/291 [00:06<00:00, 44.82it/s] Loading 0: 96%|█████████▌| 280/291 [00:06<00:00, 42.44it/s] Loading 0: 98%|█████████▊| 285/291 [00:06<00:00, 42.48it/s] Loading 0: 100%|█████████▉| 290/291 [00:11<00:00, 3.06it/s] /opt/conda/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py:957: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
nousresearch-meta-llama-4941-v94-mkmlizer: warnings.warn(
nousresearch-meta-llama-4941-v94-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py:785: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
nousresearch-meta-llama-4941-v94-mkmlizer: warnings.warn(
nousresearch-meta-llama-4941-v94-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:469: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
nousresearch-meta-llama-4941-v94-mkmlizer: warnings.warn(
nousresearch-meta-llama-4941-v94-mkmlizer: Downloading shards: 0%| | 0/2 [00:00<?, ?it/s] Downloading shards: 50%|█████ | 1/2 [00:05<00:05, 5.41s/it] Downloading shards: 100%|██████████| 2/2 [00:08<00:00, 3.94s/it] Downloading shards: 100%|██████████| 2/2 [00:08<00:00, 4.16s/it]
nousresearch-meta-llama-4941-v94-mkmlizer: Loading checkpoint shards: 0%| | 0/2 [00:00<?, ?it/s] Loading checkpoint shards: 50%|█████ | 1/2 [00:00<00:00, 2.31it/s] Loading checkpoint shards: 100%|██████████| 2/2 [00:00<00:00, 3.83it/s] Loading checkpoint shards: 100%|██████████| 2/2 [00:00<00:00, 3.48it/s]
nousresearch-meta-llama-4941-v94-mkmlizer: Saving model to /tmp/reward_cache/reward.tensors
nousresearch-meta-llama-4941-v94-mkmlizer: Saving duration: 1.368s
nousresearch-meta-llama-4941-v94-mkmlizer: Processed model ChaiML/gpt2_xl_pairwise_89m_step_347634 in 13.142s
nousresearch-meta-llama-4941-v94-mkmlizer: creating bucket guanaco-reward-models
nousresearch-meta-llama-4941-v94-mkmlizer: Bucket 's3://guanaco-reward-models/' created
nousresearch-meta-llama-4941-v94-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/nousresearch-meta-llama-4941-v94_reward
nousresearch-meta-llama-4941-v94-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/nousresearch-meta-llama-4941-v94_reward/config.json
nousresearch-meta-llama-4941-v94-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/nousresearch-meta-llama-4941-v94_reward/special_tokens_map.json
nousresearch-meta-llama-4941-v94-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/nousresearch-meta-llama-4941-v94_reward/tokenizer_config.json
nousresearch-meta-llama-4941-v94-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/nousresearch-meta-llama-4941-v94_reward/merges.txt
nousresearch-meta-llama-4941-v94-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/nousresearch-meta-llama-4941-v94_reward/vocab.json
nousresearch-meta-llama-4941-v94-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/nousresearch-meta-llama-4941-v94_reward/tokenizer.json
nousresearch-meta-llama-4941-v94-mkmlizer: cp /tmp/reward_cache/reward.tensors s3://guanaco-reward-models/nousresearch-meta-llama-4941-v94_reward/reward.tensors
Job nousresearch-meta-llama-4941-v94-mkmlizer completed after 95.58s with status: succeeded
Stopping job with name nousresearch-meta-llama-4941-v94-mkmlizer
Pipeline stage MKMLizer completed in 96.73s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.09s
Running pipeline stage ISVCDeployer
Creating inference service nousresearch-meta-llama-4941-v94
Waiting for inference service nousresearch-meta-llama-4941-v94 to be ready
Inference service nousresearch-meta-llama-4941-v94 ready after 111.03601551055908s
Pipeline stage ISVCDeployer completed in 112.86s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.8844079971313477s
Received healthy response to inference request in 1.083404779434204s
Received healthy response to inference request in 1.0633149147033691s
Received healthy response to inference request in 1.0561566352844238s
Received healthy response to inference request in 1.0566649436950684s
5 requests
0 failed requests
5th percentile: 1.0562582969665528
10th percentile: 1.0563599586486816
20th percentile: 1.0565632820129394
30th percentile: 1.0579949378967286
40th percentile: 1.0606549263000489
50th percentile: 1.0633149147033691
60th percentile: 1.0713508605957032
70th percentile: 1.0793868064880372
80th percentile: 1.243605422973633
90th percentile: 1.5640067100524904
95th percentile: 1.7242073535919187
99th percentile: 1.852367868423462
mean time: 1.2287898540496827
Pipeline stage StressChecker completed in 6.79s
nousresearch-meta-llama_4941_v94 status is now deployed due to DeploymentManager action
nousresearch-meta-llama_4941_v94 status is now inactive due to auto deactivation removed underperforming models
admin requested tearing down of nousresearch-meta-llama_4941_v94
Running pipeline stage ISVCDeleter
Checking if service nousresearch-meta-llama-4941-v94 is running
Tearing down inference service nousresearch-meta-llama-4941-v94
Service nousresearch-meta-llama-4941-v94 has been torndown
Pipeline stage ISVCDeleter completed in 4.48s
Running pipeline stage MKMLModelDeleter
Cleaning model data from S3
Cleaning model data from model cache
Deleting key nousresearch-meta-llama-4941-v94/config.json from bucket guanaco-mkml-models
Deleting key nousresearch-meta-llama-4941-v94/flywheel_model.0.safetensors from bucket guanaco-mkml-models
Deleting key nousresearch-meta-llama-4941-v94/special_tokens_map.json from bucket guanaco-mkml-models
Deleting key nousresearch-meta-llama-4941-v94/tokenizer.json from bucket guanaco-mkml-models
Deleting key nousresearch-meta-llama-4941-v94/tokenizer_config.json from bucket guanaco-mkml-models
Cleaning model data from model cache
Deleting key nousresearch-meta-llama-4941-v94_reward/config.json from bucket guanaco-reward-models
Deleting key nousresearch-meta-llama-4941-v94_reward/merges.txt from bucket guanaco-reward-models
Deleting key nousresearch-meta-llama-4941-v94_reward/reward.tensors from bucket guanaco-reward-models
Deleting key nousresearch-meta-llama-4941-v94_reward/special_tokens_map.json from bucket guanaco-reward-models
Deleting key nousresearch-meta-llama-4941-v94_reward/tokenizer.json from bucket guanaco-reward-models
Deleting key nousresearch-meta-llama-4941-v94_reward/tokenizer_config.json from bucket guanaco-reward-models
Deleting key nousresearch-meta-llama-4941-v94_reward/vocab.json from bucket guanaco-reward-models
Pipeline stage MKMLModelDeleter completed in 5.42s
nousresearch-meta-llama_4941_v94 status is now torndown due to DeploymentManager action