Running pipeline stage MKMLizer
Starting job with name anhnv125-mistral-base-v11-mkmlizer
Waiting for job on anhnv125-mistral-base-v11-mkmlizer to finish
anhnv125-mistral-base-v11-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
anhnv125-mistral-base-v11-mkmlizer: ║ _____ __ __ ║
anhnv125-mistral-base-v11-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
anhnv125-mistral-base-v11-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
anhnv125-mistral-base-v11-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
anhnv125-mistral-base-v11-mkmlizer: ║ /___/ ║
anhnv125-mistral-base-v11-mkmlizer: ║ ║
anhnv125-mistral-base-v11-mkmlizer: ║ Version: 0.6.11 ║
anhnv125-mistral-base-v11-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
anhnv125-mistral-base-v11-mkmlizer: ║ ║
anhnv125-mistral-base-v11-mkmlizer: ║ The license key for the current software has been verified as ║
anhnv125-mistral-base-v11-mkmlizer: ║ belonging to: ║
anhnv125-mistral-base-v11-mkmlizer: ║ ║
anhnv125-mistral-base-v11-mkmlizer: ║ Chai Research Corp. ║
anhnv125-mistral-base-v11-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
anhnv125-mistral-base-v11-mkmlizer: ║ Expiration: 2024-07-15 23:59:59 ║
anhnv125-mistral-base-v11-mkmlizer: ║ ║
anhnv125-mistral-base-v11-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
anhnv125-mistral-base-v11-mkmlizer:
.gitattributes: 0%| | 0.00/1.52k [00:00<?, ?B/s]
.gitattributes: 100%|██████████| 1.52k/1.52k [00:00<00:00, 19.6MB/s]
anhnv125-mistral-base-v11-mkmlizer:
README.md: 0%| | 0.00/5.18k [00:00<?, ?B/s]
README.md: 100%|██████████| 5.18k/5.18k [00:00<00:00, 57.8MB/s]
anhnv125-mistral-base-v11-mkmlizer:
config.json: 0%| | 0.00/652 [00:00<?, ?B/s]
config.json: 100%|██████████| 652/652 [00:00<00:00, 8.24MB/s]
anhnv125-mistral-base-v11-mkmlizer:
generation_config.json: 0%| | 0.00/132 [00:00<?, ?B/s]
generation_config.json: 100%|██████████| 132/132 [00:00<00:00, 1.44MB/s]
anhnv125-mistral-base-v11-mkmlizer:
model-00001-of-00003.safetensors: 0%| | 0.00/4.94G [00:00<?, ?B/s]
model-00001-of-00003.safetensors: 0%| | 10.5M/4.94G [00:00<01:13, 67.4MB/s]
model-00001-of-00003.safetensors: 1%| | 31.5M/4.94G [00:00<01:01, 79.3MB/s]
model-00001-of-00003.safetensors: 1%| | 41.9M/4.94G [00:00<01:14, 65.6MB/s]
model-00001-of-00003.safetensors: 3%|▎ | 136M/4.94G [00:00<00:26, 184MB/s]
model-00001-of-00003.safetensors: 3%|▎ | 168M/4.94G [00:01<00:23, 207MB/s]
model-00001-of-00003.safetensors: 6%|▌ | 294M/4.94G [00:01<00:11, 422MB/s]
model-00001-of-00003.safetensors: 16%|█▌ | 786M/4.94G [00:01<00:02, 1.45GB/s]
model-00001-of-00003.safetensors: 25%|██▌ | 1.24G/4.94G [00:01<00:01, 2.16GB/s]
model-00001-of-00003.safetensors: 31%|███ | 1.54G/4.94G [00:01<00:01, 2.20GB/s]
model-00001-of-00003.safetensors: 36%|███▋ | 1.80G/4.94G [00:01<00:01, 1.69GB/s]
model-00001-of-00003.safetensors: 41%|████ | 2.02G/4.94G [00:01<00:02, 1.31GB/s]
model-00001-of-00003.safetensors: 45%|████▍ | 2.22G/4.94G [00:02<00:01, 1.40GB/s]
model-00001-of-00003.safetensors: 49%|████▊ | 2.40G/4.94G [00:02<00:01, 1.36GB/s]
model-00001-of-00003.safetensors: 54%|█████▎ | 2.65G/4.94G [00:02<00:01, 1.58GB/s]
model-00001-of-00003.safetensors: 62%|██████▏ | 3.08G/4.94G [00:02<00:00, 2.18GB/s]
model-00001-of-00003.safetensors: 68%|██████▊ | 3.34G/4.94G [00:02<00:00, 2.13GB/s]
model-00001-of-00003.safetensors: 74%|███████▎ | 3.64G/4.94G [00:02<00:00, 2.23GB/s]
model-00001-of-00003.safetensors: 78%|███████▊ | 3.88G/4.94G [00:02<00:00, 2.09GB/s]
model-00001-of-00003.safetensors: 83%|████████▎ | 4.12G/4.94G [00:02<00:00, 2.15GB/s]
model-00001-of-00003.safetensors: 88%|████████▊ | 4.36G/4.94G [00:03<00:00, 1.81GB/s]
model-00001-of-00003.safetensors: 100%|█████████▉| 4.94G/4.94G [00:03<00:00, 1.51GB/s]
anhnv125-mistral-base-v11-mkmlizer:
model-00002-of-00003.safetensors: 0%| | 0.00/5.00G [00:00<?, ?B/s]
model-00002-of-00003.safetensors: 0%| | 10.5M/5.00G [00:00<03:19, 25.0MB/s]
model-00002-of-00003.safetensors: 0%| | 21.0M/5.00G [00:00<01:53, 44.1MB/s]
model-00002-of-00003.safetensors: 1%|▏ | 62.9M/5.00G [00:00<00:35, 139MB/s]
model-00002-of-00003.safetensors: 2%|▏ | 94.4M/5.00G [00:00<00:28, 175MB/s]
model-00002-of-00003.safetensors: 3%|▎ | 126M/5.00G [00:00<00:26, 182MB/s]
model-00002-of-00003.safetensors: 9%|▉ | 440M/5.00G [00:01<00:05, 906MB/s]
model-00002-of-00003.safetensors: 23%|██▎ | 1.16G/5.00G [00:01<00:01, 2.53GB/s]
model-00002-of-00003.safetensors: 30%|██▉ | 1.48G/5.00G [00:01<00:01, 1.99GB/s]
model-00002-of-00003.safetensors: 35%|███▍ | 1.74G/5.00G [00:01<00:02, 1.23GB/s]
model-00002-of-00003.safetensors: 39%|███▉ | 1.94G/5.00G [00:01<00:02, 1.34GB/s]
model-00002-of-00003.safetensors: 45%|████▌ | 2.26G/5.00G [00:01<00:01, 1.67GB/s]
model-00002-of-00003.safetensors: 54%|█████▍ | 2.72G/5.00G [00:02<00:01, 2.20GB/s]
model-00002-of-00003.safetensors: 60%|█████▉ | 3.00G/5.00G [00:02<00:01, 1.96GB/s]
model-00002-of-00003.safetensors: 65%|██████▌ | 3.25G/5.00G [00:02<00:00, 1.99GB/s]
model-00002-of-00003.safetensors: 70%|██████▉ | 3.49G/5.00G [00:02<00:00, 1.68GB/s]
model-00002-of-00003.safetensors: 75%|███████▍ | 3.74G/5.00G [00:02<00:00, 1.85GB/s]
model-00002-of-00003.safetensors: 79%|███████▉ | 3.96G/5.00G [00:02<00:00, 1.68GB/s]
model-00002-of-00003.safetensors: 84%|████████▍ | 4.22G/5.00G [00:02<00:00, 1.84GB/s]
model-00002-of-00003.safetensors: 95%|█████████▍| 4.74G/5.00G [00:03<00:00, 2.63GB/s]
model-00002-of-00003.safetensors: 100%|█████████▉| 5.00G/5.00G [00:03<00:00, 1.57GB/s]
anhnv125-mistral-base-v11-mkmlizer:
model-00003-of-00003.safetensors: 0%| | 0.00/4.54G [00:00<?, ?B/s]
model-00003-of-00003.safetensors: 0%| | 10.5M/4.54G [00:00<01:56, 38.8MB/s]
model-00003-of-00003.safetensors: 2%|▏ | 94.4M/4.54G [00:00<00:15, 295MB/s]
model-00003-of-00003.safetensors: 5%|▌ | 241M/4.54G [00:00<00:06, 639MB/s]
model-00003-of-00003.safetensors: 12%|█▏ | 545M/4.54G [00:00<00:02, 1.35GB/s]
model-00003-of-00003.safetensors: 16%|█▋ | 744M/4.54G [00:00<00:02, 1.54GB/s]
model-00003-of-00003.safetensors: 21%|██▏ | 975M/4.54G [00:00<00:02, 1.75GB/s]
model-00003-of-00003.safetensors: 26%|██▌ | 1.17G/4.54G [00:00<00:01, 1.75GB/s]
model-00003-of-00003.safetensors: 31%|███ | 1.42G/4.54G [00:01<00:01, 1.93GB/s]
model-00003-of-00003.safetensors: 36%|███▋ | 1.66G/4.54G [00:01<00:01, 2.02GB/s]
model-00003-of-00003.safetensors: 41%|████ | 1.87G/4.54G [00:01<00:01, 1.71GB/s]
model-00003-of-00003.safetensors: 47%|████▋ | 2.12G/4.54G [00:01<00:01, 1.91GB/s]
model-00003-of-00003.safetensors: 51%|█████▏ | 2.33G/4.54G [00:01<00:01, 1.74GB/s]
model-00003-of-00003.safetensors: 55%|█████▌ | 2.52G/4.54G [00:01<00:01, 1.64GB/s]
model-00003-of-00003.safetensors: 59%|█████▉ | 2.69G/4.54G [00:01<00:01, 1.50GB/s]
model-00003-of-00003.safetensors: 64%|██████▍ | 2.90G/4.54G [00:01<00:00, 1.64GB/s]
model-00003-of-00003.safetensors: 68%|██████▊ | 3.10G/4.54G [00:02<00:00, 1.66GB/s]
model-00003-of-00003.safetensors: 72%|███████▏ | 3.28G/4.54G [00:02<00:00, 1.66GB/s]
model-00003-of-00003.safetensors: 76%|███████▌ | 3.46G/4.54G [00:02<00:00, 1.60GB/s]
model-00003-of-00003.safetensors: 80%|███████▉ | 3.63G/4.54G [00:02<00:00, 1.54GB/s]
model-00003-of-00003.safetensors: 87%|████████▋ | 3.94G/4.54G [00:02<00:00, 1.93GB/s]
model-00003-of-00003.safetensors: 93%|█████████▎| 4.23G/4.54G [00:02<00:00, 2.13GB/s]
model-00003-of-00003.safetensors: 100%|█████████▉| 4.54G/4.54G [00:02<00:00, 1.65GB/s]
anhnv125-mistral-base-v11-mkmlizer:
model.safetensors.index.json: 0%| | 0.00/23.9k [00:00<?, ?B/s]
model.safetensors.index.json: 100%|██████████| 23.9k/23.9k [00:00<00:00, 129MB/s]
anhnv125-mistral-base-v11-mkmlizer:
special_tokens_map.json: 0%| | 0.00/551 [00:00<?, ?B/s]
special_tokens_map.json: 100%|██████████| 551/551 [00:00<00:00, 5.88MB/s]
anhnv125-mistral-base-v11-mkmlizer:
tokenizer.json: 0%| | 0.00/1.80M [00:00<?, ?B/s]
tokenizer.json: 100%|██████████| 1.80M/1.80M [00:00<00:00, 21.7MB/s]
anhnv125-mistral-base-v11-mkmlizer:
tokenizer.model: 0%| | 0.00/493k [00:00<?, ?B/s]
tokenizer.model: 100%|██████████| 493k/493k [00:00<00:00, 61.0MB/s]
anhnv125-mistral-base-v11-mkmlizer:
tokenizer_config.json: 0%| | 0.00/1.02k [00:00<?, ?B/s]
tokenizer_config.json: 100%|██████████| 1.02k/1.02k [00:00<00:00, 8.02MB/s]
anhnv125-mistral-base-v11-mkmlizer: Downloaded to shared memory in 11.504s
anhnv125-mistral-base-v11-mkmlizer: quantizing model to /dev/shm/model_cache
anhnv125-mistral-base-v11-mkmlizer: Saving mkml model at /dev/shm/model_cache
anhnv125-mistral-base-v11-mkmlizer: Reading /tmp/tmpu4v9hg4g/model.safetensors.index.json
anhnv125-mistral-base-v11-mkmlizer:
Profiling: 0%| | 0/291 [00:00<?, ?it/s]
Profiling: 0%| | 1/291 [00:01<07:07, 1.47s/it]
Profiling: 7%|▋ | 20/291 [00:01<00:15, 17.22it/s]
Profiling: 13%|█▎ | 38/291 [00:01<00:07, 34.94it/s]
Profiling: 20%|██ | 59/291 [00:01<00:03, 58.44it/s]
Profiling: 27%|██▋ | 79/291 [00:01<00:02, 81.24it/s]
Profiling: 34%|███▎ | 98/291 [00:02<00:02, 67.35it/s]
Profiling: 40%|███▉ | 115/291 [00:02<00:02, 82.79it/s]
Profiling: 46%|████▌ | 134/291 [00:02<00:01, 101.66it/s]
Profiling: 54%|█████▎ | 156/291 [00:02<00:01, 122.99it/s]
Profiling: 60%|██████ | 175/291 [00:02<00:00, 136.07it/s]
Profiling: 67%|██████▋ | 194/291 [00:02<00:00, 145.83it/s]
Profiling: 73%|███████▎ | 212/291 [00:04<00:02, 32.03it/s]
Profiling: 79%|███████▉ | 231/291 [00:04<00:01, 42.70it/s]
Profiling: 87%|████████▋ | 253/291 [00:04<00:00, 58.35it/s]
Profiling: 94%|█████████▍| 274/291 [00:04<00:00, 74.70it/s]
Profiling: 100%|██████████| 291/291 [00:05<00:00, 57.35it/s]
anhnv125-mistral-base-v11-mkmlizer: quantized model in 15.360s
anhnv125-mistral-base-v11-mkmlizer: Processed model anhnv125/mistral-base in 27.860s
anhnv125-mistral-base-v11-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/anhnv125-mistral-base-v11/tokenizer.json
anhnv125-mistral-base-v11-mkmlizer: cp /dev/shm/model_cache/tokenizer.model s3://guanaco-mkml-models/anhnv125-mistral-base-v11/tokenizer.model
anhnv125-mistral-base-v11-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/anhnv125-mistral-base-v11/special_tokens_map.json
anhnv125-mistral-base-v11-mkmlizer: cp /dev/shm/model_cache/mkml_model.tensors s3://guanaco-mkml-models/anhnv125-mistral-base-v11/mkml_model.tensors
anhnv125-mistral-base-v11-mkmlizer: loading reward model from rirv938/reward_gpt2_medium_preference_24m_e2
anhnv125-mistral-base-v11-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py:1067: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
anhnv125-mistral-base-v11-mkmlizer: warnings.warn(
anhnv125-mistral-base-v11-mkmlizer:
config.json: 0%| | 0.00/1.05k [00:00<?, ?B/s]
config.json: 100%|██████████| 1.05k/1.05k [00:00<00:00, 12.2MB/s]
anhnv125-mistral-base-v11-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py:690: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
anhnv125-mistral-base-v11-mkmlizer: warnings.warn(
anhnv125-mistral-base-v11-mkmlizer:
tokenizer_config.json: 0%| | 0.00/234 [00:00<?, ?B/s]
tokenizer_config.json: 100%|██████████| 234/234 [00:00<00:00, 3.20MB/s]
anhnv125-mistral-base-v11-mkmlizer:
vocab.json: 0%| | 0.00/1.04M [00:00<?, ?B/s]
vocab.json: 100%|██████████| 1.04M/1.04M [00:00<00:00, 8.11MB/s]
vocab.json: 100%|██████████| 1.04M/1.04M [00:00<00:00, 8.06MB/s]
anhnv125-mistral-base-v11-mkmlizer:
tokenizer.json: 0%| | 0.00/2.11M [00:00<?, ?B/s]
tokenizer.json: 100%|██████████| 2.11M/2.11M [00:00<00:00, 5.35MB/s]
tokenizer.json: 100%|██████████| 2.11M/2.11M [00:00<00:00, 5.34MB/s]
anhnv125-mistral-base-v11-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:472: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
anhnv125-mistral-base-v11-mkmlizer: warnings.warn(
anhnv125-mistral-base-v11-mkmlizer:
pytorch_model.bin: 0%| | 0.00/1.44G [00:00<?, ?B/s]
pytorch_model.bin: 1%| | 10.5M/1.44G [00:00<00:22, 64.8MB/s]
pytorch_model.bin: 1%|▏ | 21.0M/1.44G [00:00<00:24, 59.0MB/s]
pytorch_model.bin: 2%|▏ | 31.5M/1.44G [00:00<00:20, 69.5MB/s]
pytorch_model.bin: 17%|█▋ | 252M/1.44G [00:00<00:01, 705MB/s]
pytorch_model.bin: 27%|██▋ | 388M/1.44G [00:00<00:01, 888MB/s]
pytorch_model.bin: 35%|███▍ | 503M/1.44G [00:00<00:00, 948MB/s]
pytorch_model.bin: 44%|████▎ | 629M/1.44G [00:00<00:00, 994MB/s]
pytorch_model.bin: 55%|█████▌ | 797M/1.44G [00:01<00:00, 1.15GB/s]
pytorch_model.bin: 69%|██████▉ | 996M/1.44G [00:01<00:00, 1.18GB/s]
pytorch_model.bin: 78%|███████▊ | 1.12G/1.44G [00:01<00:00, 916MB/s]
pytorch_model.bin: 98%|█████████▊| 1.41G/1.44G [00:01<00:00, 1.31GB/s]
pytorch_model.bin: 100%|█████████▉| 1.44G/1.44G [00:01<00:00, 912MB/s]
anhnv125-mistral-base-v11-mkmlizer: Saving model to /tmp/reward_cache/reward.tensors
anhnv125-mistral-base-v11-mkmlizer: Saving duration: 0.269s
anhnv125-mistral-base-v11-mkmlizer: Processed model rirv938/reward_gpt2_medium_preference_24m_e2 in 6.458s
anhnv125-mistral-base-v11-mkmlizer: creating bucket guanaco-reward-models
anhnv125-mistral-base-v11-mkmlizer: Bucket 's3://guanaco-reward-models/' created
anhnv125-mistral-base-v11-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/anhnv125-mistral-base-v11_reward
anhnv125-mistral-base-v11-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/anhnv125-mistral-base-v11_reward/config.json
anhnv125-mistral-base-v11-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/anhnv125-mistral-base-v11_reward/tokenizer_config.json
anhnv125-mistral-base-v11-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/anhnv125-mistral-base-v11_reward/vocab.json
anhnv125-mistral-base-v11-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/anhnv125-mistral-base-v11_reward/merges.txt
anhnv125-mistral-base-v11-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/anhnv125-mistral-base-v11_reward/tokenizer.json
anhnv125-mistral-base-v11-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/anhnv125-mistral-base-v11_reward/special_tokens_map.json
anhnv125-mistral-base-v11-mkmlizer: cp /tmp/reward_cache/reward.tensors s3://guanaco-reward-models/anhnv125-mistral-base-v11_reward/reward.tensors
Job anhnv125-mistral-base-v11-mkmlizer completed after 54.31s with status: succeeded
Stopping job with name anhnv125-mistral-base-v11-mkmlizer
Pipeline stage MKMLizer completed in 58.99s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.12s
Running pipeline stage ISVCDeployer
Creating inference service anhnv125-mistral-base-v11
Waiting for inference service anhnv125-mistral-base-v11 to be ready
Inference service anhnv125-mistral-base-v11 ready after 41.30245804786682s
Pipeline stage ISVCDeployer completed in 48.95s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.8751962184906006s
Received healthy response to inference request in 1.2543811798095703s
Received healthy response to inference request in 1.387115716934204s
Received healthy response to inference request in 1.270995855331421s
Received healthy response to inference request in 1.2655792236328125s
5 requests
0 failed requests
5th percentile: 1.2566207885742187
10th percentile: 1.2588603973388672
20th percentile: 1.2633396148681642
30th percentile: 1.2666625499725341
40th percentile: 1.2688292026519776
50th percentile: 1.270995855331421
60th percentile: 1.3174437999725341
70th percentile: 1.3638917446136474
80th percentile: 1.4847318172454835
90th percentile: 1.679964017868042
95th percentile: 1.7775801181793212
99th percentile: 1.8556729984283447
mean time: 1.4106536388397217
Pipeline stage StressChecker completed in 8.06s
Running pipeline stage DaemonicModelEvalScorer
Pipeline stage DaemonicModelEvalScorer completed in 0.05s
Running pipeline stage DaemonicSafetyScorer
Running M-Eval for topic stay_in_character
Pipeline stage DaemonicSafetyScorer completed in 0.04s
M-Eval Dataset for topic stay_in_character is loaded
anhnv125-mistral-base_v11 status is now deployed due to DeploymentManager action
anhnv125-mistral-base_v11 status is now rejected due to Failing to get Model Eval score