Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-mistral-24b-2048-25909-v1-mkmlizer
Waiting for job on chaiml-mistral-24b-2048-25909-v1-mkmlizer to finish
chaiml-mistral-24b-2048-74727-v1-mkmlizer: /root/miniconda3/envs/nvidia/lib/python3.11/site-packages/mk1/__init__.py:1: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
chaiml-mistral-24b-2048-74727-v1-mkmlizer: __import__('pkg_resources').declare_namespace(__name__)
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ Version: 0.30.6+torch280 ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ https://mk1.ai ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ belonging to: ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ Chai Research Corp. ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ║ ║
chaiml-mistral-24b-2048-74727-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-mistral-24b-2048-25909-v1-mkmlizer: /root/miniconda3/envs/nvidia/lib/python3.11/site-packages/mk1/__init__.py:1: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
chaiml-mistral-24b-2048-25909-v1-mkmlizer: __import__('pkg_resources').declare_namespace(__name__)
chaiml-mistral-24b-2048-74727-v1-mkmlizer: Downloaded to shared memory in 100.451s
chaiml-mistral-24b-2048-74727-v1-mkmlizer: Checking if ChaiML/mistral_24b_2048_gemini_opus_ds_v9_1686_merged already exists in ChaiML
chaiml-mistral-24b-2048-74727-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:q4, folder:/tmp/tmp8ev8egxa, device:0
chaiml-mistral-24b-2048-74727-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-mistral-24b-2048-25909-v1-mkmlizer: Downloaded to shared memory in 106.706s
chaiml-mistral-24b-2048-25909-v1-mkmlizer: Checking if ChaiML/mistral_24b_2048_gemini_opus_ds_v9_843_merged already exists in ChaiML
chaiml-mistral-24b-2048-25909-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:q4, folder:/tmp/tmpjn705egk, device:0
chaiml-mistral-24b-2048-25909-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Stopping job with name chaiml-mistral-24b-2048-74727-v1-mkmlizer
%s, retrying in %s seconds...
Starting job with name chaiml-mistral-24b-2048-74727-v1-mkmlizer
Waiting for job on chaiml-mistral-24b-2048-74727-v1-mkmlizer to finish
chaiml-mistral-24b-2048-74727-v1-mkmlizer: bash: cannot set terminal process group (-1): Inappropriate ioctl for device
chaiml-mistral-24b-2048-74727-v1-mkmlizer: bash: no job control in this shell
chaiml-mistral-24b-2048-74727-v1-mkmlizer: /root/miniconda3/envs/nvidia/lib/python3.11/site-packages/mk1/__init__.py:1: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
chaiml-mistral-24b-2048-74727-v1-mkmlizer: __import__('pkg_resources').declare_namespace(__name__)
chaiml-mistral-24b-2048-74727-v1-mkmlizer: Downloaded to shared memory in 101.382s
chaiml-mistral-24b-2048-74727-v1-mkmlizer: Checking if ChaiML/mistral_24b_2048_gemini_opus_ds_v9_1686_merged already exists in ChaiML
chaiml-mistral-24b-2048-74727-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:q4, folder:/tmp/tmpc5ib3gsq, device:0
chaiml-mistral-24b-2048-74727-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-mistral-24b-2048-25909-v1-mkmlizer:
Loading 0: 0%| | 0.00/363 [00:00<?, ?it/s]
Loading 0: 1%| | 3.00/363 [00:02<04:25, 1.35it/s]
Loading 0: 1%| | 4.00/363 [00:04<07:11, 1.20s/it]
Loading 0: 1%|▏ | 5.00/363 [00:06<09:16, 1.55s/it]
Loading 0: 3%|▎ | 11.0/363 [00:07<03:16, 1.79it/s]
Loading 0: 4%|▎ | 13.0/363 [00:10<04:16, 1.36it/s]
Loading 0: 4%|▍ | 14.0/363 [00:12<05:35, 1.04it/s]
Loading 0: 4%|▍ | 15.0/363 [00:14<06:59, 1.20s/it]
Loading 0: 6%|▌ | 21.0/363 [00:17<04:15, 1.34it/s]
Loading 0: 6%|▌ | 22.0/363 [00:19<05:16, 1.08it/s]
Loading 0: 6%|▋ | 23.0/363 [00:22<06:27, 1.14s/it]
Loading 0: 8%|▊ | 30.0/363 [00:23<03:01, 1.84it/s]
Loading 0: 9%|▉ | 34.0/363 [00:26<03:18, 1.66it/s]
Loading 0: 10%|▉ | 35.0/363 [00:28<04:12, 1.30it/s]
Loading 0: 10%|▉ | 36.0/363 [00:30<05:17, 1.03it/s]
Loading 0: 11%|█ | 39.0/363 [00:32<04:46, 1.13it/s]
Loading 0: 11%|█ | 40.0/363 [00:34<05:46, 1.07s/it]
Loading 0: 11%|█▏ | 41.0/363 [00:37<06:54, 1.29s/it]
Loading 0: 13%|█▎ | 48.0/363 [00:38<02:59, 1.76it/s]
Loading 0: 14%|█▍ | 52.0/363 [00:41<03:11, 1.63it/s]
Loading 0: 15%|█▍ | 53.0/363 [00:43<04:03, 1.28it/s]
Loading 0: 15%|█▍ | 54.0/363 [00:45<05:05, 1.01it/s]
Loading 0: 16%|█▌ | 57.0/363 [00:47<04:33, 1.12it/s]
Loading 0: 16%|█▌ | 58.0/363 [00:49<05:30, 1.08s/it]
Loading 0: 16%|█▋ | 59.0/363 [00:52<06:34, 1.30s/it]
Loading 0: 18%|█▊ | 66.0/363 [00:53<02:50, 1.74it/s]
Loading 0: 19%|█▉ | 70.0/363 [00:56<03:01, 1.61it/s]
Loading 0: 20%|█▉ | 71.0/363 [00:58<03:50, 1.27it/s]
Loading 0: 20%|█▉ | 72.0/363 [01:00<04:48, 1.01it/s]
Loading 0: 21%|██ | 75.0/363 [01:02<04:17, 1.12it/s]
Loading 0: 21%|██ | 76.0/363 [01:04<05:11, 1.09s/it]
Loading 0: 21%|██ | 77.0/363 [01:07<06:10, 1.29s/it]
Loading 0: 23%|██▎ | 84.0/363 [01:08<02:40, 1.74it/s]
Loading 0: 24%|██▍ | 88.0/363 [01:11<02:49, 1.62it/s]
Loading 0: 25%|██▍ | 89.0/363 [01:13<03:35, 1.27it/s]
Loading 0: 25%|██▍ | 90.0/363 [01:15<04:29, 1.01it/s]
Loading 0: 27%|██▋ | 97.0/363 [01:16<02:16, 1.94it/s]
Loading 0: 28%|██▊ | 101/363 [01:19<02:29, 1.76it/s]
Loading 0: 28%|██▊ | 102/363 [01:21<03:11, 1.36it/s]
Loading 0: 28%|██▊ | 103/363 [01:24<04:04, 1.06it/s]
Loading 0: 29%|██▉ | 106/363 [01:26<03:47, 1.13it/s]
Loading 0: 29%|██▉ | 107/363 [01:28<04:34, 1.07s/it]
Loading 0: 30%|██▉ | 108/363 [01:30<05:27, 1.29s/it]
Loading 0: 31%|███ | 111/363 [01:33<04:25, 1.05s/it]
Loading 0: 31%|███ | 112/363 [01:35<05:11, 1.24s/it]
Loading 0: 31%|███ | 113/363 [01:37<06:03, 1.45s/it]
Loading 0: 33%|███▎ | 120/363 [01:38<02:25, 1.67it/s]
Loading 0: 34%|███▍ | 124/363 [01:41<02:31, 1.58it/s]
Loading 0: 34%|███▍ | 125/363 [01:43<03:12, 1.24it/s]
Loading 0: 35%|███▍ | 126/363 [01:46<04:01, 1.02s/it]
Loading 0: 36%|███▌ | 129/363 [01:48<03:33, 1.09it/s]
Loading 0: 36%|███▌ | 130/363 [01:50<04:16, 1.10s/it]
Loading 0: 36%|███▌ | 131/363 [01:52<05:05, 1.31s/it]
Loading 0: 38%|███▊ | 138/363 [01:53<02:10, 1.72it/s]
Loading 0: 39%|███▉ | 142/363 [01:56<02:18, 1.60it/s]
Loading 0: 39%|███▉ | 143/363 [01:58<02:55, 1.26it/s]
Loading 0: 40%|███▉ | 144/363 [02:01<03:39, 1.00s/it]
Loading 0: 40%|████ | 147/363 [02:03<03:15, 1.11it/s]
Loading 0: 41%|████ | 148/363 [02:05<03:58, 1.11s/it]
Loading 0: 41%|████ | 149/363 [02:08<04:42, 1.32s/it]
Loading 0: 43%|████▎ | 156/363 [02:09<02:00, 1.71it/s]
Loading 0: 44%|████▍ | 160/363 [02:12<02:07, 1.60it/s]
Loading 0: 44%|████▍ | 161/363 [02:14<02:40, 1.26it/s]
Loading 0: 45%|████▍ | 162/363 [02:16<03:21, 1.00s/it]
Loading 0: 45%|████▌ | 165/363 [02:18<02:59, 1.11it/s]
Loading 0: 46%|████▌ | 166/363 [02:20<03:35, 1.10s/it]
Loading 0: 46%|████▌ | 167/363 [02:23<04:15, 1.31s/it]
Loading 0: 48%|████▊ | 174/363 [02:24<01:49, 1.73it/s]
Loading 0: 49%|████▉ | 178/363 [02:27<01:54, 1.61it/s]
Loading 0: 49%|████▉ | 179/363 [02:29<02:25, 1.26it/s]
Loading 0: 50%|████▉ | 180/363 [02:31<03:02, 1.00it/s]
Loading 0: 50%|█████ | 183/363 [02:33<02:42, 1.11it/s]
Loading 0: 51%|█████ | 184/363 [02:36<03:19, 1.11s/it]
Loading 0: 51%|█████ | 185/363 [02:38<03:56, 1.33s/it]
Loading 0: 53%|█████▎ | 192/363 [02:39<01:40, 1.71it/s]
Loading 0: 54%|█████▍ | 196/363 [02:42<01:44, 1.59it/s]
Loading 0: 54%|█████▍ | 197/363 [02:44<02:12, 1.26it/s]
Loading 0: 55%|█████▍ | 198/363 [02:47<02:45, 1.00s/it]
Loading 0: 55%|█████▌ | 201/363 [02:49<02:26, 1.11it/s]
Loading 0: 56%|█████▌ | 202/363 [02:51<02:56, 1.09s/it]
Loading 0: 56%|█████▌ | 203/363 [02:53<03:29, 1.31s/it]
Loading 0: 58%|█████▊ | 210/363 [02:54<01:28, 1.73it/s]
Loading 0: 59%|█████▉ | 214/363 [02:57<01:32, 1.61it/s]
Loading 0: 59%|█████▉ | 215/363 [02:59<01:56, 1.27it/s]
Loading 0: 60%|█████▉ | 216/363 [03:02<02:26, 1.01it/s]
Loading 0: 60%|██████ | 219/363 [03:04<02:09, 1.11it/s]
Loading 0: 61%|██████ | 220/363 [03:06<02:37, 1.10s/it]
Loading 0: 61%|██████ | 221/363 [03:09<03:06, 1.32s/it]
Loading 0: 63%|██████▎ | 228/363 [03:10<01:17, 1.73it/s]
Loading 0: 64%|██████▍ | 232/363 [03:12<01:21, 1.61it/s]
Loading 0: 64%|██████▍ | 233/363 [03:15<01:42, 1.27it/s]
Loading 0: 64%|██████▍ | 234/363 [03:17<02:07, 1.01it/s]
Loading 0: 65%|██████▌ | 237/363 [03:19<01:53, 1.11it/s]
Loading 0: 66%|██████▌ | 238/363 [03:21<02:16, 1.09s/it]
Loading 0: 66%|██████▌ | 239/363 [03:24<02:42, 1.31s/it]
Loading 0: 68%|██████▊ | 246/363 [03:25<01:07, 1.74it/s]
Loading 0: 69%|██████▉ | 250/363 [03:28<01:10, 1.61it/s]
Loading 0: 69%|██████▉ | 251/363 [03:30<01:28, 1.26it/s]
Loading 0: 69%|██████▉ | 252/363 [03:32<01:50, 1.00it/s]
Loading 0: 70%|███████ | 255/363 [03:34<01:37, 1.11it/s]
Loading 0: 71%|███████ | 256/363 [03:36<01:56, 1.09s/it]
Loading 0: 71%|███████ | 257/363 [03:39<02:18, 1.31s/it]
Loading 0: 73%|███████▎ | 264/363 [03:40<00:57, 1.73it/s]
Loading 0: 74%|███████▍ | 268/363 [03:43<00:59, 1.61it/s]
Loading 0: 74%|███████▍ | 269/363 [03:45<01:14, 1.27it/s]
Loading 0: 74%|███████▍ | 270/363 [03:47<01:32, 1.01it/s]
Loading 0: 75%|███████▌ | 273/363 [04:04<03:57, 2.64s/it]
Loading 0: 75%|███████▌ | 274/363 [04:06<03:48, 2.57s/it]
Loading 0: 76%|███████▌ | 275/363 [04:08<03:42, 2.53s/it]
Loading 0: 78%|███████▊ | 282/363 [04:09<01:22, 1.02s/it]
Loading 0: 79%|███████▉ | 286/363 [04:12<01:09, 1.11it/s]
Loading 0: 79%|███████▉ | 287/363 [04:14<01:18, 1.04s/it]
Loading 0: 79%|███████▉ | 288/363 [04:16<01:30, 1.21s/it]
Loading 0: 80%|████████ | 291/363 [04:18<01:13, 1.02s/it]
Loading 0: 80%|████████ | 292/363 [04:21<01:24, 1.19s/it]
Loading 0: 81%|████████ | 293/363 [04:23<01:37, 1.39s/it]
Loading 0: 83%|████████▎ | 300/363 [04:24<00:37, 1.66it/s]
Loading 0: 84%|████████▎ | 304/363 [04:27<00:37, 1.58it/s]
Loading 0: 84%|████████▍ | 305/363 [04:29<00:46, 1.25it/s]
Loading 0: 84%|████████▍ | 306/363 [04:31<00:57, 1.00s/it]
Loading 0: 85%|████████▌ | 309/363 [04:33<00:48, 1.11it/s]
Loading 0: 85%|████████▌ | 310/363 [04:36<00:58, 1.09s/it]
Loading 0: 86%|████████▌ | 311/363 [04:38<01:07, 1.31s/it]
Loading 0: 88%|████████▊ | 318/363 [04:39<00:25, 1.75it/s]
Loading 0: 89%|████████▊ | 322/363 [04:42<00:25, 1.62it/s]
Loading 0: 89%|████████▉ | 323/363 [04:44<00:31, 1.27it/s]
Loading 0: 89%|████████▉ | 324/363 [04:46<00:38, 1.01it/s]
Loading 0: 90%|█████████ | 327/363 [04:49<00:32, 1.12it/s]
Loading 0: 90%|█████████ | 328/363 [04:51<00:38, 1.10s/it]
Loading 0: 91%|█████████ | 329/363 [04:53<00:44, 1.31s/it]
Loading 0: 93%|█████████▎| 336/363 [04:54<00:15, 1.73it/s]
Loading 0: 94%|█████████▎| 340/363 [04:57<00:14, 1.61it/s]
Loading 0: 94%|█████████▍| 341/363 [04:59<00:17, 1.27it/s]
Loading 0: 94%|█████████▍| 342/363 [05:01<00:20, 1.01it/s]
Loading 0: 95%|█████████▌| 345/363 [05:04<00:16, 1.11it/s]
Loading 0: 95%|█████████▌| 346/363 [05:06<00:19, 1.15s/it]
Loading 0: 96%|█████████▌| 347/363 [05:09<00:21, 1.35s/it]
Loading 0: 98%|█████████▊| 354/363 [05:10<00:05, 1.69it/s]
Loading 0: 99%|█████████▉| 359/363 [05:13<00:02, 1.63it/s]
Loading 0: 99%|█████████▉| 360/363 [05:15<00:02, 1.29it/s]
Loading 0: 99%|█████████▉| 361/363 [05:17<00:01, 1.04it/s]
Loading 0: 100%|██████████| 363/363 [05:17<00:00, 1.04it/s]
Loading 0: 100%|██████████| 363/363 [05:17<00:00, 1.14it/s]
chaiml-mistral-24b-2048-25909-v1-mkmlizer: The tokenizer you are loading from '/tmp/tmpjn705egk' with an incorrect regex pattern: https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503/discussions/84#69121093e8b480e709447d5e. This will lead to incorrect tokenization. You should set the `fix_mistral_regex=True` flag when loading this tokenizer to fix this issue.
chaiml-mistral-24b-2048-25909-v1-mkmlizer: quantized model in 325.752s
chaiml-mistral-24b-2048-25909-v1-mkmlizer: Processed model ChaiML/mistral_24b_2048_gemini_opus_ds_v9_843_merged in 432.458s
chaiml-mistral-24b-2048-25909-v1-mkmlizer: creating bucket guanaco-mkml-models
chaiml-mistral-24b-2048-25909-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-mistral-24b-2048-25909-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-mistral-24b-2048-25909-v1/nvidia
chaiml-mistral-24b-2048-25909-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-25909-v1/nvidia/config.json
chaiml-mistral-24b-2048-25909-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-25909-v1/nvidia/special_tokens_map.json
chaiml-mistral-24b-2048-25909-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-25909-v1/nvidia/tokenizer_config.json
chaiml-mistral-24b-2048-25909-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-25909-v1/nvidia/tokenizer.json
chaiml-mistral-24b-2048-25909-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/chaiml-mistral-24b-2048-25909-v1/nvidia/flywheel_model.1.safetensors
chaiml-mistral-24b-2048-25909-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-mistral-24b-2048-25909-v1/nvidia/flywheel_model.0.safetensors
Job chaiml-mistral-24b-2048-25909-v1-mkmlizer completed after 551.61s with status: succeeded
Stopping job with name chaiml-mistral-24b-2048-25909-v1-mkmlizer
Pipeline stage MKMLizer completed in 552.99s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.38s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-mistral-24b-2048-25909-v1
Waiting for inference service chaiml-mistral-24b-2048-25909-v1 to be ready
Inference service chaiml-mistral-24b-2048-25909-v1 ready after 152.86114144325256s
Pipeline stage MKMLDeployer completed in 154.11s
run pipeline stage %s
Running pipeline stage StressChecker
chaiml-mistral-24b-2048-74727-v1-mkmlizer:
Loading 0: 0%| | 0.00/363 [00:00<?, ?it/s]
Loading 0: 1%| | 3.00/363 [00:01<03:56, 1.52it/s]
Loading 0: 1%| | 4.00/363 [00:03<06:19, 1.06s/it]
Loading 0: 1%|▏ | 5.00/363 [00:05<08:06, 1.36s/it]
Loading 0: 3%|▎ | 11.0/363 [00:07<03:00, 1.95it/s]
Loading 0: 4%|▎ | 13.0/363 [00:09<03:55, 1.49it/s]
Loading 0: 4%|▍ | 14.0/363 [00:11<05:01, 1.16it/s]
Loading 0: 4%|▍ | 15.0/363 [00:13<06:13, 1.07s/it]
Loading 0: 6%|▌ | 21.0/363 [00:15<03:45, 1.52it/s]
Loading 0: 6%|▌ | 22.0/363 [00:17<04:37, 1.23it/s]
Loading 0: 6%|▋ | 23.0/363 [00:19<05:39, 1.00it/s]
Loading 0: 9%|▊ | 31.0/363 [00:20<02:32, 2.18it/s]
Loading 0: 9%|▉ | 34.0/363 [00:22<02:54, 1.89it/s]
Loading 0: 10%|▉ | 35.0/363 [00:24<03:42, 1.48it/s]
Loading 0: 10%|▉ | 36.0/363 [00:26<04:39, 1.17it/s]
Loading 0: 11%|█ | 39.0/363 [00:28<04:10, 1.29it/s]
Loading 0: 11%|█ | 40.0/363 [00:30<05:03, 1.06it/s]
Loading 0: 11%|█▏ | 41.0/363 [00:32<06:03, 1.13s/it]
Loading 0: 13%|█▎ | 49.0/363 [00:33<02:29, 2.10it/s]
Loading 0: 14%|█▍ | 52.0/363 [00:36<02:49, 1.84it/s]
Loading 0: 15%|█▍ | 53.0/363 [00:37<03:35, 1.44it/s]
Loading 0: 15%|█▍ | 54.0/363 [00:39<04:30, 1.14it/s]
Loading 0: 16%|█▌ | 57.0/363 [00:41<04:00, 1.27it/s]
Loading 0: 16%|█▌ | 58.0/363 [00:43<04:50, 1.05it/s]
Loading 0: 16%|█▋ | 59.0/363 [00:45<05:46, 1.14s/it]
Loading 0: 18%|█▊ | 67.0/363 [00:46<02:22, 2.08it/s]
Loading 0: 19%|█▉ | 70.0/363 [00:49<02:40, 1.82it/s]
Loading 0: 20%|█▉ | 71.0/363 [00:51<03:23, 1.43it/s]
Loading 0: 20%|█▉ | 72.0/363 [00:53<04:15, 1.14it/s]
Loading 0: 21%|██ | 75.0/363 [00:55<03:47, 1.27it/s]
Loading 0: 21%|██ | 76.0/363 [00:56<04:33, 1.05it/s]
Loading 0: 21%|██ | 77.0/363 [00:58<05:25, 1.14s/it]
Loading 0: 23%|██▎ | 85.0/363 [01:00<02:12, 2.10it/s]
Loading 0: 24%|██▍ | 88.0/363 [01:02<02:29, 1.84it/s]
Loading 0: 25%|██▍ | 89.0/363 [01:04<03:10, 1.44it/s]
Loading 0: 25%|██▍ | 90.0/363 [01:06<03:58, 1.14it/s]
Loading 0: 27%|██▋ | 98.0/363 [01:07<01:53, 2.34it/s]
Loading 0: 27%|██▋ | 98.0/363 [01:07<01:53, 2.34it/s]
Loading 0: 28%|██▊ | 101/363 [01:09<02:08, 2.04it/s]
Loading 0: 28%|██▊ | 101/363 [01:09<02:08, 2.04it/s]
Loading 0: 28%|██▊ | 102/363 [01:11<02:46, 1.57it/s]
Loading 0: 28%|██▊ | 102/363 [01:11<02:46, 1.57it/s]
Loading 0: 28%|██▊ | 103/363 [01:13<03:32, 1.22it/s]
Loading 0: 28%|██▊ | 103/363 [01:13<03:32, 1.22it/s]
Loading 0: 29%|██▉ | 106/363 [01:15<03:17, 1.30it/s]
Loading 0: 29%|██▉ | 106/363 [01:15<03:17, 1.30it/s]
Loading 0: 29%|██▉ | 107/363 [01:17<03:59, 1.07it/s]
Loading 0: 29%|██▉ | 107/363 [01:17<03:59, 1.07it/s]
Loading 0: 30%|██▉ | 108/363 [01:19<04:45, 1.12s/it]
Loading 0: 30%|██▉ | 108/363 [01:19<04:45, 1.12s/it]
Loading 0: 31%|███ | 111/363 [01:21<03:50, 1.09it/s]
Loading 0: 31%|███ | 111/363 [01:21<03:50, 1.09it/s]
Loading 0: 31%|███ | 112/363 [01:23<04:30, 1.08s/it]
Loading 0: 31%|███ | 112/363 [01:23<04:30, 1.08s/it]
Loading 0: 31%|███ | 113/363 [01:25<05:14, 1.26s/it]
Loading 0: 31%|███ | 113/363 [01:25<05:14, 1.26s/it]
Loading 0: 33%|███▎ | 121/363 [01:26<01:59, 2.02it/s]
Loading 0: 33%|███▎ | 121/363 [01:26<01:59, 2.02it/s]
Loading 0: 34%|███▍ | 124/363 [01:28<02:13, 1.78it/s]
Loading 0: 34%|███▍ | 124/363 [01:28<02:13, 1.78it/s]
Loading 0: 34%|███▍ | 125/363 [01:30<02:49, 1.40it/s]
Loading 0: 34%|███▍ | 125/363 [01:30<02:49, 1.40it/s]
Loading 0: 35%|███▍ | 126/363 [01:32<03:32, 1.12it/s]
Loading 0: 35%|███▍ | 126/363 [01:32<03:32, 1.12it/s]
Loading 0: 36%|███▌ | 129/363 [01:34<03:06, 1.25it/s]
Loading 0: 36%|███▌ | 129/363 [01:34<03:06, 1.25it/s]
Loading 0: 36%|███▌ | 130/363 [01:36<03:44, 1.04it/s]
Loading 0: 36%|███▌ | 130/363 [01:36<03:44, 1.04it/s]
Loading 0: 36%|███▌ | 131/363 [01:38<04:26, 1.15s/it]
Loading 0: 36%|███▌ | 131/363 [01:38<04:26, 1.15s/it]
Loading 0: 38%|███▊ | 139/363 [01:39<01:47, 2.09it/s]
Loading 0: 38%|███▊ | 139/363 [01:39<01:47, 2.09it/s]
Loading 0: 39%|███▉ | 142/363 [01:41<02:00, 1.83it/s]
Loading 0: 39%|███▉ | 142/363 [01:41<02:00, 1.83it/s]
Loading 0: 39%|███▉ | 143/363 [01:43<02:33, 1.44it/s]
Loading 0: 39%|███▉ | 143/363 [01:43<02:33, 1.44it/s]
Loading 0: 40%|███▉ | 144/363 [01:45<03:11, 1.14it/s]
Loading 0: 40%|████ | 147/363 [01:47<02:50, 1.27it/s]
Loading 0: 41%|████ | 148/363 [01:49<03:25, 1.05it/s]
Loading 0: 41%|████ | 149/363 [01:51<04:04, 1.14s/it]
Loading 0: 43%|████▎ | 157/363 [01:52<01:38, 2.08it/s]
Loading 0: 44%|████▍ | 160/363 [01:54<01:50, 1.83it/s]
Loading 0: 44%|████▍ | 161/363 [01:56<02:22, 1.42it/s]
Loading 0: 45%|████▍ | 162/363 [01:58<02:57, 1.13it/s]
Loading 0: 45%|████▌ | 165/363 [02:00<02:36, 1.26it/s]
Loading 0: 46%|████▌ | 166/363 [02:02<03:08, 1.04it/s]
Loading 0: 46%|████▌ | 167/363 [02:04<03:44, 1.14s/it]
Loading 0: 48%|████▊ | 175/363 [02:05<01:29, 2.09it/s]
Loading 0: 49%|████▉ | 178/363 [02:07<01:40, 1.84it/s]
Loading 0: 49%|████▉ | 179/363 [02:09<02:07, 1.44it/s]
Loading 0: 50%|████▉ | 180/363 [02:11<02:40, 1.14it/s]
Loading 0: 50%|█████ | 183/363 [02:13<02:21, 1.27it/s]
Loading 0: 51%|█████ | 184/363 [02:15<02:51, 1.04it/s]
Loading 0: 51%|█████ | 185/363 [02:17<03:27, 1.16s/it]
Loading 0: 53%|█████▎ | 193/363 [02:18<01:22, 2.05it/s]
Loading 0: 54%|█████▍ | 196/363 [02:21<01:32, 1.81it/s]
Loading 0: 54%|█████▍ | 197/363 [02:23<01:56, 1.43it/s]
Loading 0: 55%|█████▍ | 198/363 [02:25<02:25, 1.14it/s]
Loading 0: 55%|█████▌ | 201/363 [02:26<02:07, 1.27it/s]
Loading 0: 56%|█████▌ | 202/363 [02:28<02:33, 1.05it/s]
Loading 0: 56%|█████▌ | 203/363 [02:30<03:02, 1.14s/it]
Loading 0: 58%|█████▊ | 210/363 [02:31<01:17, 1.99it/s]
Loading 0: 59%|█████▉ | 214/363 [02:34<01:20, 1.85it/s]
Loading 0: 59%|█████▉ | 215/363 [02:36<01:41, 1.46it/s]
Loading 0: 60%|█████▉ | 216/363 [02:38<02:06, 1.16it/s]
Loading 0: 60%|██████ | 219/363 [02:40<01:52, 1.28it/s]
Loading 0: 61%|██████ | 220/363 [02:41<02:14, 1.06it/s]
Loading 0: 61%|██████ | 221/363 [02:43<02:40, 1.13s/it]
Loading 0: 63%|██████▎ | 229/363 [02:45<01:04, 2.09it/s]
Loading 0: 64%|██████▍ | 232/363 [02:47<01:11, 1.84it/s]
Loading 0: 64%|██████▍ | 233/363 [02:49<01:30, 1.44it/s]
Loading 0: 64%|██████▍ | 234/363 [02:51<01:52, 1.15it/s]
Loading 0: 65%|██████▌ | 237/363 [02:53<01:38, 1.28it/s]
Loading 0: 66%|██████▌ | 238/363 [02:55<01:58, 1.05it/s]
Loading 0: 66%|██████▌ | 239/363 [02:57<02:21, 1.14s/it]
Loading 0: 68%|██████▊ | 247/363 [02:58<00:55, 2.09it/s]
Loading 0: 69%|██████▉ | 250/363 [03:00<01:01, 1.84it/s]
Loading 0: 69%|██████▉ | 251/363 [03:02<01:17, 1.44it/s]
Loading 0: 69%|██████▉ | 252/363 [03:04<01:36, 1.15it/s]
Loading 0: 70%|███████ | 255/363 [03:06<01:24, 1.28it/s]
Loading 0: 71%|███████ | 256/363 [03:08<01:41, 1.05it/s]
Loading 0: 71%|███████ | 257/363 [03:10<02:00, 1.14s/it]
Loading 0: 73%|███████▎ | 265/363 [03:11<00:46, 2.10it/s]
Loading 0: 74%|███████▍ | 268/363 [03:13<00:51, 1.84it/s]
Loading 0: 74%|███████▍ | 269/363 [03:15<01:05, 1.44it/s]
Loading 0: 74%|███████▍ | 270/363 [03:17<01:21, 1.14it/s]
Loading 0: 75%|███████▌ | 273/363 [03:32<03:34, 2.38s/it]
Loading 0: 75%|███████▌ | 274/363 [03:34<03:25, 2.31s/it]
Loading 0: 76%|███████▌ | 275/363 [03:36<03:19, 2.27s/it]
Loading 0: 78%|███████▊ | 283/363 [03:37<01:07, 1.18it/s]
Loading 0: 79%|███████▉ | 286/363 [03:39<01:02, 1.24it/s]
Loading 0: 79%|███████▉ | 287/363 [03:41<01:09, 1.09it/s]
Loading 0: 79%|███████▉ | 288/363 [03:43<01:20, 1.07s/it]
Loading 0: 80%|████████ | 291/363 [03:45<01:04, 1.12it/s]
Loading 0: 80%|████████ | 292/363 [03:46<01:14, 1.04s/it]
Loading 0: 81%|████████ | 293/363 [03:48<01:24, 1.21s/it]
Loading 0: 83%|████████▎ | 301/363 [03:50<00:30, 2.02it/s]
Loading 0: 84%|████████▎ | 304/363 [03:52<00:32, 1.82it/s]
Loading 0: 84%|████████▍ | 305/363 [03:54<00:40, 1.43it/s]
Loading 0: 84%|████████▍ | 306/363 [03:56<00:50, 1.14it/s]
Loading 0: 85%|████████▌ | 309/363 [03:58<00:42, 1.27it/s]
Loading 0: 85%|████████▌ | 310/363 [03:59<00:50, 1.05it/s]
Loading 0: 86%|████████▌ | 311/363 [04:01<00:59, 1.14s/it]
Loading 0: 88%|████████▊ | 319/363 [04:03<00:20, 2.12it/s]
Loading 0: 89%|████████▊ | 322/363 [04:05<00:21, 1.89it/s]
Loading 0: 89%|████████▉ | 323/363 [04:06<00:27, 1.47it/s]
Loading 0: 89%|████████▉ | 324/363 [04:09<00:33, 1.16it/s]
Loading 0: 90%|█████████ | 327/363 [04:10<00:28, 1.28it/s]
Loading 0: 90%|█████████ | 328/363 [04:12<00:33, 1.06it/s]
Loading 0: 91%|█████████ | 329/363 [04:14<00:38, 1.13s/it]
Loading 0: 93%|█████████▎| 337/363 [04:15<00:12, 2.13it/s]
Loading 0: 94%|█████████▎| 340/363 [04:18<00:12, 1.86it/s]
Loading 0: 94%|█████████▍| 341/363 [04:20<00:15, 1.45it/s]
Loading 0: 94%|█████████▍| 342/363 [04:22<00:18, 1.15it/s]
Loading 0: 95%|█████████▌| 345/363 [04:23<00:14, 1.27it/s]
Loading 0: 95%|█████████▌| 346/363 [04:25<00:16, 1.05it/s]
Loading 0: 96%|█████████▌| 347/363 [04:27<00:18, 1.14s/it]
Loading 0: 98%|█████████▊| 355/363 [04:28<00:03, 2.11it/s]
Loading 0: 99%|█████████▉| 359/363 [04:31<00:02, 1.89it/s]
Loading 0: 99%|█████████▉| 360/363 [04:33<00:02, 1.49it/s]
Loading 0: 99%|█████████▉| 361/363 [04:35<00:01, 1.19it/s]
Loading 0: 100%|██████████| 363/363 [04:35<00:00, 1.19it/s]
Loading 0: 100%|██████████| 363/363 [04:35<00:00, 1.32it/s]
Received healthy response to inference request in 3.7318568229675293s
Received healthy response to inference request in 3.0810186862945557s
Received healthy response to inference request in 3.1900863647460938s
chaiml-mistral-24b-2048-74727-v1-mkmlizer: The tokenizer you are loading from '/tmp/tmpc5ib3gsq' with an incorrect regex pattern: https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503/discussions/84#69121093e8b480e709447d5e. This will lead to incorrect tokenization. You should set the `fix_mistral_regex=True` flag when loading this tokenizer to fix this issue.
chaiml-mistral-24b-2048-74727-v1-mkmlizer: quantized model in 282.088s
chaiml-mistral-24b-2048-74727-v1-mkmlizer: Processed model ChaiML/mistral_24b_2048_gemini_opus_ds_v9_1686_merged in 383.470s
chaiml-mistral-24b-2048-74727-v1-mkmlizer: creating bucket guanaco-mkml-models
chaiml-mistral-24b-2048-74727-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-mistral-24b-2048-74727-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-mistral-24b-2048-74727-v1/nvidia
chaiml-mistral-24b-2048-74727-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-74727-v1/nvidia/config.json
chaiml-mistral-24b-2048-74727-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-74727-v1/nvidia/special_tokens_map.json
Received healthy response to inference request in 3.001359701156616s
chaiml-mistral-24b-2048-74727-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-74727-v1/nvidia/tokenizer_config.json
chaiml-mistral-24b-2048-74727-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-74727-v1/nvidia/tokenizer.json
Received healthy response to inference request in 3.404947519302368s
5 requests
0 failed requests
5th percentile: 3.0172914981842043
10th percentile: 3.033223295211792
20th percentile: 3.0650868892669676
30th percentile: 3.102832221984863
40th percentile: 3.1464592933654787
50th percentile: 3.1900863647460938
60th percentile: 3.2760308265686033
70th percentile: 3.3619752883911134
80th percentile: 3.4703293800354005
90th percentile: 3.6010931015014647
95th percentile: 3.666474962234497
99th percentile: 3.718780450820923
mean time: 3.2818538188934325
Pipeline stage StressChecker completed in 22.68s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.73s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 1.82s
Shutdown handler de-registered
chaiml-mistral-24b-2048_25909_v1 status is now deployed due to DeploymentManager action
chaiml-mistral-24b-2048_25909_v1 status is now inactive due to auto deactivation removed underperforming models