developer_uid: rirv938
submission_id: chaiml-mistral-24b-2048_83002_v3
model_name: chaiml-mistral-24b-2048_83002_v3
model_group: ChaiML/mistral_24b_2048_
status: torndown
timestamp: 2026-02-10T00:48:21+00:00
num_battles: 11218
num_wins: 5968
celo_rating: 1327.39
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/mistral_24b_2048_gemini_opus_ds_v10_843_merged
model_architecture: MistralForCausalLM
model_num_parameters: 24096691200.0
best_of: 8
max_input_tokens: 2048
max_output_tokens: 96
reward_model: default
display_name: chaiml-mistral-24b-2048_83002_v3
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/mistral_24b_2048_gemini_opus_ds_v10_843_merged
model_size: 24B
ranking_group: single
us_pacific_date: 2026-02-06
win_ratio: 0.5320021394187913
generation_params: {'temperature': 1.0, 'top_p': 0.95, 'min_p': 0.05, 'top_k': 60, 'presence_penalty': 0.1, 'frequency_penalty': 0.0, 'stopping_words': ['<|im_end|>', 'You:', '</s>', '###', '<|im_start|>'], 'max_input_tokens': 2048, 'best_of': 8, 'max_output_tokens': 96}
formatter: {'memory_template': '[SYSTEM_PROMPT]Respond as a high quality storyteller.[/SYSTEM_PROMPT][INST]', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '[/INST]{bot_name}:', 'truncate_by_message': False}
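The formatter entry above can be read as a recipe for assembling the text sent to the model. The following is a minimal sketch only: the template strings are copied from the metadata, but the concatenation order (memory, prompt, chat turns, response), the helper name `build_prompt`, and the example names `Mira` and `You` are assumptions for illustration, not the platform's confirmed implementation. Truncation to `max_input_tokens` is not modeled.

```python
# Template strings copied verbatim from the `formatter` metadata entry.
FORMATTER = {
    "memory_template": "[SYSTEM_PROMPT]Respond as a high quality storyteller.[/SYSTEM_PROMPT][INST]",
    "prompt_template": "",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "[/INST]{bot_name}:",
}


def build_prompt(turns, bot_name, user_name, formatter=FORMATTER):
    """Hypothetical helper: join the templates in an ASSUMED order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    # The memory and prompt templates here contain no placeholders,
    # so they are used as literal strings.
    parts = [formatter["memory_template"], formatter["prompt_template"]]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response template leaves the bot's name open for the model to complete.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


prompt = build_prompt(
    [("user", "Hi"), ("bot", "Hello!"), ("user", "Tell me a story.")],
    bot_name="Mira",
    user_name="You",
)
print(prompt)
```

Under these assumptions the prompt ends with `[/INST]Mira:`, so generation continues as the bot's next line, and a `stopping_words` entry such as `You:` would cut generation off before the model starts writing the user's turn.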
Resubmit model
run pipeline stage %s
Shutdown handler not registered because Python interpreter is not running in the main thread
Running pipeline stage MKMLizer
run pipeline %s
run pipeline stage %s
Starting job with name chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer
Running pipeline stage MKMLizer
Waiting for job on chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer to finish
Starting job with name chaiml-mistral-24b-2048-83002-v3-mkmlizer
Waiting for job on chaiml-mistral-24b-2048-83002-v3-mkmlizer to finish
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: bash: cannot set terminal process group (-1): Inappropriate ioctl for device
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: bash: no job control in this shell
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: bash: cannot set terminal process group (-1): Inappropriate ioctl for device
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: bash: no job control in this shell
chaiml-mistral-24b-2048-83002-v3-mkmlizer: bash: cannot set terminal process group (-1): Inappropriate ioctl for device
chaiml-mistral-24b-2048-83002-v3-mkmlizer: bash: no job control in this shell
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: /root/miniconda3/envs/nvidia/lib/python3.11/site-packages/mk1/__init__.py:1: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: __import__('pkg_resources').declare_namespace(__name__)
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ ║
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: /root/miniconda3/envs/nvidia/lib/python3.11/site-packages/mk1/__init__.py:1: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: __import__('pkg_resources').declare_namespace(__name__)
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: /root/miniconda3/envs/nvidia/lib/python3.11/site-packages/mk1/__init__.py:1: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ ║
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: __import__('pkg_resources').declare_namespace(__name__)
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ Version: 0.30.6+torch280 ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ https://mk1.ai ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ ║
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ Version: 0.30.6+torch280 ║
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ belonging to: ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ Chai Research Corp. ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ Version: 0.30.6+torch280 ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ https://mk1.ai ║
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ ║
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ║ ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ https://mk1.ai ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ belonging to: ║
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ Chai Research Corp. ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ belonging to: ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ Chai Research Corp. ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ║ ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ║ ║
chaiml-mistral-24b-2048-83002-v3-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: Downloaded to shared memory in 87.084s
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: Checking if ChaiML/98p_2ff_chaiml_mistral_24b_2048_83002_v1_cp936_merged already exists in ChaiML
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:q4, folder:/tmp/tmp3ocv_ubs, device:0
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: Downloaded to shared memory in 101.515s
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: Checking if ChaiML/98p_2ff_chaiml_mistral_24b_2048_83002_v1_cp468_merged already exists in ChaiML
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:q4, folder:/tmp/tmp59blmu93, device:0
chaiml-mistral-24b-2048-83002-v3-mkmlizer: Downloaded to shared memory in 96.452s
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-mistral-24b-2048-83002-v3-mkmlizer: Checking if ChaiML/mistral_24b_2048_gemini_opus_ds_v10_843_merged already exists in ChaiML
chaiml-mistral-24b-2048-83002-v3-mkmlizer: quantizing model to /dev/shm/model_cache, profile:q4, folder:/tmp/tmpvj83r09j, device:0
chaiml-mistral-24b-2048-83002-v3-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: Loading 0: 100%|██████████| 363/363 [04:35<00:00, 1.32it/s]
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: The tokenizer you are loading from '/tmp/tmp3ocv_ubs' with an incorrect regex pattern: https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503/discussions/84#69121093e8b480e709447d5e. This will lead to incorrect tokenization. You should set the `fix_mistral_regex=True` flag when loading this tokenizer to fix this issue.
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: quantized model in 281.869s
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: Processed model ChaiML/98p_2ff_chaiml_mistral_24b_2048_83002_v1_cp936_merged in 368.953s
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: creating bucket guanaco-mkml-models
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-98p-2ff-chaiml-m-32069-v1/nvidia
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-98p-2ff-chaiml-m-32069-v1/nvidia/config.json
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-98p-2ff-chaiml-m-32069-v1/nvidia/tokenizer_config.json
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-98p-2ff-chaiml-m-32069-v1/nvidia/special_tokens_map.json
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-98p-2ff-chaiml-m-32069-v1/nvidia/tokenizer.json
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: Loading 0: 65%|██████▌ | 237/363 [02:53<01:39, 1.27it/s]
[02:53<01:39, 1.27it/s] Loading 0: 66%|██████▌ | 238/363 [02:55<01:59, 1.05it/s] Loading 0: 66%|██████▌ | 238/363 [02:55<01:59, 1.05it/s] Loading 0: 66%|██████▌ | 239/363 [02:57<02:21, 1.14s/it] Loading 0: 66%|██████▌ | 239/363 [02:57<02:21, 1.14s/it] Loading 0: 68%|██████▊ | 246/363 [02:58<00:59, 1.97it/s] Loading 0: 68%|██████▊ | 246/363 [02:58<00:59, 1.97it/s] Loading 0: 69%|██████▉ | 250/363 [03:00<01:01, 1.84it/s] Loading 0: 69%|██████▉ | 250/363 [03:00<01:01, 1.84it/s] Loading 0: 69%|██████▉ | 251/363 [03:02<01:17, 1.45it/s] Loading 0: 69%|██████▉ | 251/363 [03:02<01:17, 1.45it/s] Loading 0: 69%|██████▉ | 252/363 [03:04<01:36, 1.15it/s] Loading 0: 69%|██████▉ | 252/363 [03:04<01:36, 1.15it/s] Loading 0: 70%|███████ | 255/363 [03:06<01:24, 1.27it/s] Loading 0: 70%|███████ | 255/363 [03:06<01:24, 1.27it/s] Loading 0: 71%|███████ | 256/363 [03:08<01:41, 1.05it/s] Loading 0: 71%|███████ | 256/363 [03:08<01:41, 1.05it/s] Loading 0: 71%|███████ | 257/363 [03:10<02:01, 1.14s/it] Loading 0: 71%|███████ | 257/363 [03:10<02:01, 1.14s/it] Loading 0: 73%|███████▎ | 265/363 [03:11<00:47, 2.08it/s] Loading 0: 73%|███████▎ | 265/363 [03:11<00:47, 2.08it/s] Loading 0: 74%|███████▍ | 268/363 [03:14<00:52, 1.83it/s] Loading 0: 74%|███████▍ | 268/363 [03:14<00:52, 1.83it/s] Loading 0: 74%|███████▍ | 269/363 [03:16<01:05, 1.43it/s] Loading 0: 74%|███████▍ | 269/363 [03:16<01:05, 1.43it/s] Loading 0: 74%|███████▍ | 270/363 [03:18<01:21, 1.14it/s] Loading 0: 74%|███████▍ | 270/363 [03:18<01:21, 1.14it/s] Loading 0: 75%|███████▌ | 273/363 [03:32<03:34, 2.38s/it] Loading 0: 75%|███████▌ | 273/363 [03:32<03:34, 2.38s/it] Loading 0: 75%|███████▌ | 274/363 [03:34<03:25, 2.31s/it] Loading 0: 75%|███████▌ | 274/363 [03:34<03:25, 2.31s/it] Loading 0: 76%|███████▌ | 275/363 [03:36<03:19, 2.27s/it] Loading 0: 76%|███████▌ | 275/363 [03:36<03:19, 2.27s/it] Loading 0: 78%|███████▊ | 283/363 [03:38<01:08, 1.18it/s] Loading 0: 78%|███████▊ | 283/363 [03:38<01:08, 1.18it/s] Loading 0: 
79%|███████▉ | 286/363 [03:40<01:03, 1.22it/s] Loading 0: 79%|███████▉ | 286/363 [03:40<01:03, 1.22it/s] Loading 0: 79%|███████▉ | 287/363 [03:42<01:11, 1.06it/s] Loading 0: 79%|███████▉ | 287/363 [03:42<01:11, 1.06it/s] Loading 0: 79%|███████▉ | 288/363 [03:44<01:21, 1.09s/it] Loading 0: 79%|███████▉ | 288/363 [03:44<01:21, 1.09s/it] Loading 0: 80%|████████ | 291/363 [03:46<01:06, 1.08it/s] Loading 0: 80%|████████ | 291/363 [03:46<01:06, 1.08it/s] Loading 0: 80%|████████ | 292/363 [03:47<01:16, 1.07s/it] Loading 0: 80%|████████ | 292/363 [03:47<01:16, 1.07s/it] Loading 0: 81%|████████ | 293/363 [03:50<01:26, 1.24s/it] Loading 0: 81%|████████ | 293/363 [03:50<01:26, 1.24s/it] Loading 0: 83%|████████▎ | 301/363 [03:51<00:31, 1.98it/s] Loading 0: 83%|████████▎ | 301/363 [03:51<00:31, 1.98it/s] Loading 0: 84%|████████▎ | 304/363 [03:53<00:33, 1.76it/s] Loading 0: 84%|████████▎ | 304/363 [03:53<00:33, 1.76it/s] Loading 0: 84%|████████▍ | 305/363 [03:55<00:41, 1.40it/s] Loading 0: 84%|████████▍ | 305/363 [03:55<00:41, 1.40it/s] Loading 0: 84%|████████▍ | 306/363 [03:57<00:50, 1.12it/s] Loading 0: 84%|████████▍ | 306/363 [03:57<00:50, 1.12it/s] Loading 0: 85%|████████▌ | 309/363 [03:59<00:43, 1.25it/s] Loading 0: 85%|████████▌ | 309/363 [03:59<00:43, 1.25it/s] Loading 0: 85%|████████▌ | 310/363 [04:01<00:51, 1.03it/s] Loading 0: 85%|████████▌ | 310/363 [04:01<00:51, 1.03it/s] Loading 0: 86%|████████▌ | 311/363 [04:03<00:59, 1.15s/it] Loading 0: 86%|████████▌ | 311/363 [04:03<00:59, 1.15s/it] Loading 0: 88%|████████▊ | 319/363 [04:04<00:21, 2.08it/s] Loading 0: 88%|████████▊ | 319/363 [04:04<00:21, 2.08it/s] Loading 0: 89%|████████▊ | 322/363 [04:06<00:22, 1.83it/s] Loading 0: 89%|████████▊ | 322/363 [04:06<00:22, 1.83it/s] Loading 0: 89%|████████▉ | 323/363 [04:08<00:28, 1.43it/s] Loading 0: 89%|████████▉ | 323/363 [04:08<00:28, 1.43it/s] Loading 0: 89%|████████▉ | 324/363 [04:10<00:34, 1.13it/s] Loading 0: 89%|████████▉ | 324/363 [04:10<00:34, 1.13it/s] Loading 0: 
90%|█████████ | 327/363 [04:12<00:28, 1.26it/s] Loading 0: 90%|█████████ | 327/363 [04:12<00:28, 1.26it/s] Loading 0: 90%|█████████ | 328/363 [04:14<00:33, 1.04it/s] Loading 0: 90%|█████████ | 328/363 [04:14<00:33, 1.04it/s] Loading 0: 91%|█████████ | 329/363 [04:16<00:39, 1.15s/it] Loading 0: 91%|█████████ | 329/363 [04:16<00:39, 1.15s/it] Loading 0: 93%|█████████▎| 337/363 [04:17<00:12, 2.09it/s] Loading 0: 93%|█████████▎| 337/363 [04:17<00:12, 2.09it/s] Loading 0: 94%|█████████▎| 340/363 [04:19<00:12, 1.83it/s] Loading 0: 94%|█████████▎| 340/363 [04:19<00:12, 1.83it/s] Loading 0: 94%|█████████▍| 341/363 [04:21<00:15, 1.43it/s] Loading 0: 94%|█████████▍| 341/363 [04:21<00:15, 1.43it/s] Loading 0: 94%|█████████▍| 342/363 [04:23<00:18, 1.13it/s] Loading 0: 94%|█████████▍| 342/363 [04:23<00:18, 1.13it/s] Loading 0: 95%|█████████▌| 345/363 [04:25<00:14, 1.26it/s] Loading 0: 95%|█████████▌| 345/363 [04:25<00:14, 1.26it/s] Loading 0: 95%|█████████▌| 346/363 [04:27<00:16, 1.04it/s] Loading 0: 95%|█████████▌| 346/363 [04:27<00:16, 1.04it/s] Loading 0: 96%|█████████▌| 347/363 [04:29<00:18, 1.15s/it] Loading 0: 96%|█████████▌| 347/363 [04:29<00:18, 1.15s/it] Loading 0: 98%|█████████▊| 355/363 [04:30<00:03, 2.09it/s] Loading 0: 98%|█████████▊| 355/363 [04:30<00:03, 2.09it/s] Loading 0: 99%|█████████▉| 359/363 [04:33<00:02, 1.88it/s] Loading 0: 99%|█████████▉| 359/363 [04:33<00:02, 1.88it/s] Loading 0: 99%|█████████▉| 360/363 [04:35<00:02, 1.48it/s] Loading 0: 99%|█████████▉| 360/363 [04:35<00:02, 1.48it/s] Loading 0: 99%|█████████▉| 361/363 [04:37<00:01, 1.18it/s] Loading 0: 99%|█████████▉| 361/363 [04:37<00:01, 1.18it/s] Loading 0: 100%|██████████| 363/363 [04:37<00:00, 1.18it/s] Loading 0: 100%|██████████| 363/363 [04:37<00:00, 1.31it/s]
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: The tokenizer you are loading from '/tmp/tmp59blmu93' with an incorrect regex pattern: https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503/discussions/84#69121093e8b480e709447d5e. This will lead to incorrect tokenization. You should set the `fix_mistral_regex=True` flag when loading this tokenizer to fix this issue.
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: quantized model in 283.769s
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: Processed model ChaiML/98p_2ff_chaiml_mistral_24b_2048_83002_v1_cp468_merged in 385.284s
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: creating bucket guanaco-mkml-models
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-98p-2ff-chaiml-m-64828-v1/nvidia
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-98p-2ff-chaiml-m-64828-v1/nvidia/tokenizer.json
chaiml-mistral-24b-2048-83002-v3-mkmlizer: Loading 0: 100%|██████████| 363/363 [05:14<00:00, 1.16it/s]
chaiml-mistral-24b-2048-83002-v3-mkmlizer: The tokenizer you are loading from '/tmp/tmpvj83r09j' with an incorrect regex pattern: https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503/discussions/84#69121093e8b480e709447d5e. This will lead to incorrect tokenization. You should set the `fix_mistral_regex=True` flag when loading this tokenizer to fix this issue.
chaiml-mistral-24b-2048-83002-v3-mkmlizer: quantized model in 321.369s
chaiml-mistral-24b-2048-83002-v3-mkmlizer: Processed model ChaiML/mistral_24b_2048_gemini_opus_ds_v10_843_merged in 417.821s
chaiml-mistral-24b-2048-83002-v3-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-mistral-24b-2048-83002-v3-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-mistral-24b-2048-83002-v3/nvidia
chaiml-mistral-24b-2048-83002-v3-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-83002-v3/nvidia/special_tokens_map.json
chaiml-mistral-24b-2048-83002-v3-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-83002-v3/nvidia/config.json
chaiml-mistral-24b-2048-83002-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-83002-v3/nvidia/tokenizer_config.json
chaiml-mistral-24b-2048-83002-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-83002-v3/nvidia/tokenizer.json
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/chaiml-98p-2ff-chaiml-m-64828-v1/nvidia/flywheel_model.1.safetensors
chaiml-mistral-24b-2048-83002-v3-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/chaiml-mistral-24b-2048-83002-v3/nvidia/flywheel_model.1.safetensors
chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-98p-2ff-chaiml-m-32069-v1/nvidia/flywheel_model.0.safetensors
Job chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer completed after 1403.36s with status: succeeded
Stopping job with name chaiml-98p-2ff-chaiml-m-32069-v1-mkmlizer
Pipeline stage MKMLizer completed in 1405.92s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.54s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-98p-2ff-chaiml-m-32069-v1
Waiting for inference service chaiml-98p-2ff-chaiml-m-32069-v1 to be ready
Stopping job with name chaiml-mistral-24b-2048-83002-v3-mkmlizer
%s, retrying in %s seconds...
Starting job with name chaiml-mistral-24b-2048-83002-v3-mkmlizer
Waiting for job on chaiml-mistral-24b-2048-83002-v3-mkmlizer to finish
chaiml-mistral-24b-2048-83002-v3-mkmlizer: bash: cannot set terminal process group (-1): Inappropriate ioctl for device
chaiml-mistral-24b-2048-83002-v3-mkmlizer: bash: no job control in this shell
chaiml-mistral-24b-2048-83002-v3-mkmlizer: /root/miniconda3/envs/nvidia/lib/python3.11/site-packages/mk1/__init__.py:1: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
chaiml-mistral-24b-2048-83002-v3-mkmlizer: __import__('pkg_resources').declare_namespace(__name__)
chaiml-mistral-24b-2048-83002-v3-mkmlizer: Version: 0.30.6+torch280
chaiml-mistral-24b-2048-83002-v3-mkmlizer: Features: FLYWHEEL, CUDA
chaiml-mistral-24b-2048-83002-v3-mkmlizer: Copyright 2023-2025 MK ONE TECHNOLOGIES Inc.
chaiml-mistral-24b-2048-83002-v3-mkmlizer: https://mk1.ai
chaiml-mistral-24b-2048-83002-v3-mkmlizer: The license key for the current software has been verified as belonging to: Chai Research Corp.
chaiml-mistral-24b-2048-83002-v3-mkmlizer: Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f
chaiml-mistral-24b-2048-83002-v3-mkmlizer: Expiration: 2028-03-31 23:59:59
chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-98p-2ff-chaiml-m-64828-v1/nvidia/flywheel_model.0.safetensors
Job chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer completed after 1537.29s with status: succeeded
Stopping job with name chaiml-98p-2ff-chaiml-m-64828-v1-mkmlizer
Pipeline stage MKMLizer completed in 1541.38s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.59s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-98p-2ff-chaiml-m-64828-v1
Waiting for inference service chaiml-98p-2ff-chaiml-m-64828-v1 to be ready
chaiml-mistral-24b-2048-83002-v3-mkmlizer: Loading 0: 100%|██████████| 363/363 [04:37<00:00, 1.31it/s]
chaiml-mistral-24b-2048-83002-v3-mkmlizer: The tokenizer you are loading from '/tmp/tmppql0bdag' with an incorrect regex pattern: https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503/discussions/84#69121093e8b480e709447d5e. This will lead to incorrect tokenization. You should set the `fix_mistral_regex=True` flag when loading this tokenizer to fix this issue.
chaiml-mistral-24b-2048-83002-v3-mkmlizer: quantized model in 283.599s
chaiml-mistral-24b-2048-83002-v3-mkmlizer: Processed model ChaiML/mistral_24b_2048_gemini_opus_ds_v10_843_merged in 372.766s
chaiml-mistral-24b-2048-83002-v3-mkmlizer: creating bucket guanaco-mkml-models
chaiml-mistral-24b-2048-83002-v3-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-mistral-24b-2048-83002-v3-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-mistral-24b-2048-83002-v3/nvidia
chaiml-mistral-24b-2048-83002-v3-mkmlizer: DEBUG "sync /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-83002-v3/nvidia/config.json": object size matches
chaiml-mistral-24b-2048-83002-v3-mkmlizer: DEBUG "sync /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-mistral-24b-2048-83002-v3/nvidia/flywheel_model.0.safetensors": object size matches
chaiml-mistral-24b-2048-83002-v3-mkmlizer: DEBUG "sync /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/chaiml-mistral-24b-2048-83002-v3/nvidia/flywheel_model.1.safetensors": object size matches
chaiml-mistral-24b-2048-83002-v3-mkmlizer: DEBUG "sync /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-83002-v3/nvidia/special_tokens_map.json": object size matches
chaiml-mistral-24b-2048-83002-v3-mkmlizer: DEBUG "sync /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-83002-v3/nvidia/tokenizer.json": object size matches
chaiml-mistral-24b-2048-83002-v3-mkmlizer: DEBUG "sync /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-83002-v3/nvidia/tokenizer_config.json": object size matches
Job chaiml-mistral-24b-2048-83002-v3-mkmlizer completed after 444.96s with status: succeeded
Stopping job with name chaiml-mistral-24b-2048-83002-v3-mkmlizer
Pipeline stage MKMLizer completed in 1853.11s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.61s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-mistral-24b-2048-83002-v3
Waiting for inference service chaiml-mistral-24b-2048-83002-v3 to be ready
Inference service chaiml-98p-2ff-chaiml-m-32069-v1 ready after 608.0484683513641s
Pipeline stage MKMLDeployer completed in 609.88s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.336925029754639s
Received healthy response to inference request in 15.649279832839966s
Received healthy response to inference request in 3.3211867809295654s
Received healthy response to inference request in 3.3738884925842285s
Received healthy response to inference request in 2.952230215072632s
5 requests
0 failed requests
5th percentile: 3.0260215282440184
10th percentile: 3.0998128414154054
20th percentile: 3.247395467758179
30th percentile: 3.331727123260498
40th percentile: 3.352807807922363
50th percentile: 3.3738884925842285
60th percentile: 3.7591031074523924
70th percentile: 4.144317722320556
80th percentile: 6.599395990371706
90th percentile: 11.124337911605835
95th percentile: 13.386808872222899
99th percentile: 15.196785640716552
mean time: 5.926702070236206
%s, retrying in %s seconds...
Received healthy response to inference request in 2.721824884414673s
Received healthy response to inference request in 2.748514175415039s
Received healthy response to inference request in 3.2435078620910645s
Received healthy response to inference request in 3.819085121154785s
Received healthy response to inference request in 3.1782419681549072s
5 requests
0 failed requests
5th percentile: 2.727162742614746
10th percentile: 2.732500600814819
20th percentile: 2.743176317214966
30th percentile: 2.834459733963013
40th percentile: 3.00635085105896
50th percentile: 3.1782419681549072
60th percentile: 3.20434832572937
70th percentile: 3.230454683303833
80th percentile: 3.358623313903809
90th percentile: 3.588854217529297
95th percentile: 3.7039696693420407
99th percentile: 3.7960620307922364
mean time: 3.1422348022460938
Pipeline stage StressChecker completed in 65.96s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.65s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 3.13s
Shutdown handler de-registered
chaiml-98p-2ff-chaiml-m_32069_v1 status is now deployed due to DeploymentManager action
Inference service chaiml-98p-2ff-chaiml-m-64828-v1 ready after 577.7483968734741s
Pipeline stage MKMLDeployer completed in 579.65s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.468112230300903s
Received healthy response to inference request in 3.2519283294677734s
Received healthy response to inference request in 3.3789279460906982s
Received healthy response to inference request in 3.3054733276367188s
Received healthy response to inference request in 3.4178307056427s
5 requests
0 failed requests
5th percentile: 3.2626373291015627
10th percentile: 3.2733463287353515
20th percentile: 3.2947643280029295
30th percentile: 3.3201642513275145
40th percentile: 3.3495460987091064
50th percentile: 3.3789279460906982
60th percentile: 3.394489049911499
70th percentile: 3.4100501537323
80th percentile: 3.627887010574341
90th percentile: 4.047999620437622
95th percentile: 4.258055925369263
99th percentile: 4.426100969314575
mean time: 3.564454507827759
Pipeline stage StressChecker completed in 25.40s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.33s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 2.02s
Shutdown handler de-registered
chaiml-98p-2ff-chaiml-m_64828_v1 status is now deployed due to DeploymentManager action
Inference service chaiml-mistral-24b-2048-83002-v3 ready after 486.5790812969208s
Pipeline stage MKMLDeployer completed in 488.59s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.7262840270996094s
Received healthy response to inference request in 3.214431047439575s
Received healthy response to inference request in 2.9569032192230225s
Received healthy response to inference request in 3.5381968021392822s
Received healthy response to inference request in 3.1542811393737793s
5 requests
0 failed requests
5th percentile: 2.9963788032531737
10th percentile: 3.035854387283325
20th percentile: 3.114805555343628
30th percentile: 3.1663111209869386
40th percentile: 3.1903710842132567
50th percentile: 3.214431047439575
60th percentile: 3.343937349319458
70th percentile: 3.4734436511993407
80th percentile: 3.5758142471313477
90th percentile: 3.6510491371154785
95th percentile: 3.688666582107544
99th percentile: 3.718760538101196
mean time: 3.318019247055054
Pipeline stage StressChecker completed in 20.98s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.57s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 1.41s
Shutdown handler de-registered
chaiml-mistral-24b-2048_83002_v3 status is now deployed due to DeploymentManager action
chaiml-mistral-24b-2048_83002_v3 status is now inactive due to auto deactivation removed underperforming models
chaiml-mistral-24b-2048_83002_v3 status is now torndown due to DeploymentManager action