developer_uid: rirv938
submission_id: chaiml-mistral-24b-2048_74727_v3
model_name: chaiml-mistral-24b-2048_74727_v3
model_group: ChaiML/mistral_24b_2048_
status: inactive
timestamp: 2026-02-05T08:17:08+00:00
num_battles: 12794
num_wins: 6434
celo_rating: 1309.45
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/mistral_24b_2048_gemini_opus_ds_v9_1686_merged
model_architecture: MistralForCausalLM
model_num_parameters: 24096691200.0
best_of: 1
max_input_tokens: 2048
max_output_tokens: 96
reward_model: default
display_name: chaiml-mistral-24b-2048_74727_v3
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/mistral_24b_2048_gemini_opus_ds_v9_1686_merged
model_size: 24B
ranking_group: single
us_pacific_date: 2026-02-05
win_ratio: 0.5028919806159137
generation_params: {'temperature': 1.0, 'top_p': 0.95, 'min_p': 0.05, 'top_k': 60, 'presence_penalty': 0.1, 'frequency_penalty': 0.0, 'stopping_words': ['<|im_start|>', '</s>', 'You:', '###', '<|im_end|>'], 'max_input_tokens': 2048, 'best_of': 1, 'max_output_tokens': 96}
formatter: {'memory_template': '[SYSTEM_PROMPT]Respond as a high quality storyteller.[/SYSTEM_PROMPT][INST]', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '[/INST]{bot_name}:', 'truncate_by_message': False}
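The formatter templates and the win statistics above can be illustrated with a minimal sketch. It assumes the templates are concatenated in the order memory, prompt, conversation turns, response, and that `win_ratio` is simply `num_wins / num_battles`. The function names, the `turns` structure, and the character names are hypothetical and not taken from the pipeline; truncation to `max_input_tokens` is not modeled.

```python
# Hypothetical sketch: one plausible way the formatter templates could be
# combined into a prompt. The assembly order is an assumption, not the
# pipeline's confirmed behavior.
FORMATTER = {
    "memory_template": "[SYSTEM_PROMPT]Respond as a high quality storyteller.[/SYSTEM_PROMPT][INST]",
    "prompt_template": "",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "[/INST]{bot_name}:",
}


def build_prompt(formatter, bot_name, user_name, turns):
    """Join the templates into one string.

    turns is a list of (speaker, message) pairs, speaker being "bot" or "user".
    """
    parts = [formatter["memory_template"], formatter["prompt_template"]]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(formatter["bot_template"].format(bot_name=bot_name, message=message))
        else:
            parts.append(formatter["user_template"].format(user_name=user_name, message=message))
    # The response template leaves the bot's name open for the model to continue.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


def win_ratio(num_wins, num_battles):
    """Fraction of battles won (assumed definition of the win_ratio field)."""
    return num_wins / num_battles


if __name__ == "__main__":
    print(build_prompt(FORMATTER, "Ava", "You", [("user", "Hi"), ("bot", "Hello")]))
    print(win_ratio(6434, 12794))
```

Under these assumptions, the recorded `num_wins` and `num_battles` reproduce the recorded `win_ratio` to within floating-point tolerance.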
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-mistral-24b-2048-74727-v3-mkmlizer
Waiting for job on chaiml-mistral-24b-2048-74727-v3-mkmlizer to finish
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-mistral-24b-2048-25909-v3-mkmlizer
Waiting for job on chaiml-mistral-24b-2048-25909-v3-mkmlizer to finish
chaiml-mistral-24b-2048-74727-v3-mkmlizer: bash: cannot set terminal process group (-1): Inappropriate ioctl for device
chaiml-mistral-24b-2048-74727-v3-mkmlizer: bash: no job control in this shell
chaiml-mistral-24b-2048-74727-v3-mkmlizer: /root/miniconda3/envs/nvidia/lib/python3.11/site-packages/mk1/__init__.py:1: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
chaiml-mistral-24b-2048-74727-v3-mkmlizer: __import__('pkg_resources').declare_namespace(__name__)
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ Version: 0.30.6+torch280 ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ https://mk1.ai ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ belonging to: ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ Chai Research Corp. ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ║ ║
chaiml-mistral-24b-2048-74727-v3-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-mistral-24b-2048-25909-v3-mkmlizer: /root/miniconda3/envs/nvidia/lib/python3.11/site-packages/mk1/__init__.py:1: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
chaiml-mistral-24b-2048-25909-v3-mkmlizer: __import__('pkg_resources').declare_namespace(__name__)
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ Version: 0.30.6+torch280 ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ https://mk1.ai ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ belonging to: ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ Chai Research Corp. ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ║ ║
chaiml-mistral-24b-2048-25909-v3-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-mistral-24b-2048-25909-v3-mkmlizer: Downloaded to shared memory in 81.543s
chaiml-mistral-24b-2048-25909-v3-mkmlizer: Checking if ChaiML/mistral_24b_2048_gemini_opus_ds_v9_843_merged already exists in ChaiML
chaiml-mistral-24b-2048-25909-v3-mkmlizer: quantizing model to /dev/shm/model_cache, profile:q4, folder:/tmp/tmp9p2swb3z, device:0
chaiml-mistral-24b-2048-25909-v3-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-mistral-24b-2048-74727-v3-mkmlizer: Downloaded to shared memory in 86.547s
chaiml-mistral-24b-2048-74727-v3-mkmlizer: Checking if ChaiML/mistral_24b_2048_gemini_opus_ds_v9_1686_merged already exists in ChaiML
chaiml-mistral-24b-2048-74727-v3-mkmlizer: quantizing model to /dev/shm/model_cache, profile:q4, folder:/tmp/tmptal3v4qj, device:0
chaiml-mistral-24b-2048-74727-v3-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-mistral-24b-2048-25909-v3-mkmlizer: Loading 0: 100%|██████████| 363/363 [04:36<00:00, 1.31it/s]
chaiml-mistral-24b-2048-25909-v3-mkmlizer: The tokenizer you are loading from '/tmp/tmp9p2swb3z' with an incorrect regex pattern: https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503/discussions/84#69121093e8b480e709447d5e. This will lead to incorrect tokenization. You should set the `fix_mistral_regex=True` flag when loading this tokenizer to fix this issue.
chaiml-mistral-24b-2048-25909-v3-mkmlizer: quantized model in 283.556s
chaiml-mistral-24b-2048-25909-v3-mkmlizer: Processed model ChaiML/mistral_24b_2048_gemini_opus_ds_v9_843_merged in 365.099s
chaiml-mistral-24b-2048-25909-v3-mkmlizer: creating bucket guanaco-mkml-models
chaiml-mistral-24b-2048-25909-v3-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-mistral-24b-2048-25909-v3-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-mistral-24b-2048-25909-v3/nvidia
chaiml-mistral-24b-2048-25909-v3-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-25909-v3/nvidia/config.json
chaiml-mistral-24b-2048-25909-v3-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-25909-v3/nvidia/special_tokens_map.json
chaiml-mistral-24b-2048-25909-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-25909-v3/nvidia/tokenizer_config.json
chaiml-mistral-24b-2048-25909-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-25909-v3/nvidia/tokenizer.json
chaiml-mistral-24b-2048-25909-v3-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/chaiml-mistral-24b-2048-25909-v3/nvidia/flywheel_model.1.safetensors
chaiml-mistral-24b-2048-74727-v3-mkmlizer: Loading 0: 78%|███████▊ | 282/363 [04:08<01:22, 1.02s/it]
79%|███████▉ | 286/363 [04:11<01:09, 1.10it/s] Loading 0: 79%|███████▉ | 286/363 [04:11<01:09, 1.10it/s] Loading 0: 79%|███████▉ | 287/363 [04:13<01:19, 1.04s/it] Loading 0: 79%|███████▉ | 287/363 [04:13<01:19, 1.04s/it] Loading 0: 79%|███████▉ | 288/363 [04:16<01:30, 1.21s/it] Loading 0: 79%|███████▉ | 288/363 [04:16<01:30, 1.21s/it] Loading 0: 80%|████████ | 291/363 [04:18<01:14, 1.04s/it] Loading 0: 80%|████████ | 291/363 [04:18<01:14, 1.04s/it] Loading 0: 80%|████████ | 292/363 [04:20<01:25, 1.20s/it] Loading 0: 80%|████████ | 292/363 [04:20<01:25, 1.20s/it] Loading 0: 81%|████████ | 293/363 [04:22<01:37, 1.39s/it] Loading 0: 81%|████████ | 293/363 [04:22<01:37, 1.39s/it] Loading 0: 83%|████████▎ | 300/363 [04:23<00:37, 1.66it/s] Loading 0: 83%|████████▎ | 300/363 [04:23<00:37, 1.66it/s] Loading 0: 84%|████████▎ | 304/363 [04:26<00:37, 1.58it/s] Loading 0: 84%|████████▎ | 304/363 [04:26<00:37, 1.58it/s] Loading 0: 84%|████████▍ | 305/363 [04:28<00:46, 1.25it/s] Loading 0: 84%|████████▍ | 305/363 [04:28<00:46, 1.25it/s] Loading 0: 84%|████████▍ | 306/363 [04:30<00:57, 1.00s/it] Loading 0: 84%|████████▍ | 306/363 [04:30<00:57, 1.00s/it] Loading 0: 85%|████████▌ | 309/363 [04:33<00:49, 1.09it/s] Loading 0: 85%|████████▌ | 309/363 [04:33<00:49, 1.09it/s] Loading 0: 85%|████████▌ | 310/363 [04:35<00:58, 1.10s/it] Loading 0: 85%|████████▌ | 310/363 [04:35<00:58, 1.10s/it] Loading 0: 86%|████████▌ | 311/363 [04:37<01:08, 1.31s/it] Loading 0: 86%|████████▌ | 311/363 [04:37<01:08, 1.31s/it] Loading 0: 88%|████████▊ | 318/363 [04:38<00:25, 1.75it/s] Loading 0: 88%|████████▊ | 318/363 [04:38<00:25, 1.75it/s] Loading 0: 89%|████████▊ | 322/363 [04:41<00:25, 1.63it/s] Loading 0: 89%|████████▊ | 322/363 [04:41<00:25, 1.63it/s] Loading 0: 89%|████████▉ | 323/363 [04:43<00:31, 1.28it/s] Loading 0: 89%|████████▉ | 323/363 [04:43<00:31, 1.28it/s] Loading 0: 89%|████████▉ | 324/363 [04:46<00:38, 1.01it/s] Loading 0: 89%|████████▉ | 324/363 [04:46<00:38, 1.01it/s] Loading 0: 
90%|█████████ | 327/363 [04:48<00:32, 1.12it/s] Loading 0: 90%|█████████ | 327/363 [04:48<00:32, 1.12it/s] Loading 0: 90%|█████████ | 328/363 [04:50<00:37, 1.08s/it] Loading 0: 90%|█████████ | 328/363 [04:50<00:37, 1.08s/it] Loading 0: 91%|█████████ | 329/363 [04:52<00:43, 1.29s/it] Loading 0: 91%|█████████ | 329/363 [04:52<00:43, 1.29s/it] Loading 0: 93%|█████████▎| 336/363 [04:53<00:15, 1.77it/s] Loading 0: 93%|█████████▎| 336/363 [04:53<00:15, 1.77it/s] Loading 0: 94%|█████████▎| 340/363 [04:56<00:14, 1.64it/s] Loading 0: 94%|█████████▎| 340/363 [04:56<00:14, 1.64it/s] Loading 0: 94%|█████████▍| 341/363 [04:58<00:17, 1.28it/s] Loading 0: 94%|█████████▍| 341/363 [04:58<00:17, 1.28it/s] Loading 0: 94%|█████████▍| 342/363 [05:00<00:20, 1.01it/s] Loading 0: 94%|█████████▍| 342/363 [05:00<00:20, 1.01it/s] Loading 0: 95%|█████████▌| 345/363 [05:03<00:16, 1.12it/s] Loading 0: 95%|█████████▌| 345/363 [05:03<00:16, 1.12it/s] Loading 0: 95%|█████████▌| 346/363 [05:05<00:18, 1.08s/it] Loading 0: 95%|█████████▌| 346/363 [05:05<00:18, 1.08s/it] Loading 0: 96%|█████████▌| 347/363 [05:07<00:20, 1.29s/it] Loading 0: 96%|█████████▌| 347/363 [05:07<00:20, 1.29s/it] Loading 0: 98%|█████████▊| 354/363 [05:08<00:05, 1.77it/s] Loading 0: 98%|█████████▊| 354/363 [05:08<00:05, 1.77it/s] Loading 0: 99%|█████████▉| 359/363 [05:11<00:02, 1.69it/s] Loading 0: 99%|█████████▉| 359/363 [05:11<00:02, 1.69it/s] Loading 0: 99%|█████████▉| 360/363 [05:14<00:02, 1.33it/s] Loading 0: 99%|█████████▉| 360/363 [05:14<00:02, 1.33it/s] Loading 0: 99%|█████████▉| 361/363 [05:16<00:01, 1.06it/s] Loading 0: 99%|█████████▉| 361/363 [05:16<00:01, 1.06it/s] Loading 0: 100%|██████████| 363/363 [05:16<00:00, 1.06it/s] Loading 0: 100%|██████████| 363/363 [05:16<00:00, 1.15it/s]
chaiml-mistral-24b-2048-74727-v3-mkmlizer: The tokenizer you are loading from '/tmp/tmptal3v4qj' with an incorrect regex pattern: https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503/discussions/84#69121093e8b480e709447d5e. This will lead to incorrect tokenization. You should set the `fix_mistral_regex=True` flag when loading this tokenizer to fix this issue.
chaiml-mistral-24b-2048-74727-v3-mkmlizer: quantized model in 323.514s
chaiml-mistral-24b-2048-74727-v3-mkmlizer: Processed model ChaiML/mistral_24b_2048_gemini_opus_ds_v9_1686_merged in 410.062s
chaiml-mistral-24b-2048-74727-v3-mkmlizer: creating bucket guanaco-mkml-models
chaiml-mistral-24b-2048-74727-v3-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-mistral-24b-2048-74727-v3-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-mistral-24b-2048-74727-v3/nvidia
chaiml-mistral-24b-2048-74727-v3-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-74727-v3/nvidia/config.json
chaiml-mistral-24b-2048-74727-v3-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-74727-v3/nvidia/special_tokens_map.json
chaiml-mistral-24b-2048-74727-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-74727-v3/nvidia/tokenizer_config.json
chaiml-mistral-24b-2048-74727-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-mistral-24b-2048-74727-v3/nvidia/tokenizer.json
chaiml-mistral-24b-2048-25909-v3-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-mistral-24b-2048-25909-v3/nvidia/flywheel_model.0.safetensors
Job chaiml-mistral-24b-2048-25909-v3-mkmlizer completed after 442.52s with status: succeeded
Stopping job with name chaiml-mistral-24b-2048-25909-v3-mkmlizer
Pipeline stage MKMLizer completed in 446.17s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.41s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-mistral-24b-2048-25909-v3
Waiting for inference service chaiml-mistral-24b-2048-25909-v3 to be ready
chaiml-mistral-24b-2048-74727-v3-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/chaiml-mistral-24b-2048-74727-v3/nvidia/flywheel_model.1.safetensors
chaiml-mistral-24b-2048-74727-v3-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-mistral-24b-2048-74727-v3/nvidia/flywheel_model.0.safetensors
Job chaiml-mistral-24b-2048-74727-v3-mkmlizer completed after 486.32s with status: succeeded
Stopping job with name chaiml-mistral-24b-2048-74727-v3-mkmlizer
Pipeline stage MKMLizer completed in 489.47s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 1.14s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-mistral-24b-2048-74727-v3
Waiting for inference service chaiml-mistral-24b-2048-74727-v3 to be ready
Inference service chaiml-mistral-24b-2048-74727-v3 ready after 182.6200406551361s
Pipeline stage MKMLDeployer completed in 184.84s
run pipeline stage %s
Running pipeline stage StressChecker
Inference service chaiml-mistral-24b-2048-25909-v3 ready after 223.04792428016663s
Pipeline stage MKMLDeployer completed in 225.40s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.3940508365631104s
Received healthy response to inference request in 2.8161115646362305s
Received healthy response to inference request in 3.0677101612091064s
Received healthy response to inference request in 2.4905922412872314s
Received healthy response to inference request in 2.9952447414398193s
Received healthy response to inference request in 3.072671890258789s
Received healthy response to inference request in 2.7964963912963867s
Received healthy response to inference request in 2.8471078872680664s
Received healthy response to inference request in 2.8030452728271484s
5 requests
0 failed requests
5th percentile: 2.5618953704833984
10th percentile: 2.6331984996795654
20th percentile: 2.7758047580718994
30th percentile: 2.8912283420562743
40th percentile: 2.9794692516326906
Received healthy response to inference request in 2.926142454147339s
50th percentile: 3.0677101612091064
5 requests
60th percentile: 3.0696948528289796
0 failed requests
70th percentile: 3.0716795444488527
5th percentile: 2.797806167602539
80th percentile: 3.1369476795196536
10th percentile: 2.7991159439086912
90th percentile: 3.2654992580413817
20th percentile: 2.801735496520996
95th percentile: 3.329775047302246
30th percentile: 2.805658531188965
99th percentile: 3.3811956787109376
40th percentile: 2.8108850479125977
mean time: 2.974426603317261
50th percentile: 2.8161115646362305
Pipeline stage StressChecker completed in 29.07s
60th percentile: 2.8601239204406737
run pipeline stage %s
70th percentile: 2.9041362762451173
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
80th percentile: 2.939962911605835
90th percentile: 2.967603826522827
run_pipeline:run_in_cloud %s
95th percentile: 2.9814242839813234
starting trigger_guanaco_pipeline args=%s
99th percentile: 2.9924806499481202
mean time: 2.8674080848693846
triggered trigger_guanaco_pipeline args=%s
Pipeline stage StressChecker completed in 30.49s
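Note on the StressChecker statistics above: two runs report at the same time, so their response lines and percentile lines are interleaved. The reported percentiles and means are consistent with linear interpolation over five sorted response times per run. The sketch below reproduces them. The `percentile` helper and the split of the ten logged response times into `run_a` and `run_b` are assumptions inferred from the arithmetic; the log itself does not say which response belongs to which run, and the StressChecker's actual implementation is not shown.

```python
from statistics import mean


def percentile(samples, q):
    """Return the q-th percentile (0-100) using linear interpolation
    between the two nearest ranks of the sorted samples."""
    ordered = sorted(samples)
    if not ordered:
        raise ValueError("percentile() needs at least one sample")
    pos = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lo = int(pos)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (pos - lo) * (ordered[hi] - ordered[lo])


# Assumed grouping of the logged response times (seconds), inferred from the
# reported statistics rather than stated in the log.
run_a = [
    3.3940508365631104,
    3.0677101612091064,
    2.4905922412872314,
    3.072671890258789,
    2.8471078872680664,
]
run_b = [
    2.8161115646362305,
    2.9952447414398193,
    2.7964963912963867,
    2.8030452728271484,
    2.926142454147339,
]

for name, run in (("run_a", run_a), ("run_b", run_b)):
    print(name, "p5:", percentile(run, 5), "p50:", percentile(run, 50),
          "p99:", percentile(run, 99), "mean:", mean(run))
```

With only five samples per run, the high percentiles are interpolated between the two slowest responses, so the 95th and 99th percentile figures say little about tail latency.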
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 4.70s
run pipeline stage %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.93s
Pipeline stage TriggerMKMLProfilingPipeline completed in 2.98s
run pipeline stage %s
Shutdown handler de-registered
Running pipeline stage TriggerMKMLProfilingPipeline
chaiml-mistral-24b-2048_74727_v3 status is now deployed due to DeploymentManager action
chaiml-mistral-24b-2048_74727_v3 status is now inactive due to auto deactivation removed underperforming models