developer_uid: richhx
submission_id: chaiml-02f4-69d4-linear-w01_v52
model_name: chaiml-02f4-69d4-linear-w01_v52
model_group: ChaiML/02f4-69d4-linear-
status: inactive
timestamp: 2026-01-20T00:38:12+00:00
num_battles: 10872
num_wins: 5460
celo_rating: 1304.63
family_friendly_score: 0.507
family_friendly_standard_error: 0.007070374813261317
submission_type: basic
model_repo: ChaiML/02f4-69d4-linear-w01
model_architecture: MistralForCausalLM
model_num_parameters: 24096691200.0
best_of: 8
max_input_tokens: 1024
max_output_tokens: 64
reward_model: default
latencies: [{'batch_size': 1, 'throughput': 0.5418206704355055, 'latency_mean': 1.845565038919449, 'latency_p50': 1.8575966358184814, 'latency_p90': 2.0328501224517823}, {'batch_size': 3, 'throughput': 1.0931378220729622, 'latency_mean': 2.7364771461486814, 'latency_p50': 2.7583203315734863, 'latency_p90': 3.018396997451782}, {'batch_size': 5, 'throughput': 1.4079614634571065, 'latency_mean': 3.536314581632614, 'latency_p50': 3.538602828979492, 'latency_p90': 4.03434808254242}, {'batch_size': 6, 'throughput': 1.4981533071572342, 'latency_mean': 3.9752520728111267, 'latency_p50': 4.0149922370910645, 'latency_p90': 4.470614385604859}, {'batch_size': 8, 'throughput': 1.6465398013598425, 'latency_mean': 4.817346085309982, 'latency_p50': 4.852675557136536, 'latency_p90': 5.522289133071899}, {'batch_size': 10, 'throughput': 1.703652163829832, 'latency_mean': 5.824680143594742, 'latency_p50': 5.787641167640686, 'latency_p90': 6.609719252586364}]
gpu_counts: {'NVIDIA L40S': 1}
display_name: chaiml-02f4-69d4-linear-w01_v52
is_internal_developer: True
language_model: ChaiML/02f4-69d4-linear-w01
model_size: 24B
ranking_group: single
throughput_3p7s: 1.45
us_pacific_date: 2026-01-19
win_ratio: 0.5022075055187638
generation_params: {'temperature': 0.7, 'top_p': 0.95, 'min_p': 0.025, 'top_k': 80, 'presence_penalty': 0.4, 'frequency_penalty': 0.4, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '<|im_start|>system\n{memory}<|im_end|>\n', 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{user_name}: {message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': True}
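Note: the summary fields above can be cross-checked against each other. The following is a minimal sketch, not part of the pipeline. It recomputes win_ratio as num_wins / num_battles, and shows that family_friendly_standard_error is consistent with a binomial standard error sqrt(p * (1 - p) / n). The sample size n = 5000 is an assumption inferred from the reported numbers; it is not stated anywhere in this log. The helper names are hypothetical.

```python
import math

# Values copied from the metadata header above.
num_battles = 10872
num_wins = 5460
family_friendly_score = 0.507

# ASSUMPTION: the log does not state how many samples were scored.
# 5000 is inferred because it reproduces the reported standard error.
ASSUMED_SAMPLE_SIZE = 5000


def win_ratio(wins: int, battles: int) -> float:
    """Fraction of battles won."""
    return wins / battles


def binomial_standard_error(p: float, n: int) -> float:
    """Standard error of a proportion p estimated from n samples."""
    return math.sqrt(p * (1.0 - p) / n)


ratio = win_ratio(num_wins, num_battles)
se = binomial_standard_error(family_friendly_score, ASSUMED_SAMPLE_SIZE)
print(f"win_ratio={ratio:.6f} standard_error={se:.6f}")
```

If the true sample size differs, the standard error scales with 1 / sqrt(n), so the agreement here only suggests, and does not confirm, the value of n.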
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-02f4-69d4-linear-w01-v52-mkmlizer
Waiting for job on chaiml-02f4-69d4-linear-w01-v52-mkmlizer to finish
chaiml-02f4-69d4-linear-w01-v52-mkmlizer: bash: cannot set terminal process group (-1): Inappropriate ioctl for device
chaiml-02f4-69d4-linear-w01-v52-mkmlizer: bash: no job control in this shell
chaiml-02f4-69d4-linear-w01-v52-mkmlizer: Downloaded to shared memory in 54.533s
chaiml-02f4-69d4-linear-w01-v52-mkmlizer: Checking if ChaiML/02f4-69d4-linear-w01 already exists in ChaiML
chaiml-02f4-69d4-linear-w01-v52-mkmlizer: quantizing model to /dev/shm/model_cache, profile:q4, folder:/tmp/tmpuxtp6ac5, device:0
chaiml-02f4-69d4-linear-w01-v52-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-kimid-v8b-kimid-63800-v17-uploader
Waiting for job on chaiml-kimid-v8b-kimid-63800-v17-uploader to finish
chaiml-kimid-v8b-kimid-63800-v17-uploader: /root/miniconda3/envs/nvidia/lib/python3.11/site-packages/mk1/__init__.py:1: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
chaiml-kimid-v8b-kimid-63800-v17-uploader: __import__('pkg_resources').declare_namespace(__name__)
chaiml-kimid-v8b-kimid-63800-v17-uploader: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-kimid-v8b-kimid-63800-v17-uploader: ║ ║
chaiml-kimid-v8b-kimid-63800-v17-uploader: ║ ║
chaiml-kimid-v8b-kimid-63800-v17-uploader: ║ Version: 0.30.6+torch280 ║
chaiml-kimid-v8b-kimid-63800-v17-uploader: ║ Features: FLYWHEEL, CUDA ║
chaiml-kimid-v8b-kimid-63800-v17-uploader: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
chaiml-kimid-v8b-kimid-63800-v17-uploader: ║ https://mk1.ai ║
chaiml-kimid-v8b-kimid-63800-v17-uploader: ║ ║
chaiml-kimid-v8b-kimid-63800-v17-uploader: ║ The license key for the current software has been verified as ║
chaiml-kimid-v8b-kimid-63800-v17-uploader: ║ belonging to: ║
chaiml-kimid-v8b-kimid-63800-v17-uploader: ║ ║
chaiml-kimid-v8b-kimid-63800-v17-uploader: ║ Chai Research Corp. ║
chaiml-kimid-v8b-kimid-63800-v17-uploader: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-kimid-v8b-kimid-63800-v17-uploader: ║ Expiration: 2028-03-31 23:59:59 ║
chaiml-kimid-v8b-kimid-63800-v17-uploader: ║ ║
chaiml-kimid-v8b-kimid-63800-v17-uploader: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-02f4-69d4-linear-w01-v52-mkmlizer: Loading 0: 100%|██████████| 363/363 [04:28<00:00, 1.35it/s]
chaiml-kimid-v8b-kimid-63800-v17-uploader: Downloaded to shared memory in 107.448s
chaiml-02f4-69d4-linear-w01-v52-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-02f4-69d4-linear-w01-v52/nvidia/flywheel_model.0.safetensors
chaiml-kimid-v8b-kimid-63800-v17-uploader: Processed model ChaiML/kimid-v8b-kimidv5a-lr5e6ep2r64g4b01-int4-mixed in 167.629s
Job chaiml-02f4-69d4-linear-w01-v52-mkmlizer completed after 794.68s with status: succeeded
chaiml-kimid-v8b-kimid-63800-v17-uploader: creating bucket guanaco-vllm-models
Stopping job with name chaiml-02f4-69d4-linear-w01-v52-mkmlizer
chaiml-kimid-v8b-kimid-63800-v17-uploader: Bucket 's3://guanaco-vllm-models/' created
Pipeline stage MKMLizer completed in 802.49s
chaiml-kimid-v8b-kimid-63800-v17-uploader: uploading /dev/shm/model_cache to s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17
run pipeline stage %s
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/added_tokens.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/added_tokens.json
Running pipeline stage MKMLTemplater
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/.gitattributes s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/.gitattributes
Pipeline stage MKMLTemplater completed in 4.55s
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/chat_template.jinja s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/chat_template.jinja
run pipeline stage %s
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/generation_config.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/generation_config.json
Running pipeline stage MKMLDeployer
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/special_tokens_map.json
Creating inference service chaiml-02f4-69d4-linear-w01-v52
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/README.md s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/README.md
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/quantization_config.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/quantization_config.json
Waiting for inference service chaiml-02f4-69d4-linear-w01-v52 to be ready
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/merges.txt s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/merges.txt
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/tokenizer_config.json
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/config.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/config.json
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/vocab.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/vocab.json
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model.safetensors.index.json
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/tokenizer.json
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model-00027-of-00027.safetensors
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model-00019-of-00027.safetensors
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model-00001-of-00027.safetensors
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model-00020-of-00027.safetensors
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model-00010-of-00027.safetensors
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model-00002-of-00027.safetensors
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model-00017-of-00027.safetensors
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model-00009-of-00027.safetensors
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model-00025-of-00027.safetensors
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model-00024-of-00027.safetensors
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model-00023-of-00027.safetensors
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model-00021-of-00027.safetensors
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model-00015-of-00027.safetensors
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model-00006-of-00027.safetensors
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model-00018-of-00027.safetensors
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model-00016-of-00027.safetensors
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model-00011-of-00027.safetensors
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model-00012-of-00027.safetensors
chaiml-kimid-v8b-kimid-63800-v17-uploader: cp /dev/shm/model_cache/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-63800-v17/model-00008-of-00027.safetensors
Job chaiml-kimid-v8b-kimid-63800-v17-uploader completed after 380.43s with status: succeeded
Stopping job with name chaiml-kimid-v8b-kimid-63800-v17-uploader
Pipeline stage VLLMUploader completed in 384.02s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.36s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-kimid-v8b-kimid-63800-v17
Waiting for inference service chaiml-kimid-v8b-kimid-63800-v17 to be ready
Inference service chaiml-02f4-69d4-linear-w01-v52 ready after 376.56437039375305s
Pipeline stage MKMLDeployer completed in 399.61s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.798482656478882s
Received healthy response to inference request in 1.976459264755249s
Received healthy response to inference request in 2.8762776851654053s
Received healthy response to inference request in 2.438427686691284s
Received healthy response to inference request in 1.9075284004211426s
5 requests
0 failed requests
5th percentile: 1.921314573287964
10th percentile: 1.9351007461547851
20th percentile: 1.9626730918884276
30th percentile: 2.068852949142456
40th percentile: 2.25364031791687
50th percentile: 2.438427686691284
60th percentile: 2.5824496746063232
70th percentile: 2.7264716625213623
80th percentile: 2.8140416622161863
90th percentile: 2.845159673690796
95th percentile: 2.8607186794281008
99th percentile: 2.8731658840179444
mean time: 2.3994351387023927
Pipeline stage StressChecker completed in 23.99s
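Note: the StressChecker percentiles above can be reproduced from the five logged response times using linear interpolation between order statistics. This is a minimal sketch under that assumption; the log does not say which library or method the StressChecker actually uses, and the `percentile` helper is hypothetical.

```python
from statistics import mean

# Response times (seconds) copied from the five "healthy response" lines above.
latencies = [
    2.798482656478882,
    1.976459264755249,
    2.8762776851654053,
    2.438427686691284,
    1.9075284004211426,
]


def percentile(values, q):
    """q-th percentile (0-100) using linear interpolation between sorted values."""
    ordered = sorted(values)
    rank = (q / 100.0) * (len(ordered) - 1)  # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])


for q in (5, 50, 90, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean(latencies)}")
```

With only five requests, the upper percentiles are interpolated between the two slowest responses, so they say little about tail latency.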
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 3.59s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 2.81s
Shutdown handler de-registered
chaiml-02f4-69d4-linear-w01_v52 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2255.37s
Shutdown handler de-registered
chaiml-02f4-69d4-linear-w01_v52 status is now inactive due to auto deactivation removed underperforming models
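Note: stage timings can be pulled out of a log like this one by matching the "Pipeline stage ... completed in ...s" lines. The following is a minimal sketch; the regular expression and the `stage_durations` helper are hypothetical and only assume the line format seen above.

```python
import re

# Matches lines such as "Pipeline stage MKMLizer completed in 802.49s".
STAGE_RE = re.compile(r"Pipeline stage (\w+) completed in ([\d.]+)s")


def stage_durations(lines):
    """Map each stage name to its reported duration in seconds."""
    durations = {}
    for line in lines:
        match = STAGE_RE.search(line)
        if match:
            durations[match.group(1)] = float(match.group(2))
    return durations


# A few lines copied from the log above; the last one should be ignored.
sample = [
    "Pipeline stage MKMLizer completed in 802.49s",
    "Pipeline stage MKMLTemplater completed in 4.55s",
    "Pipeline stage MKMLDeployer completed in 399.61s",
    "Pipeline stage StressChecker completed in 23.99s",
    "Running pipeline stage StressChecker",
]

durations = stage_durations(sample)
slowest = max(durations, key=durations.get)
print(f"slowest stage: {slowest} ({durations[slowest]}s)")
```

Because this log interleaves output from more than one job, a stage name that appears twice would keep only its last duration in this sketch.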