Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-mistral31-24b-s-69496-v17-mkmlizer
Waiting for job on chaiml-mistral31-24b-s-69496-v17-mkmlizer to finish
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ Version: 0.29.3 ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ https://mk1.ai ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ belonging to: ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ Chai Research Corp. ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ║ ║
chaiml-mistral31-24b-s-69496-v17-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-mistral31-24b-s-69496-v17-mkmlizer: Downloaded to shared memory in 63.751s
chaiml-mistral31-24b-s-69496-v17-mkmlizer: Checking if ChaiML/mistral31-24b-simpoexp1-s1-new-sft-retryv2top20lex-2e already exists in ChaiML
chaiml-mistral31-24b-s-69496-v17-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpg_tozkfe, device:0
chaiml-mistral31-24b-s-69496-v17-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-mistral31-24b-s-69496-v17-mkmlizer: quantized model in 48.595s
chaiml-mistral31-24b-s-69496-v17-mkmlizer: Processed model ChaiML/mistral31-24b-simpoexp1-s1-new-sft-retryv2top20lex-2e in 112.347s
chaiml-mistral31-24b-s-69496-v17-mkmlizer: creating bucket guanaco-mkml-models
chaiml-mistral31-24b-s-69496-v17-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-mistral31-24b-s-69496-v17-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-mistral31-24b-s-69496-v17
chaiml-mistral31-24b-s-69496-v17-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-mistral31-24b-s-69496-v17/config.json
chaiml-mistral31-24b-s-69496-v17-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-mistral31-24b-s-69496-v17/special_tokens_map.json
chaiml-mistral31-24b-s-69496-v17-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-mistral31-24b-s-69496-v17/tokenizer_config.json
chaiml-mistral31-24b-s-69496-v17-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-mistral31-24b-s-69496-v17/tokenizer.json
chaiml-mistral31-24b-s-69496-v17-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/chaiml-mistral31-24b-s-69496-v17/flywheel_model.1.safetensors
chaiml-mistral31-24b-s-69496-v17-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%| | 4/363 [00:00<00:09, 39.73it/s]
Loading 0: 2%|▏ | 8/363 [00:00<00:11, 30.06it/s]
Loading 0: 3%|▎ | 12/363 [00:00<00:11, 30.51it/s]
Loading 0: 4%|▍ | 16/363 [00:00<00:12, 27.36it/s]
Loading 0: 6%|▌ | 21/363 [00:00<00:10, 31.63it/s]
Loading 0: 7%|▋ | 25/363 [00:00<00:11, 28.78it/s]
Loading 0: 9%|▉ | 32/363 [00:01<00:09, 34.10it/s]
Loading 0: 10%|▉ | 36/363 [00:01<00:16, 20.28it/s]
Loading 0: 11%|█ | 40/363 [00:01<00:14, 22.79it/s]
Loading 0: 12%|█▏ | 43/363 [00:01<00:13, 23.23it/s]
Loading 0: 13%|█▎ | 48/363 [00:01<00:11, 26.40it/s]
Loading 0: 14%|█▍ | 51/363 [00:01<00:13, 23.83it/s]
Loading 0: 16%|█▌ | 57/363 [00:02<00:10, 28.45it/s]
Loading 0: 17%|█▋ | 61/363 [00:02<00:11, 26.93it/s]
Loading 0: 18%|█▊ | 65/363 [00:02<00:11, 27.00it/s]
Loading 0: 19%|█▉ | 70/363 [00:02<00:12, 23.35it/s]
Loading 0: 20%|██ | 73/363 [00:02<00:14, 20.40it/s]
Loading 0: 22%|██▏ | 79/363 [00:03<00:11, 25.45it/s]
Loading 0: 23%|██▎ | 82/363 [00:03<00:11, 24.72it/s]
Loading 0: 24%|██▎ | 86/363 [00:03<00:10, 25.91it/s]
Loading 0: 25%|██▍ | 89/363 [00:03<00:10, 25.98it/s]
Loading 0: 25%|██▌ | 92/363 [00:03<00:13, 20.72it/s]
Loading 0: 27%|██▋ | 97/363 [00:03<00:10, 26.54it/s]
Loading 0: 28%|██▊ | 101/363 [00:03<00:11, 23.29it/s]
Loading 0: 29%|██▉ | 107/363 [00:04<00:11, 22.86it/s]
Loading 0: 31%|███ | 112/363 [00:04<00:09, 25.51it/s]
Loading 0: 32%|███▏ | 115/363 [00:04<00:09, 25.42it/s]
Loading 0: 33%|███▎ | 120/363 [00:04<00:08, 27.70it/s]
Loading 0: 34%|███▍ | 123/363 [00:04<00:09, 25.00it/s]
Loading 0: 36%|███▌ | 129/363 [00:04<00:08, 29.15it/s]
Loading 0: 36%|███▋ | 132/363 [00:05<00:09, 25.13it/s]
Loading 0: 38%|███▊ | 138/363 [00:05<00:07, 28.70it/s]
Loading 0: 39%|███▉ | 141/363 [00:05<00:08, 25.08it/s]
Loading 0: 40%|████ | 146/363 [00:05<00:07, 29.98it/s]
Loading 0: 41%|████▏ | 150/363 [00:05<00:08, 23.69it/s]
Loading 0: 42%|████▏ | 153/363 [00:06<00:10, 20.45it/s]
Loading 0: 43%|████▎ | 157/363 [00:06<00:08, 23.57it/s]
Loading 0: 44%|████▍ | 160/363 [00:06<00:08, 23.89it/s]
Loading 0: 45%|████▌ | 165/363 [00:06<00:07, 26.24it/s]
Loading 0: 46%|████▋ | 168/363 [00:06<00:08, 23.41it/s]
Loading 0: 48%|████▊ | 174/363 [00:06<00:06, 27.87it/s]
Loading 0: 49%|████▉ | 177/363 [00:06<00:07, 24.82it/s]
Loading 0: 50%|█████ | 182/363 [00:07<00:06, 27.00it/s]
Loading 0: 52%|█████▏ | 187/363 [00:07<00:07, 23.77it/s]
Loading 0: 52%|█████▏ | 190/363 [00:07<00:07, 22.15it/s]
Loading 0: 53%|█████▎ | 193/363 [00:07<00:07, 23.23it/s]
Loading 0: 54%|█████▍ | 196/363 [00:07<00:07, 23.66it/s]
Loading 0: 55%|█████▌ | 200/363 [00:22<00:06, 23.66it/s]
Loading 0: 55%|█████▌ | 201/363 [00:22<03:02, 1.13s/it]
Loading 0: 56%|█████▌ | 203/363 [00:22<02:30, 1.06it/s]
Loading 0: 57%|█████▋ | 208/363 [00:22<01:30, 1.71it/s]
Loading 0: 58%|█████▊ | 211/363 [00:22<01:09, 2.17it/s]
Loading 0: 59%|█████▉ | 214/363 [00:22<00:52, 2.86it/s]
Loading 0: 60%|██████ | 218/363 [00:23<00:35, 4.07it/s]
Loading 0: 61%|██████ | 221/363 [00:23<00:27, 5.18it/s]
Loading 0: 62%|██████▏ | 224/363 [00:23<00:22, 6.14it/s]
Loading 0: 63%|██████▎ | 229/363 [00:23<00:14, 9.07it/s]
Loading 0: 64%|██████▍ | 232/363 [00:23<00:12, 10.74it/s]
Loading 0: 65%|██████▌ | 237/363 [00:23<00:08, 14.33it/s]
Loading 0: 66%|██████▌ | 240/363 [00:24<00:08, 15.09it/s]
Loading 0: 68%|██████▊ | 246/363 [00:24<00:05, 20.08it/s]
Loading 0: 69%|██████▊ | 249/363 [00:24<00:05, 19.42it/s]
Loading 0: 70%|███████ | 255/363 [00:24<00:04, 24.08it/s]
Loading 0: 71%|███████ | 258/363 [00:24<00:04, 22.10it/s]
Loading 0: 72%|███████▏ | 262/363 [00:24<00:03, 25.28it/s]
Loading 0: 74%|███████▎ | 267/363 [00:25<00:04, 21.89it/s]
Loading 0: 74%|███████▍ | 270/363 [00:25<00:04, 18.69it/s]
Loading 0: 75%|███████▌ | 274/363 [00:25<00:04, 21.93it/s]
Loading 0: 76%|███████▋ | 277/363 [00:25<00:03, 22.43it/s]
Loading 0: 77%|███████▋ | 280/363 [00:25<00:03, 23.56it/s]
Loading 0: 78%|███████▊ | 283/363 [00:25<00:03, 24.27it/s]
Loading 0: 79%|███████▉ | 286/363 [00:25<00:03, 24.30it/s]
Loading 0: 80%|████████ | 291/363 [00:26<00:02, 25.75it/s]
Loading 0: 81%|████████ | 294/363 [00:26<00:02, 23.12it/s]
Loading 0: 82%|████████▏ | 299/363 [00:26<00:02, 25.79it/s]
Loading 0: 84%|████████▎ | 304/363 [00:26<00:02, 22.66it/s]
Loading 0: 85%|████████▍ | 307/363 [00:26<00:02, 21.33it/s]
Loading 0: 85%|████████▌ | 310/363 [00:27<00:02, 22.60it/s]
Loading 0: 86%|████████▌ | 313/363 [00:27<00:02, 23.03it/s]
Loading 0: 88%|████████▊ | 318/363 [00:27<00:01, 26.08it/s]
Loading 0: 88%|████████▊ | 321/363 [00:27<00:01, 23.75it/s]
Loading 0: 90%|█████████ | 327/363 [00:27<00:01, 28.45it/s]
Loading 0: 91%|█████████ | 330/363 [00:27<00:01, 25.31it/s]
Loading 0: 92%|█████████▏| 335/363 [00:27<00:01, 27.60it/s]
Loading 0: 93%|█████████▎| 338/363 [00:28<00:00, 26.16it/s]
Loading 0: 94%|█████████▍| 341/363 [00:28<00:01, 15.25it/s]
Loading 0: 96%|█████████▌| 347/363 [00:28<00:00, 20.75it/s]
Loading 0: 96%|█████████▋| 350/363 [00:28<00:00, 21.53it/s]
Loading 0: 97%|█████████▋| 353/363 [00:28<00:00, 23.03it/s]
Loading 0: 98%|█████████▊| 357/363 [00:29<00:00, 21.07it/s]
Job chaiml-mistral31-24b-s-69496-v17-mkmlizer completed after 146.64s with status: succeeded
Stopping job with name chaiml-mistral31-24b-s-69496-v17-mkmlizer
Pipeline stage MKMLizer completed in 147.25s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.16s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-mistral31-24b-s-69496-v17
Waiting for inference service chaiml-mistral31-24b-s-69496-v17 to be ready
Inference service chaiml-mistral31-24b-s-69496-v17 ready after 210.84263944625854s
Pipeline stage MKMLDeployer completed in 211.31s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.644351005554199s
Received healthy response to inference request in 2.085150957107544s
Received healthy response to inference request in 2.0736639499664307s
Received healthy response to inference request in 1.8364880084991455s
5 requests
1 failed requests
5th percentile: 1.8839231967926025
10th percentile: 1.9313583850860596
20th percentile: 2.0262287616729737
30th percentile: 2.0759613513946533
40th percentile: 2.0805561542510986
50th percentile: 2.085150957107544
60th percentile: 2.308830976486206
70th percentile: 2.532510995864868
80th percentile: 6.145797157287601
90th percentile: 13.148689460754397
95th percentile: 16.65013561248779
99th percentile: 19.45129253387451
mean time: 5.758247137069702
%s, retrying in %s seconds...
Received healthy response to inference request in 2.05292010307312s
Received healthy response to inference request in 2.221593141555786s
Received healthy response to inference request in 1.9460511207580566s
Received healthy response to inference request in 1.923701286315918s
Received healthy response to inference request in 1.8664240837097168s
5 requests
0 failed requests
5th percentile: 1.877879524230957
10th percentile: 1.8893349647521973
20th percentile: 1.9122458457946778
30th percentile: 1.9281712532043458
40th percentile: 1.9371111869812012
50th percentile: 1.9460511207580566
60th percentile: 1.9887987136840821
70th percentile: 2.0315463066101076
80th percentile: 2.086654710769653
90th percentile: 2.1541239261627196
95th percentile: 2.187858533859253
99th percentile: 2.2148462200164794
mean time: 2.0021379470825194
Pipeline stage StressChecker completed in 41.37s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.75s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.78s
Shutdown handler de-registered
chaiml-mistral31-24b-s_69496_v17 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Skipping teardown as no inference service was successfully deployed
Pipeline stage MKMLProfilerDeleter completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLProfilerTemplater
Pipeline stage MKMLProfilerTemplater completed in 0.12s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeployer
Creating inference service chaiml-mistral31-24b-s-69496-v17-profiler
Waiting for inference service chaiml-mistral31-24b-s-69496-v17-profiler to be ready
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3202.14s
Shutdown handler de-registered
chaiml-mistral31-24b-s_69496_v17 status is now inactive due to auto deactivation removed underperforming models
chaiml-mistral31-24b-s_69496_v17 status is now torndown due to DeploymentManager action