Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name junhua024-chai-06-full-71610-v19-mkmlizer
Waiting for job on junhua024-chai-06-full-71610-v19-mkmlizer to finish
junhua024-chai-06-full-71610-v19-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
junhua024-chai-06-full-71610-v19-mkmlizer: ║ ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ Version: 0.29.15 ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ https://mk1.ai ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ The license key for the current software has been verified as ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ belonging to: ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ Chai Research Corp. ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
junhua024-chai-06-full-71610-v19-mkmlizer: ║ ║
junhua024-chai-06-full-71610-v19-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
Failed to get response for submission zmeeks-capitanito-54-2800_v10: HTTPConnectionPool(host='zmeeks-capitanito-54-2800-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
Failed to get response for submission chaiml-gy-exp186-sft-co_52976_v4: HTTPConnectionPool(host='chaiml-gy-exp186-sft-co-52976-v4-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-06-full-71610-v19-mkmlizer: Downloaded to shared memory in 67.879s
junhua024-chai-06-full-71610-v19-mkmlizer: Checking if junhua024/chai_06_full_02102_1619_2024 already exists in ChaiML
junhua024-chai-06-full-71610-v19-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp_ykmakwm, device:0
junhua024-chai-06-full-71610-v19-mkmlizer: Saving flywheel model at /dev/shm/model_cache
junhua024-chai-06-full-71610-v19-mkmlizer: quantized model in 32.396s
junhua024-chai-06-full-71610-v19-mkmlizer: Processed model junhua024/chai_06_full_02102_1619_2024 in 100.332s
junhua024-chai-06-full-71610-v19-mkmlizer: creating bucket guanaco-mkml-models
junhua024-chai-06-full-71610-v19-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
junhua024-chai-06-full-71610-v19-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/junhua024-chai-06-full-71610-v19/nvidia
junhua024-chai-06-full-71610-v19-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/junhua024-chai-06-full-71610-v19/nvidia/config.json
junhua024-chai-06-full-71610-v19-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/junhua024-chai-06-full-71610-v19/nvidia/special_tokens_map.json
junhua024-chai-06-full-71610-v19-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/junhua024-chai-06-full-71610-v19/nvidia/tokenizer_config.json
junhua024-chai-06-full-71610-v19-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/junhua024-chai-06-full-71610-v19/nvidia/tokenizer.json
junhua024-chai-06-full-71610-v19-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/junhua024-chai-06-full-71610-v19/nvidia/flywheel_model.0.safetensors
junhua024-chai-06-full-71610-v19-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%| | 2/363 [00:00<00:29, 12.42it/s]
Loading 0: 1%|▏ | 5/363 [00:00<00:22, 15.72it/s]
Loading 0: 3%|▎ | 12/363 [00:00<00:12, 27.04it/s]
Loading 0: 5%|▍ | 17/363 [00:00<00:11, 29.11it/s]
Loading 0: 6%|▋ | 23/363 [00:00<00:10, 31.13it/s]
Loading 0: 9%|▊ | 31/363 [00:00<00:07, 42.36it/s]
Loading 0: 10%|▉ | 36/363 [00:01<00:10, 31.98it/s]
Loading 0: 11%|█▏ | 41/363 [00:01<00:09, 32.82it/s]
Loading 0: 14%|█▍ | 50/363 [00:01<00:07, 40.16it/s]
Loading 0: 15%|█▌ | 55/363 [00:01<00:09, 33.71it/s]
Loading 0: 16%|█▋ | 59/363 [00:01<00:09, 31.80it/s]
Loading 0: 18%|█▊ | 65/363 [00:02<00:08, 33.13it/s]
Loading 0: 19%|█▉ | 69/363 [00:02<00:09, 31.61it/s]
Loading 0: 20%|██ | 74/363 [00:02<00:08, 35.13it/s]
Loading 0: 21%|██▏ | 78/363 [00:02<00:08, 35.39it/s]
Loading 0: 23%|██▎ | 82/363 [00:02<00:09, 30.13it/s]
Loading 0: 24%|██▎ | 86/363 [00:02<00:08, 30.82it/s]
Loading 0: 25%|██▌ | 91/363 [00:02<00:08, 31.57it/s]
Loading 0: 27%|██▋ | 97/363 [00:03<00:08, 32.78it/s]
Loading 0: 28%|██▊ | 101/363 [00:03<00:08, 32.45it/s]
Loading 0: 29%|██▉ | 105/363 [00:03<00:08, 30.04it/s]
Loading 0: 31%|███ | 113/363 [00:03<00:06, 36.70it/s]
Loading 0: 32%|███▏ | 117/363 [00:03<00:08, 29.31it/s]
Loading 0: 34%|███▎ | 122/363 [00:03<00:07, 31.03it/s]
Loading 0: 35%|███▌ | 128/363 [00:03<00:07, 32.82it/s]
Loading 0: 36%|███▋ | 132/363 [00:04<00:07, 30.72it/s]
Loading 0: 38%|███▊ | 138/363 [00:04<00:07, 31.91it/s]
Loading 0: 39%|███▉ | 143/363 [00:04<00:06, 32.16it/s]
Loading 0: 41%|████ | 149/363 [00:04<00:06, 32.93it/s]
Loading 0: 43%|████▎ | 155/363 [00:04<00:05, 37.48it/s]
Loading 0: 44%|████▍ | 160/363 [00:04<00:05, 34.30it/s]
Loading 0: 45%|████▌ | 164/363 [00:05<00:05, 34.10it/s]
Loading 0: 46%|████▋ | 168/363 [00:05<00:05, 32.52it/s]
Loading 0: 48%|████▊ | 176/363 [00:05<00:04, 38.61it/s]
Loading 0: 50%|████▉ | 180/363 [00:05<00:05, 30.83it/s]
Loading 0: 51%|█████ | 185/363 [00:05<00:05, 32.61it/s]
Loading 0: 53%|█████▎ | 191/363 [00:05<00:04, 34.49it/s]
Loading 0: 54%|█████▎ | 195/363 [00:05<00:05, 32.51it/s]
Loading 0: 55%|█████▌ | 201/363 [00:06<00:04, 33.20it/s]
Loading 0: 57%|█████▋ | 206/363 [00:06<00:04, 33.00it/s]
Loading 0: 58%|█████▊ | 212/363 [00:06<00:04, 34.19it/s]
Loading 0: 61%|██████ | 220/363 [00:06<00:03, 43.36it/s]
Loading 0: 62%|██████▏ | 225/363 [00:06<00:04, 32.96it/s]
Loading 0: 63%|██████▎ | 230/363 [00:06<00:03, 33.26it/s]
Loading 0: 65%|██████▌ | 237/363 [00:07<00:03, 40.67it/s]
Loading 0: 67%|██████▋ | 242/363 [00:07<00:03, 36.22it/s]
Loading 0: 68%|██████▊ | 247/363 [00:07<00:03, 36.76it/s]
Loading 0: 69%|██████▉ | 252/363 [00:07<00:02, 39.27it/s]
Loading 0: 71%|███████ | 257/363 [00:07<00:03, 29.75it/s]
Loading 0: 72%|███████▏ | 263/363 [00:07<00:02, 35.65it/s]
Loading 0: 74%|███████▍ | 268/363 [00:08<00:02, 34.58it/s]
Loading 0: 75%|███████▍ | 272/363 [00:08<00:02, 34.50it/s]
Loading 0: 76%|███████▌ | 276/363 [00:08<00:02, 32.87it/s]
Loading 0: 77%|███████▋ | 281/363 [00:08<00:02, 36.71it/s]
Loading 0: 79%|███████▉ | 286/363 [00:08<00:02, 35.58it/s]
Loading 0: 80%|███████▉ | 290/363 [00:08<00:02, 35.61it/s]
Loading 0: 81%|████████ | 294/363 [00:08<00:02, 33.50it/s]
Loading 0: 83%|████████▎ | 301/363 [00:08<00:01, 42.42it/s]
Loading 0: 84%|████████▍ | 306/363 [00:09<00:01, 29.55it/s]
Loading 0: 86%|████████▌ | 311/363 [00:09<00:01, 31.72it/s]
Loading 0: 87%|████████▋ | 317/363 [00:09<00:01, 33.97it/s]
Loading 0: 88%|████████▊ | 321/363 [00:09<00:01, 33.38it/s]
Loading 0: 90%|█████████ | 327/363 [00:09<00:01, 34.30it/s]
Loading 0: 91%|█████████▏| 332/363 [00:09<00:00, 33.65it/s]
Loading 0: 93%|█████████▎| 338/363 [00:10<00:00, 34.68it/s]
Loading 0: 95%|█████████▍| 344/363 [00:10<00:00, 39.64it/s]
Loading 0: 96%|█████████▌| 349/363 [00:10<00:00, 27.06it/s]
Loading 0: 97%|█████████▋| 353/363 [00:10<00:00, 24.41it/s]
Loading 0: 98%|█████████▊| 357/363 [00:10<00:00, 26.16it/s]
Job junhua024-chai-06-full-71610-v19-mkmlizer completed after 128.38s with status: succeeded
Stopping job with name junhua024-chai-06-full-71610-v19-mkmlizer
Pipeline stage MKMLizer completed in 129.01s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.16s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service junhua024-chai-06-full-71610-v19
Waiting for inference service junhua024-chai-06-full-71610-v19 to be ready
Failed to get response for submission zmeeks-capitanito-54-2800_v10: HTTPConnectionPool(host='zmeeks-capitanito-54-2800-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-bat-boys-azeril-_87348_v1: ('http://chaiml-bat-boys-azeril-87348-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission blend_hunen_2025-06-23: HTTPConnectionPool(host='guanaco-model-mesh.k2.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission zmeeks-capitanito-53-2000_v13: HTTPConnectionPool(host='zmeeks-capitanito-53-2000-v13-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission zmeeks-capitanito-53-2000_v13: HTTPConnectionPool(host='zmeeks-capitanito-53-2000-v13-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Inference service junhua024-chai-06-full-71610-v19 ready after 321.39320278167725s
Pipeline stage MKMLDeployer completed in 322.73s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.491734266281128s
Received healthy response to inference request in 1.562023401260376s
Received healthy response to inference request in 1.5438621044158936s
Received healthy response to inference request in 1.89760422706604s
Received healthy response to inference request in 1.7378156185150146s
5 requests
0 failed requests
5th percentile: 1.54749436378479
10th percentile: 1.5511266231536864
20th percentile: 1.5583911418914795
30th percentile: 1.5971818447113038
40th percentile: 1.667498731613159
50th percentile: 1.7378156185150146
60th percentile: 1.8017310619354248
70th percentile: 1.8656465053558349
80th percentile: 2.0164302349090577
90th percentile: 2.2540822505950926
95th percentile: 2.3729082584381103
99th percentile: 2.4679690647125243
mean time: 1.8466079235076904
Pipeline stage StressChecker completed in 11.13s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.83s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.76s
Shutdown handler de-registered
junhua024-chai-06-full_71610_v19 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Skipping teardown as no inference service was successfully deployed
Pipeline stage MKMLProfilerDeleter completed in 0.12s
run pipeline stage %s
Running pipeline stage MKMLProfilerTemplater
Pipeline stage MKMLProfilerTemplater completed in 0.13s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeployer
Creating inference service junhua024-chai-06-full-71610-v19-profiler
Waiting for inference service junhua024-chai-06-full-71610-v19-profiler to be ready
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3107.85s
Shutdown handler de-registered
junhua024-chai-06-full_71610_v19 status is now inactive due to auto deactivation removed underperforming models
junhua024-chai-06-full_71610_v19 status is now torndown due to DeploymentManager action
junhua024-chai-06-full_71610_v19 status is now torndown due to DeploymentManager action