Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name zmeeks-capitanito-37-v4-mkmlizer
Waiting for job on zmeeks-capitanito-37-v4-mkmlizer to finish
zmeeks-capitanito-37-v4-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
zmeeks-capitanito-37-v4-mkmlizer: ║ ║
zmeeks-capitanito-37-v4-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
zmeeks-capitanito-37-v4-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
zmeeks-capitanito-37-v4-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
zmeeks-capitanito-37-v4-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
zmeeks-capitanito-37-v4-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
zmeeks-capitanito-37-v4-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
zmeeks-capitanito-37-v4-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
zmeeks-capitanito-37-v4-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
zmeeks-capitanito-37-v4-mkmlizer: ║ ║
zmeeks-capitanito-37-v4-mkmlizer: ║ Version: 0.29.15 ║
zmeeks-capitanito-37-v4-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
zmeeks-capitanito-37-v4-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
zmeeks-capitanito-37-v4-mkmlizer: ║ https://mk1.ai ║
zmeeks-capitanito-37-v4-mkmlizer: ║ ║
zmeeks-capitanito-37-v4-mkmlizer: ║ The license key for the current software has been verified as ║
zmeeks-capitanito-37-v4-mkmlizer: ║ belonging to: ║
zmeeks-capitanito-37-v4-mkmlizer: ║ ║
zmeeks-capitanito-37-v4-mkmlizer: ║ Chai Research Corp. ║
zmeeks-capitanito-37-v4-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
zmeeks-capitanito-37-v4-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
zmeeks-capitanito-37-v4-mkmlizer: ║ ║
zmeeks-capitanito-37-v4-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
zmeeks-capitanito-37-v4-mkmlizer: Downloaded to shared memory in 32.625s
zmeeks-capitanito-37-v4-mkmlizer: Checking if zmeeks/capitanito__37 already exists in ChaiML
zmeeks-capitanito-37-v4-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpi_jeojl0, device:0
zmeeks-capitanito-37-v4-mkmlizer: Saving flywheel model at /dev/shm/model_cache
zmeeks-capitanito-37-v4-mkmlizer: quantized model in 30.981s
zmeeks-capitanito-37-v4-mkmlizer: Processed model zmeeks/capitanito__37 in 63.690s
zmeeks-capitanito-37-v4-mkmlizer: creating bucket guanaco-mkml-models
zmeeks-capitanito-37-v4-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
zmeeks-capitanito-37-v4-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/zmeeks-capitanito-37-v4/nvidia
zmeeks-capitanito-37-v4-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/zmeeks-capitanito-37-v4/nvidia/config.json
zmeeks-capitanito-37-v4-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/zmeeks-capitanito-37-v4/nvidia/special_tokens_map.json
zmeeks-capitanito-37-v4-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/zmeeks-capitanito-37-v4/nvidia/tokenizer_config.json
zmeeks-capitanito-37-v4-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/zmeeks-capitanito-37-v4/nvidia/flywheel_model.0.safetensors
zmeeks-capitanito-37-v4-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%|▏ | 5/363 [00:00<00:11, 30.94it/s]
Loading 0: 4%|▎ | 13/363 [00:00<00:07, 49.07it/s]
Loading 0: 5%|▌ | 19/363 [00:00<00:08, 42.74it/s]
Loading 0: 7%|▋ | 24/363 [00:00<00:08, 41.85it/s]
Loading 0: 9%|▊ | 31/363 [00:00<00:06, 47.75it/s]
Loading 0: 10%|█ | 37/363 [00:00<00:07, 44.23it/s]
Loading 0: 12%|█▏ | 42/363 [00:00<00:07, 43.20it/s]
Loading 0: 13%|█▎ | 49/363 [00:01<00:06, 47.87it/s]
Loading 0: 15%|█▌ | 55/363 [00:01<00:06, 44.27it/s]
Loading 0: 17%|█▋ | 60/363 [00:01<00:06, 44.82it/s]
Loading 0: 18%|█▊ | 65/363 [00:01<00:10, 28.97it/s]
Loading 0: 20%|█▉ | 71/363 [00:01<00:08, 33.85it/s]
Loading 0: 21%|██ | 76/363 [00:01<00:08, 34.45it/s]
Loading 0: 22%|██▏ | 80/363 [00:02<00:08, 34.89it/s]
Loading 0: 23%|██▎ | 84/363 [00:02<00:08, 34.50it/s]
Loading 0: 25%|██▍ | 89/363 [00:02<00:07, 37.85it/s]
Loading 0: 26%|██▌ | 94/363 [00:02<00:07, 37.47it/s]
Loading 0: 27%|██▋ | 99/363 [00:02<00:06, 39.04it/s]
Loading 0: 29%|██▊ | 104/363 [00:02<00:06, 40.43it/s]
Loading 0: 30%|███ | 109/363 [00:02<00:06, 41.51it/s]
Loading 0: 31%|███▏ | 114/363 [00:02<00:07, 34.08it/s]
Loading 0: 33%|███▎ | 118/363 [00:03<00:07, 32.61it/s]
Loading 0: 34%|███▍ | 125/363 [00:03<00:06, 39.49it/s]
Loading 0: 36%|███▌ | 130/363 [00:03<00:06, 37.04it/s]
Loading 0: 37%|███▋ | 134/363 [00:03<00:06, 37.57it/s]
Loading 0: 38%|███▊ | 138/363 [00:03<00:06, 36.59it/s]
Loading 0: 39%|███▉ | 142/363 [00:03<00:08, 26.60it/s]
Loading 0: 40%|████ | 146/363 [00:03<00:07, 28.77it/s]
Loading 0: 41%|████▏ | 150/363 [00:04<00:07, 29.06it/s]
Loading 0: 43%|████▎ | 156/363 [00:04<00:05, 34.73it/s]
Loading 0: 44%|████▍ | 160/363 [00:04<00:05, 34.08it/s]
Loading 0: 45%|████▌ | 165/363 [00:04<00:05, 37.03it/s]
Loading 0: 47%|████▋ | 169/363 [00:04<00:05, 36.51it/s]
Loading 0: 48%|████▊ | 174/363 [00:04<00:04, 38.97it/s]
Loading 0: 49%|████▉ | 178/363 [00:04<00:04, 38.38it/s]
Loading 0: 50%|█████ | 183/363 [00:04<00:04, 40.77it/s]
Loading 0: 52%|█████▏ | 188/363 [00:05<00:04, 40.96it/s]
Loading 0: 53%|█████▎ | 193/363 [00:05<00:04, 40.39it/s]
Loading 0: 55%|█████▍ | 198/363 [00:05<00:03, 42.41it/s]
Loading 0: 56%|█████▌ | 203/363 [00:05<00:04, 35.50it/s]
Loading 0: 58%|█████▊ | 210/363 [00:05<00:03, 41.87it/s]
Loading 0: 59%|█████▉ | 215/363 [00:05<00:03, 39.30it/s]
Loading 0: 61%|██████ | 220/363 [00:05<00:03, 41.41it/s]
Loading 0: 62%|██████▏ | 225/363 [00:06<00:05, 26.10it/s]
Loading 0: 63%|██████▎ | 230/363 [00:06<00:04, 28.80it/s]
Loading 0: 65%|██████▌ | 237/363 [00:06<00:03, 35.76it/s]
Loading 0: 67%|██████▋ | 242/363 [00:06<00:03, 37.20it/s]
Loading 0: 68%|██████▊ | 247/363 [00:06<00:03, 38.58it/s]
Loading 0: 69%|██████▉ | 252/363 [00:06<00:02, 40.38it/s]
Loading 0: 71%|███████ | 257/363 [00:06<00:03, 32.63it/s]
Loading 0: 73%|███████▎ | 264/363 [00:07<00:02, 39.27it/s]
Loading 0: 74%|███████▍ | 269/363 [00:07<00:02, 40.28it/s]
Loading 0: 75%|███████▌ | 274/363 [00:07<00:02, 40.78it/s]
Loading 0: 77%|███████▋ | 279/363 [00:07<00:01, 42.66it/s]
Loading 0: 78%|███████▊ | 284/363 [00:07<00:02, 35.87it/s]
Loading 0: 80%|████████ | 291/363 [00:07<00:01, 42.74it/s]
Loading 0: 82%|████████▏ | 296/363 [00:07<00:01, 42.24it/s]
Loading 0: 83%|████████▎ | 301/363 [00:07<00:01, 43.42it/s]
Loading 0: 84%|████████▍ | 306/363 [00:08<00:02, 23.16it/s]
Loading 0: 85%|████████▌ | 310/363 [00:08<00:02, 24.44it/s]
Loading 0: 87%|████████▋ | 314/363 [00:08<00:01, 26.76it/s]
Loading 0: 88%|████████▊ | 320/363 [00:08<00:01, 32.33it/s]
Loading 0: 90%|████████▉ | 326/363 [00:08<00:01, 34.05it/s]
Loading 0: 91%|█████████ | 330/363 [00:09<00:00, 33.74it/s]
Loading 0: 93%|█████████▎| 337/363 [00:09<00:00, 41.50it/s]
Loading 0: 94%|█████████▍| 342/363 [00:09<00:00, 35.41it/s]
Loading 0: 96%|█████████▌| 347/363 [00:09<00:00, 37.42it/s]
Loading 0: 97%|█████████▋| 352/363 [00:09<00:00, 40.12it/s]
Loading 0: 98%|█████████▊| 357/363 [00:09<00:00, 34.79it/s]
Job zmeeks-capitanito-37-v4-mkmlizer completed after 84.91s with status: succeeded
Stopping job with name zmeeks-capitanito-37-v4-mkmlizer
Pipeline stage MKMLizer completed in 85.40s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.19s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service zmeeks-capitanito-37-v4
Waiting for inference service zmeeks-capitanito-37-v4 to be ready
Failed to get response for submission zmeeks-capitanito-37_v2: HTTPConnectionPool(host='zmeeks-capitanito-37-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Inference service zmeeks-capitanito-37-v4 ready after 221.09533190727234s
Pipeline stage MKMLDeployer completed in 221.60s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.223395824432373s
Received healthy response to inference request in 1.5648863315582275s
Received healthy response to inference request in 1.5320310592651367s
Received healthy response to inference request in 1.770275592803955s
5 requests
1 failed requests
5th percentile: 1.5386021137237549
10th percentile: 1.545173168182373
20th percentile: 1.5583152770996094
30th percentile: 1.605964183807373
40th percentile: 1.688119888305664
50th percentile: 1.770275592803955
60th percentile: 1.9515236854553222
70th percentile: 2.1327717781066893
80th percentile: 5.802347040176395
90th percentile: 12.96024947166443
95th percentile: 16.539200687408446
99th percentile: 19.402361660003663
mean time: 5.441748142242432
%s, retrying in %s seconds...
Received healthy response to inference request in 1.6418814659118652s
Received healthy response to inference request in 1.482269525527954s
Received healthy response to inference request in 1.8786664009094238s
Received healthy response to inference request in 1.6298925876617432s
Received healthy response to inference request in 1.4973859786987305s
5 requests
0 failed requests
5th percentile: 1.4852928161621093
10th percentile: 1.4883161067962647
20th percentile: 1.4943626880645753
30th percentile: 1.523887300491333
40th percentile: 1.576889944076538
50th percentile: 1.6298925876617432
60th percentile: 1.634688138961792
70th percentile: 1.6394836902618408
80th percentile: 1.689238452911377
90th percentile: 1.7839524269104003
95th percentile: 1.831309413909912
99th percentile: 1.8691950035095215
mean time: 1.6260191917419433
Pipeline stage StressChecker completed in 37.89s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.66s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.69s
Shutdown handler de-registered
zmeeks-capitanito-37_v4 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Skipping teardown as no inference service was successfully deployed
Pipeline stage MKMLProfilerDeleter completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLProfilerTemplater
Pipeline stage MKMLProfilerTemplater completed in 0.11s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeployer
Creating inference service zmeeks-capitanito-37-v4-profiler
Waiting for inference service zmeeks-capitanito-37-v4-profiler to be ready
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 5343.65s
Shutdown handler de-registered
zmeeks-capitanito-37_v4 status is now inactive due to auto deactivation removed underperforming models
zmeeks-capitanito-37_v4 status is now torndown due to DeploymentManager action