Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name zmeeks-capitancito-v13-mkmlizer
Waiting for job on zmeeks-capitancito-v13-mkmlizer to finish
zmeeks-capitancito-v13-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
zmeeks-capitancito-v13-mkmlizer: ║ ║
zmeeks-capitancito-v13-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
zmeeks-capitancito-v13-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
zmeeks-capitancito-v13-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
zmeeks-capitancito-v13-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
zmeeks-capitancito-v13-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
zmeeks-capitancito-v13-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
zmeeks-capitancito-v13-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
zmeeks-capitancito-v13-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
zmeeks-capitancito-v13-mkmlizer: ║ ║
zmeeks-capitancito-v13-mkmlizer: ║ Version: 0.29.15 ║
zmeeks-capitancito-v13-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
zmeeks-capitancito-v13-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
zmeeks-capitancito-v13-mkmlizer: ║ https://mk1.ai ║
zmeeks-capitancito-v13-mkmlizer: ║ ║
zmeeks-capitancito-v13-mkmlizer: ║ The license key for the current software has been verified as ║
zmeeks-capitancito-v13-mkmlizer: ║ belonging to: ║
zmeeks-capitancito-v13-mkmlizer: ║ ║
zmeeks-capitancito-v13-mkmlizer: ║ Chai Research Corp. ║
zmeeks-capitancito-v13-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
zmeeks-capitancito-v13-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
zmeeks-capitancito-v13-mkmlizer: ║ ║
zmeeks-capitancito-v13-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Failed to get response for submission zmeeks-capitanito-54-2600_v10: HTTPConnectionPool(host='zmeeks-capitanito-54-2600-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission zmeeks-capitanito-54-3000_v11: HTTPConnectionPool(host='zmeeks-capitanito-54-3000-v11-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
zmeeks-capitancito-v13-mkmlizer: Downloaded to shared memory in 42.853s
zmeeks-capitancito-v13-mkmlizer: Checking if zmeeks/capitancito already exists in ChaiML
zmeeks-capitancito-v13-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpln1im_62, device:0
zmeeks-capitancito-v13-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Failed to get response for submission zmeeks-capitanito-54-2800_v6: HTTPConnectionPool(host='zmeeks-capitanito-54-2800-v6-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
zmeeks-capitancito-v13-mkmlizer: quantized model in 30.758s
zmeeks-capitancito-v13-mkmlizer: Processed model zmeeks/capitancito in 73.694s
zmeeks-capitancito-v13-mkmlizer: creating bucket guanaco-mkml-models
zmeeks-capitancito-v13-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
zmeeks-capitancito-v13-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/zmeeks-capitancito-v13/nvidia
zmeeks-capitancito-v13-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/zmeeks-capitancito-v13/nvidia/config.json
zmeeks-capitancito-v13-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/zmeeks-capitancito-v13/nvidia/special_tokens_map.json
zmeeks-capitancito-v13-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/zmeeks-capitancito-v13/nvidia/tokenizer_config.json
zmeeks-capitancito-v13-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/zmeeks-capitancito-v13/nvidia/tokenizer.json
zmeeks-capitancito-v13-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/zmeeks-capitancito-v13/nvidia/flywheel_model.0.safetensors
zmeeks-capitancito-v13-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%| | 2/363 [00:06<18:34, 3.09s/it]
Loading 0: 2%|▏ | 6/363 [00:06<04:56, 1.21it/s]
Loading 0: 4%|▍ | 14/363 [00:06<01:36, 3.62it/s]
Loading 0: 5%|▌ | 19/363 [00:06<01:01, 5.56it/s]
Loading 0: 7%|▋ | 24/363 [00:06<00:44, 7.66it/s]
Loading 0: 9%|▊ | 31/363 [00:06<00:27, 12.07it/s]
Loading 0: 10%|▉ | 36/363 [00:06<00:21, 15.28it/s]
Loading 0: 11%|█▏ | 41/363 [00:07<00:20, 15.35it/s]
Loading 0: 13%|█▎ | 46/363 [00:07<00:16, 19.32it/s]
Loading 0: 14%|█▍ | 51/363 [00:07<00:14, 20.94it/s]
Loading 0: 16%|█▋ | 59/363 [00:07<00:10, 29.03it/s]
Loading 0: 18%|█▊ | 64/363 [00:07<00:09, 32.64it/s]
Loading 0: 19%|█▉ | 69/363 [00:07<00:09, 30.70it/s]
Loading 0: 21%|██ | 77/363 [00:08<00:07, 38.88it/s]
Loading 0: 23%|██▎ | 82/363 [00:08<00:06, 40.94it/s]
Loading 0: 24%|██▍ | 87/363 [00:08<00:07, 35.53it/s]
Loading 0: 26%|██▌ | 94/363 [00:08<00:06, 42.50it/s]
Loading 0: 27%|██▋ | 99/363 [00:08<00:06, 41.43it/s]
Loading 0: 29%|██▊ | 104/363 [00:08<00:06, 41.34it/s]
Loading 0: 30%|███ | 109/363 [00:08<00:05, 43.19it/s]
Loading 0: 31%|███▏ | 114/363 [00:09<00:07, 35.18it/s]
Loading 0: 33%|███▎ | 121/363 [00:09<00:08, 29.14it/s]
Loading 0: 34%|███▍ | 125/363 [00:09<00:08, 29.00it/s]
Loading 0: 36%|███▌ | 130/363 [00:09<00:07, 32.76it/s]
Loading 0: 37%|███▋ | 134/363 [00:09<00:06, 33.18it/s]
Loading 0: 39%|███▊ | 140/363 [00:09<00:05, 37.51it/s]
Loading 0: 40%|███▉ | 145/363 [00:09<00:05, 40.31it/s]
Loading 0: 41%|████▏ | 150/363 [00:10<00:06, 34.71it/s]
Loading 0: 43%|████▎ | 157/363 [00:10<00:04, 41.89it/s]
Loading 0: 45%|████▍ | 162/363 [00:10<00:04, 42.07it/s]
Loading 0: 46%|████▌ | 167/363 [00:10<00:04, 41.94it/s]
Loading 0: 47%|████▋ | 172/363 [00:10<00:04, 43.70it/s]
Loading 0: 49%|████▉ | 177/363 [00:10<00:05, 35.57it/s]
Loading 0: 51%|█████ | 184/363 [00:10<00:04, 41.83it/s]
Loading 0: 52%|█████▏ | 189/363 [00:11<00:04, 41.07it/s]
Loading 0: 53%|█████▎ | 194/363 [00:11<00:04, 40.69it/s]
Loading 0: 55%|█████▍ | 199/363 [00:11<00:03, 42.55it/s]
Loading 0: 56%|█████▌ | 204/363 [00:11<00:06, 26.10it/s]
Loading 0: 58%|█████▊ | 211/363 [00:11<00:04, 33.36it/s]
Loading 0: 60%|█████▉ | 216/363 [00:11<00:04, 35.25it/s]
Loading 0: 61%|██████ | 221/363 [00:12<00:03, 37.41it/s]
Loading 0: 63%|██████▎ | 227/363 [00:12<00:03, 37.00it/s]
Loading 0: 64%|██████▍ | 232/363 [00:12<00:03, 36.80it/s]
Loading 0: 66%|██████▌ | 238/363 [00:12<00:03, 40.81it/s]
Loading 0: 67%|██████▋ | 243/363 [00:12<00:02, 41.34it/s]
Loading 0: 68%|██████▊ | 248/363 [00:12<00:02, 40.57it/s]
Loading 0: 70%|██████▉ | 253/363 [00:12<00:02, 42.58it/s]
Loading 0: 71%|███████ | 258/363 [00:12<00:02, 35.47it/s]
Loading 0: 73%|███████▎ | 265/363 [00:13<00:02, 42.40it/s]
Loading 0: 74%|███████▍ | 270/363 [00:13<00:02, 42.57it/s]
Loading 0: 76%|███████▌ | 275/363 [00:13<00:02, 42.47it/s]
Loading 0: 77%|███████▋ | 280/363 [00:13<00:01, 43.61it/s]
Loading 0: 79%|███████▊ | 285/363 [00:13<00:03, 25.53it/s]
Loading 0: 80%|████████ | 292/363 [00:13<00:02, 32.48it/s]
Loading 0: 82%|████████▏ | 297/363 [00:14<00:01, 34.37it/s]
Loading 0: 83%|████████▎ | 302/363 [00:14<00:01, 35.92it/s]
Loading 0: 85%|████████▍ | 307/363 [00:14<00:01, 39.02it/s]
Loading 0: 86%|████████▌ | 312/363 [00:14<00:01, 34.09it/s]
Loading 0: 88%|████████▊ | 319/363 [00:14<00:01, 40.36it/s]
Loading 0: 89%|████████▉ | 324/363 [00:14<00:00, 39.60it/s]
Loading 0: 91%|█████████ | 329/363 [00:14<00:00, 40.06it/s]
Loading 0: 92%|█████████▏| 334/363 [00:14<00:00, 40.68it/s]
Loading 0: 93%|█████████▎| 339/363 [00:15<00:00, 33.63it/s]
Loading 0: 95%|█████████▌| 346/363 [00:15<00:00, 40.46it/s]
Loading 0: 97%|█████████▋| 351/363 [00:15<00:00, 40.68it/s]
Loading 0: 98%|█████████▊| 356/363 [00:15<00:00, 40.23it/s]
Loading 0: 99%|█████████▉| 361/363 [00:15<00:00, 41.05it/s]
Job zmeeks-capitancito-v13-mkmlizer completed after 95.6s with status: succeeded
Stopping job with name zmeeks-capitancito-v13-mkmlizer
Pipeline stage MKMLizer completed in 96.91s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service zmeeks-capitancito-v13
Waiting for inference service zmeeks-capitancito-v13 to be ready
Failed to get response for submission zmeeks-capitanito-54-3000_v10: HTTPConnectionPool(host='zmeeks-capitanito-54-3000-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Max retries exceeded with url: /v1/models/GPT-J-6B-lit-v2:predict (Caused by ConnectTimeoutError(<urllib3.connection.HTTPConnection object at 0x75061846c950>, 'Connection to zmeeks-capitanito-54-3000-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com timed out. (connect timeout=12.0)'))
Failed to get response for submission zmeeks-capitanito-54-2800_v6: HTTPConnectionPool(host='zmeeks-capitanito-54-2800-v6-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission zmeeks-capitanito-54-2800_v6: HTTPConnectionPool(host='zmeeks-capitanito-54-2800-v6-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission zmeeks-capitanito-54-3000_v10: HTTPConnectionPool(host='zmeeks-capitanito-54-3000-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission zmeeks-capitanito-54-2800_v6: HTTPConnectionPool(host='zmeeks-capitanito-54-2800-v6-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-bat-boys-azeril-_87348_v1: ('http://chaiml-bat-boys-azeril-87348-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission zmeeks-capitanito-54-2800_v5: HTTPConnectionPool(host='zmeeks-capitanito-54-2800-v5-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission zmeeks-capitanito-54-2600_v9: HTTPConnectionPool(host='zmeeks-capitanito-54-2600-v9-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission zmeeks-capitanito-54-3000_v11: HTTPConnectionPool(host='zmeeks-capitanito-54-3000-v11-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission zmeeks-capitanito-54-2800_v5: HTTPConnectionPool(host='zmeeks-capitanito-54-2800-v5-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission zmeeks-capitanito-54-2800_v6: HTTPConnectionPool(host='zmeeks-capitanito-54-2800-v6-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission zmeeks-capitanito-54-2600_v10: HTTPConnectionPool(host='zmeeks-capitanito-54-2600-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Inference service zmeeks-capitancito-v13 ready after 311.5041811466217s
Pipeline stage MKMLDeployer completed in 312.05s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.374077320098877s
Received healthy response to inference request in 1.4793009757995605s
Received healthy response to inference request in 1.585433006286621s
Received healthy response to inference request in 1.54752516746521s
Failed to get response for submission chaiml-bat-boys-azeril-_87348_v1: ('http://chaiml-bat-boys-azeril-87348-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Received healthy response to inference request in 1.8604443073272705s
5 requests
0 failed requests
5th percentile: 1.4929458141326903
10th percentile: 1.5065906524658204
20th percentile: 1.5338803291320802
30th percentile: 1.5551067352294923
40th percentile: 1.5702698707580567
50th percentile: 1.585433006286621
60th percentile: 1.695437526702881
70th percentile: 1.8054420471191406
80th percentile: 1.963170909881592
90th percentile: 2.1686241149902346
95th percentile: 2.2713507175445558
99th percentile: 2.3535319995880126
mean time: 1.7693561553955077
Pipeline stage StressChecker completed in 10.16s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.75s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 1.34s
Shutdown handler de-registered
zmeeks-capitancito_v13 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
clean up pipeline due to error=DeploymentChecksError('None: None')
Shutdown handler de-registered
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
clean up pipeline due to error=DeploymentChecksError('None: None')
Shutdown handler de-registered
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4843.88s
Shutdown handler de-registered
zmeeks-capitancito_v13 status is now inactive due to auto deactivation removed underperforming models
zmeeks-capitancito_v13 status is now torndown due to DeploymentManager action
zmeeks-capitancito_v13 status is now torndown due to DeploymentManager action