Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name zmeeks-capitanito-49-2300-v3-mkmlizer
Waiting for job on zmeeks-capitanito-49-2300-v3-mkmlizer to finish
zmeeks-capitanito-49-2300-v3-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ Version: 0.29.15 ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ https://mk1.ai ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ The license key for the current software has been verified as ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ belonging to: ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ Chai Research Corp. ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ║ ║
zmeeks-capitanito-49-2300-v3-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
zmeeks-capitanito-49-2300-v3-mkmlizer: Downloaded to shared memory in 29.776s
zmeeks-capitanito-49-2300-v3-mkmlizer: Checking if zmeeks/capitanito__49-2300 already exists in ChaiML
zmeeks-capitanito-49-2300-v3-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmphg3whwrk, device:0
zmeeks-capitanito-49-2300-v3-mkmlizer: Saving flywheel model at /dev/shm/model_cache
zmeeks-capitanito-49-2300-v3-mkmlizer: quantized model in 32.328s
zmeeks-capitanito-49-2300-v3-mkmlizer: Processed model zmeeks/capitanito__49-2300 in 62.180s
zmeeks-capitanito-49-2300-v3-mkmlizer: creating bucket guanaco-mkml-models
zmeeks-capitanito-49-2300-v3-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
zmeeks-capitanito-49-2300-v3-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/zmeeks-capitanito-49-2300-v3/nvidia
zmeeks-capitanito-49-2300-v3-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/zmeeks-capitanito-49-2300-v3/nvidia/config.json
zmeeks-capitanito-49-2300-v3-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/zmeeks-capitanito-49-2300-v3/nvidia/special_tokens_map.json
Unable to record family friendly update due to error: HTTPConnectionPool(host='chaiml-nemo-guard-merged-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Max retries exceeded with url: /v1/models/GPT-J-6B-lit-v2:predict (Caused by ConnectTimeoutError(<urllib3.connection.HTTPConnection object at 0x7f9ed851d690>, 'Connection to chaiml-nemo-guard-merged-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com timed out. (connect timeout=12.0)'))
zmeeks-capitanito-49-2300-v3-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/zmeeks-capitanito-49-2300-v3/nvidia/flywheel_model.0.safetensors
zmeeks-capitanito-49-2300-v3-mkmlizer:
Loading 0: 99%|█████████▉| 359/363 [00:10<00:00, 34.02it/s]
Job zmeeks-capitanito-49-2300-v3-mkmlizer completed after 96.55s with status: succeeded
Stopping job with name zmeeks-capitanito-49-2300-v3-mkmlizer
Pipeline stage MKMLizer completed in 97.42s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.29s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service zmeeks-capitanito-49-2300-v3
Waiting for inference service zmeeks-capitanito-49-2300-v3 to be ready
Inference service zmeeks-capitanito-49-2300-v3 ready after 251.7068793773651s
Pipeline stage MKMLDeployer completed in 252.24s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.274343490600586s
Received healthy response to inference request in 1.7578239440917969s
Received healthy response to inference request in 1.5825769901275635s
Received healthy response to inference request in 1.5700304508209229s
5 requests
1 failed requests
5th percentile: 1.572539758682251
10th percentile: 1.575049066543579
20th percentile: 1.5800676822662354
30th percentile: 1.6176263809204101
40th percentile: 1.6877251625061036
50th percentile: 1.7578239440917969
60th percentile: 1.9644317626953125
70th percentile: 2.171039581298828
80th percentile: 5.853758287429812
90th percentile: 13.012587881088258
95th percentile: 16.592002677917478
99th percentile: 19.455534515380858
mean time: 5.471238470077514
%s, retrying in %s seconds...
Received healthy response to inference request in 1.938267469406128s
Received healthy response to inference request in 1.6345312595367432s
Received healthy response to inference request in 1.6343066692352295s
Received healthy response to inference request in 1.927762746810913s
Received healthy response to inference request in 1.7622382640838623s
5 requests
0 failed requests
5th percentile: 1.6343515872955323
10th percentile: 1.634396505355835
20th percentile: 1.6344863414764403
30th percentile: 1.660072660446167
40th percentile: 1.7111554622650147
50th percentile: 1.7622382640838623
60th percentile: 1.8284480571746826
70th percentile: 1.894657850265503
80th percentile: 1.929863691329956
90th percentile: 1.934065580368042
95th percentile: 1.936166524887085
99th percentile: 1.9378472805023192
mean time: 1.7794212818145752
Pipeline stage StressChecker completed in 39.65s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.78s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.73s
Shutdown handler de-registered
zmeeks-capitanito-49-2300_v3 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
clean up pipeline due to error=DeploymentChecksError('None: None')
Shutdown handler de-registered
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4971.04s
Shutdown handler de-registered
zmeeks-capitanito-49-2300_v3 status is now inactive due to auto deactivation removed underperforming models
zmeeks-capitanito-49-2300_v3 status is now torndown due to DeploymentManager action