Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name zmeeks-capitanito-50-1600-v4-mkmlizer
Waiting for job on zmeeks-capitanito-50-1600-v4-mkmlizer to finish
zmeeks-capitanito-50-1600-v4-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ Version: 0.29.15 ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ https://mk1.ai ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ The license key for the current software has been verified as ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ belonging to: ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ Chai Research Corp. ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ║ ║
zmeeks-capitanito-50-1600-v4-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
zmeeks-capitanito-50-1600-v4-mkmlizer: Downloaded to shared memory in 31.512s
zmeeks-capitanito-50-1600-v4-mkmlizer: Checking if zmeeks/capitanito__50-1600 already exists in ChaiML
zmeeks-capitanito-50-1600-v4-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpdmb4ca6r, device:0
zmeeks-capitanito-50-1600-v4-mkmlizer: Saving flywheel model at /dev/shm/model_cache
zmeeks-capitanito-50-1600-v4-mkmlizer: quantized model in 30.659s
zmeeks-capitanito-50-1600-v4-mkmlizer: Processed model zmeeks/capitanito__50-1600 in 62.258s
zmeeks-capitanito-50-1600-v4-mkmlizer: creating bucket guanaco-mkml-models
zmeeks-capitanito-50-1600-v4-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
zmeeks-capitanito-50-1600-v4-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/zmeeks-capitanito-50-1600-v4/nvidia
zmeeks-capitanito-50-1600-v4-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/zmeeks-capitanito-50-1600-v4/nvidia/config.json
zmeeks-capitanito-50-1600-v4-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/zmeeks-capitanito-50-1600-v4/nvidia/special_tokens_map.json
zmeeks-capitanito-50-1600-v4-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/zmeeks-capitanito-50-1600-v4/nvidia/tokenizer_config.json
zmeeks-capitanito-50-1600-v4-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/zmeeks-capitanito-50-1600-v4/nvidia/tokenizer.json
zmeeks-capitanito-50-1600-v4-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/zmeeks-capitanito-50-1600-v4/nvidia/flywheel_model.0.safetensors
zmeeks-capitanito-50-1600-v4-mkmlizer:
Loading 0: 99%|█████████▉| 360/363 [00:09<00:00, 44.42it/s]
Job zmeeks-capitanito-50-1600-v4-mkmlizer completed after 84.87s with status: succeeded
Stopping job with name zmeeks-capitanito-50-1600-v4-mkmlizer
Pipeline stage MKMLizer completed in 85.43s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.16s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service zmeeks-capitanito-50-1600-v4
Waiting for inference service zmeeks-capitanito-50-1600-v4 to be ready
Failed to get response for submission junhua024-chai-02-full-062_v2: HTTPConnectionPool(host='junhua024-chai-02-full-062-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission zmeeks-capitanito-50_v3: ('http://zmeeks-capitanito-50-v3-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'read tcp 127.0.0.1:51204->127.0.0.1:8080: read: connection reset by peer\n')
Failed to get response for submission junhua024-chai-02-full-062_v2: HTTPConnectionPool(host='junhua024-chai-02-full-062-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Inference service zmeeks-capitanito-50-1600-v4 ready after 251.31918954849243s
Pipeline stage MKMLDeployer completed in 251.76s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 7.618619918823242s
Received healthy response to inference request in 1.4739065170288086s
Received healthy response to inference request in 1.6321306228637695s
{"detail":"('http://chaiml-20250611-retune-u-1558-v3-predictor.tenant-chaiml-guanaco.k2.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'read tcp 127.0.0.1:60072->127.0.0.1:8080: read: connection reset by peer\\n')"}
Received unhealthy response to inference request!
5 requests
2 failed requests
5th percentile: 1.5055513381958008
10th percentile: 1.537196159362793
20th percentile: 1.6004858016967773
30th percentile: 1.6513245582580567
40th percentile: 1.6897124290466308
50th percentile: 1.728100299835205
60th percentile: 4.08430814743042
70th percentile: 6.440515995025634
80th percentile: 10.122159767150881
90th percentile: 15.129239463806154
95th percentile: 17.63277931213379
99th percentile: 19.6356111907959
mean time: 6.51781530380249
%s, retrying in %s seconds...
Received healthy response to inference request in 1.8748180866241455s
Received healthy response to inference request in 2.0507023334503174s
Received healthy response to inference request in 1.5110127925872803s
Received healthy response to inference request in 1.0392892360687256s
Received healthy response to inference request in 1.3174479007720947s
5 requests
0 failed requests
5th percentile: 1.0949209690093995
10th percentile: 1.1505527019500732
20th percentile: 1.2618161678314208
30th percentile: 1.3561608791351318
40th percentile: 1.433586835861206
50th percentile: 1.5110127925872803
60th percentile: 1.6565349102020264
70th percentile: 1.8020570278167725
80th percentile: 1.90999493598938
90th percentile: 1.9803486347198487
95th percentile: 2.015525484085083
99th percentile: 2.0436669635772704
mean time: 1.5586540699005127
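Note: the percentile and mean lines in the summary directly above can be reproduced from the five logged latencies with linear interpolation between the sorted values. The log does not show how StressChecker actually computes them, so the sketch below is only an assumption that happens to match the logged figures; the helper name `percentile` is hypothetical.

```python
# Latencies (seconds) copied from the five "Received healthy response" lines above.
latencies = [
    1.8748180866241455,
    2.0507023334503174,
    1.5110127925872803,
    1.0392892360687256,
    1.3174479007720947,
]


def percentile(sorted_values, q):
    """Return the q-th percentile (0-100) using linear interpolation.

    Assumption: this mirrors the common "linear" definition; the real
    StressChecker implementation is not visible in the log.
    """
    rank = (len(sorted_values) - 1) * q / 100.0
    lower = int(rank)                                  # index at or below the rank
    upper = min(lower + 1, len(sorted_values) - 1)     # next index, clamped to the end
    fraction = rank - lower                            # distance between the two indices
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


ordered = sorted(latencies)
for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(ordered, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples, every percentile other than 0, 25, 50, 75 and 100 is an interpolated value, so the tail figures (95th, 99th) mostly reflect the single slowest request.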
Pipeline stage StressChecker completed in 43.00s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.74s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.87s
Shutdown handler de-registered
zmeeks-capitanito-50-1600_v4 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4095.11s
Shutdown handler de-registered
zmeeks-capitanito-50-1600_v4 status is now inactive due to auto deactivation removed underperforming models
zmeeks-capitanito-50-1600_v4 status is now torndown due to DeploymentManager action