Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name zmeeks-capitanito-54-2800-v7-mkmlizer
Waiting for job on zmeeks-capitanito-54-2800-v7-mkmlizer to finish
zmeeks-capitanito-54-2800-v7-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ Version: 0.29.15 ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ https://mk1.ai ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ The license key for the current software has been verified as ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ belonging to: ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ Chai Research Corp. ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ║ ║
zmeeks-capitanito-54-2800-v7-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
zmeeks-capitanito-54-2800-v7-mkmlizer: Downloaded to shared memory in 31.202s
zmeeks-capitanito-54-2800-v7-mkmlizer: Checking if zmeeks/capitanito__54-2800 already exists in ChaiML
zmeeks-capitanito-54-2800-v7-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp_6l10c65, device:0
zmeeks-capitanito-54-2800-v7-mkmlizer: Saving flywheel model at /dev/shm/model_cache
zmeeks-capitanito-54-2800-v7-mkmlizer: quantized model in 32.436s
zmeeks-capitanito-54-2800-v7-mkmlizer: Processed model zmeeks/capitanito__54-2800 in 63.728s
zmeeks-capitanito-54-2800-v7-mkmlizer: creating bucket guanaco-mkml-models
zmeeks-capitanito-54-2800-v7-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
zmeeks-capitanito-54-2800-v7-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/zmeeks-capitanito-54-2800-v7/nvidia
zmeeks-capitanito-54-2800-v7-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/zmeeks-capitanito-54-2800-v7/nvidia/config.json
zmeeks-capitanito-54-2800-v7-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/zmeeks-capitanito-54-2800-v7/nvidia/special_tokens_map.json
zmeeks-capitanito-54-2800-v7-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/zmeeks-capitanito-54-2800-v7/nvidia/tokenizer_config.json
zmeeks-capitanito-54-2800-v7-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/zmeeks-capitanito-54-2800-v7/nvidia/tokenizer.json
zmeeks-capitanito-54-2800-v7-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/zmeeks-capitanito-54-2800-v7/nvidia/flywheel_model.0.safetensors
zmeeks-capitanito-54-2800-v7-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%|▏ | 5/363 [00:00<00:13, 26.33it/s]
Loading 0: 3%|▎ | 10/363 [00:00<00:09, 35.36it/s]
Loading 0: 4%|▍ | 14/363 [00:00<00:11, 30.43it/s]
Loading 0: 6%|▌ | 21/363 [00:00<00:08, 39.90it/s]
Loading 0: 7%|▋ | 26/363 [00:00<00:08, 40.62it/s]
Loading 0: 9%|▊ | 31/363 [00:00<00:07, 42.91it/s]
Loading 0: 10%|█ | 37/363 [00:00<00:07, 42.05it/s]
Loading 0: 12%|█▏ | 42/363 [00:01<00:08, 40.12it/s]
Loading 0: 13%|█▎ | 49/363 [00:01<00:06, 45.58it/s]
Loading 0: 15%|█▌ | 55/363 [00:01<00:07, 42.79it/s]
Loading 0: 17%|█▋ | 61/363 [00:01<00:09, 31.73it/s]
Loading 0: 18%|█▊ | 65/363 [00:01<00:09, 30.96it/s]
Loading 0: 20%|█▉ | 72/363 [00:01<00:07, 37.47it/s]
Loading 0: 21%|██ | 77/363 [00:02<00:07, 38.99it/s]
Loading 0: 23%|██▎ | 82/363 [00:02<00:08, 33.51it/s]
Loading 0: 25%|██▍ | 89/363 [00:02<00:06, 39.38it/s]
Loading 0: 26%|██▌ | 94/363 [00:02<00:07, 37.92it/s]
Loading 0: 27%|██▋ | 99/363 [00:02<00:07, 37.42it/s]
Loading 0: 28%|██▊ | 103/363 [00:02<00:06, 37.50it/s]
Loading 0: 30%|██▉ | 108/363 [00:02<00:06, 39.93it/s]
Loading 0: 31%|███ | 113/363 [00:03<00:07, 32.75it/s]
Loading 0: 33%|███▎ | 118/363 [00:03<00:07, 32.31it/s]
Loading 0: 34%|███▍ | 125/363 [00:03<00:06, 39.27it/s]
Loading 0: 36%|███▌ | 130/363 [00:03<00:05, 39.02it/s]
Loading 0: 37%|███▋ | 135/363 [00:03<00:05, 38.63it/s]
Loading 0: 39%|███▊ | 140/363 [00:03<00:05, 39.87it/s]
Loading 0: 40%|███▉ | 145/363 [00:04<00:09, 23.78it/s]
Loading 0: 41%|████ | 149/363 [00:04<00:09, 23.74it/s]
Loading 0: 42%|████▏ | 154/363 [00:04<00:07, 27.98it/s]
Loading 0: 44%|████▎ | 158/363 [00:04<00:07, 27.02it/s]
Loading 0: 45%|████▍ | 163/363 [00:04<00:06, 31.45it/s]
Loading 0: 46%|████▌ | 167/363 [00:04<00:06, 29.77it/s]
Loading 0: 47%|████▋ | 172/363 [00:04<00:05, 33.86it/s]
Loading 0: 48%|████▊ | 176/363 [00:05<00:06, 30.96it/s]
Loading 0: 50%|████▉ | 181/363 [00:05<00:05, 33.96it/s]
Loading 0: 51%|█████ | 185/363 [00:05<00:05, 30.20it/s]
Loading 0: 52%|█████▏ | 190/363 [00:05<00:05, 33.80it/s]
Loading 0: 53%|█████▎ | 194/363 [00:05<00:05, 30.82it/s]
Loading 0: 55%|█████▍ | 199/363 [00:05<00:04, 34.76it/s]
Loading 0: 56%|█████▌ | 203/363 [00:05<00:05, 31.49it/s]
Loading 0: 58%|█████▊ | 210/363 [00:06<00:03, 38.71it/s]
Loading 0: 59%|█████▉ | 215/363 [00:06<00:03, 38.50it/s]
Loading 0: 61%|██████ | 220/363 [00:06<00:03, 39.83it/s]
Loading 0: 62%|██████▏ | 225/363 [00:06<00:05, 25.10it/s]
Loading 0: 63%|██████▎ | 230/363 [00:06<00:04, 27.27it/s]
Loading 0: 65%|██████▌ | 237/363 [00:06<00:03, 34.57it/s]
Loading 0: 67%|██████▋ | 242/363 [00:07<00:03, 36.42it/s]
Loading 0: 68%|██████▊ | 247/363 [00:07<00:03, 36.86it/s]
Loading 0: 69%|██████▉ | 252/363 [00:07<00:02, 38.87it/s]
Loading 0: 71%|███████ | 257/363 [00:07<00:03, 32.34it/s]
Loading 0: 73%|███████▎ | 264/363 [00:07<00:02, 39.12it/s]
Loading 0: 74%|███████▍ | 269/363 [00:07<00:02, 38.83it/s]
Loading 0: 75%|███████▌ | 274/363 [00:07<00:02, 38.50it/s]
Loading 0: 77%|███████▋ | 279/363 [00:08<00:02, 38.81it/s]
Loading 0: 78%|███████▊ | 284/363 [00:08<00:02, 32.24it/s]
Loading 0: 80%|████████ | 291/363 [00:08<00:01, 38.43it/s]
Loading 0: 82%|████████▏ | 296/363 [00:08<00:01, 37.26it/s]
Loading 0: 83%|████████▎ | 300/363 [00:08<00:01, 37.36it/s]
Loading 0: 84%|████████▎ | 304/363 [00:09<00:02, 19.92it/s]
Loading 0: 85%|████████▍ | 307/363 [00:09<00:02, 20.87it/s]
Loading 0: 86%|████████▌ | 312/363 [00:09<00:02, 22.90it/s]
Loading 0: 88%|████████▊ | 319/363 [00:09<00:01, 30.49it/s]
Loading 0: 89%|████████▉ | 323/363 [00:09<00:01, 30.40it/s]
Loading 0: 90%|█████████ | 328/363 [00:09<00:01, 33.40it/s]
Loading 0: 91%|█████████▏| 332/363 [00:09<00:00, 32.44it/s]
Loading 0: 93%|█████████▎| 336/363 [00:09<00:00, 33.95it/s]
Loading 0: 94%|█████████▎| 340/363 [00:10<00:00, 30.84it/s]
Loading 0: 95%|█████████▍| 344/363 [00:10<00:00, 32.24it/s]
Loading 0: 96%|█████████▌| 348/363 [00:10<00:00, 29.67it/s]
Loading 0: 97%|█████████▋| 353/363 [00:10<00:00, 34.01it/s]
Loading 0: 98%|█████████▊| 357/363 [00:10<00:00, 31.51it/s]
Job zmeeks-capitanito-54-2800-v7-mkmlizer completed after 96.65s with status: succeeded
Stopping job with name zmeeks-capitanito-54-2800-v7-mkmlizer
Pipeline stage MKMLizer completed in 97.17s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.17s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service zmeeks-capitanito-54-2800-v7
Waiting for inference service zmeeks-capitanito-54-2800-v7 to be ready
Failed to get response for submission chaiml-bat-boys-azeril-_87348_v1: ('http://chaiml-bat-boys-azeril-87348-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Inference service zmeeks-capitanito-54-2800-v7 ready after 322.2322599887848s
Pipeline stage MKMLDeployer completed in 322.96s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.190293788909912s
Received healthy response to inference request in 1.474287748336792s
Received healthy response to inference request in 1.4850265979766846s
Received healthy response to inference request in 1.375218152999878s
5 requests
1 failed requests
5th percentile: 1.3950320720672607
10th percentile: 1.4148459911346436
20th percentile: 1.4544738292694093
30th percentile: 1.4764355182647706
40th percentile: 1.4807310581207276
50th percentile: 1.4850265979766846
60th percentile: 1.7671334743499756
70th percentile: 2.0492403507232666
80th percentile: 5.778232812881473
90th percentile: 12.954110860824587
95th percentile: 16.54204988479614
99th percentile: 19.412401103973387
mean time: 5.330963039398194
%s, retrying in %s seconds...
Received healthy response to inference request in 1.5355501174926758s
Received healthy response to inference request in 1.6372895240783691s
Received healthy response to inference request in 1.5573759078979492s
Received healthy response to inference request in 1.9766101837158203s
Received healthy response to inference request in 1.805999755859375s
5 requests
0 failed requests
5th percentile: 1.5399152755737304
10th percentile: 1.544280433654785
20th percentile: 1.5530107498168946
30th percentile: 1.5733586311340333
40th percentile: 1.6053240776062012
50th percentile: 1.6372895240783691
60th percentile: 1.7047736167907714
70th percentile: 1.7722577095031737
80th percentile: 1.8401218414306642
90th percentile: 1.9083660125732422
95th percentile: 1.9424880981445312
99th percentile: 1.9697857666015626
mean time: 1.7025650978088378
Pipeline stage StressChecker completed in 38.73s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.65s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.67s
Shutdown handler de-registered
zmeeks-capitanito-54-2800_v7 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Skipping teardown as no inference service was successfully deployed
Pipeline stage MKMLProfilerDeleter completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLProfilerTemplater
Pipeline stage MKMLProfilerTemplater completed in 0.12s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeployer
Creating inference service zmeeks-capitanito-54-2800-v7-profiler
Waiting for inference service zmeeks-capitanito-54-2800-v7-profiler to be ready
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4981.23s
Shutdown handler de-registered
zmeeks-capitanito-54-2800_v7 status is now inactive due to auto deactivation removed underperforming models
zmeeks-capitanito-54-2800_v7 status is now torndown due to DeploymentManager action
zmeeks-capitanito-54-2800_v7 status is now torndown due to DeploymentManager action