Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name zmeeks-capitanito-50-v1-mkmlizer
Waiting for job on zmeeks-capitanito-50-v1-mkmlizer to finish
zmeeks-capitanito-50-v1-mkmlizer: Version: 0.29.15
zmeeks-capitanito-50-v1-mkmlizer: Features: FLYWHEEL, CUDA
zmeeks-capitanito-50-v1-mkmlizer: Copyright 2023-2025 MK ONE TECHNOLOGIES Inc.
zmeeks-capitanito-50-v1-mkmlizer: https://mk1.ai
zmeeks-capitanito-50-v1-mkmlizer: The license key for the current software has been verified as belonging to:
zmeeks-capitanito-50-v1-mkmlizer: Chai Research Corp.
zmeeks-capitanito-50-v1-mkmlizer: Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f
zmeeks-capitanito-50-v1-mkmlizer: Expiration: 2028-03-31 23:59:59
zmeeks-capitanito-50-v1-mkmlizer: Downloaded to shared memory in 44.514s
zmeeks-capitanito-50-v1-mkmlizer: Checking if zmeeks/capitanito__50 already exists in ChaiML
zmeeks-capitanito-50-v1-mkmlizer: Creating repo ChaiML/capitanito__50 and uploading /tmp/tmpkx21p6m5 to it
Failed to get response for submission junhua024-chai-02-full-061_v1: HTTPConnectionPool(host='junhua024-chai-02-full-061-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission junhua024-chai-02-full-061_v1: HTTPConnectionPool(host='junhua024-chai-02-full-061-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
zmeeks-capitanito-50-v1-mkmlizer: 100%|██████████| 6/6 [00:36<00:00, 6.07s/it]
zmeeks-capitanito-50-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpkx21p6m5, device:0
zmeeks-capitanito-50-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
zmeeks-capitanito-50-v1-mkmlizer: quantized model in 31.665s
zmeeks-capitanito-50-v1-mkmlizer: Processed model zmeeks/capitanito__50 in 138.329s
zmeeks-capitanito-50-v1-mkmlizer: creating bucket guanaco-mkml-models
zmeeks-capitanito-50-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
zmeeks-capitanito-50-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/zmeeks-capitanito-50-v1/nvidia
zmeeks-capitanito-50-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/zmeeks-capitanito-50-v1/nvidia/config.json
zmeeks-capitanito-50-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/zmeeks-capitanito-50-v1/nvidia/special_tokens_map.json
zmeeks-capitanito-50-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/zmeeks-capitanito-50-v1/nvidia/tokenizer_config.json
zmeeks-capitanito-50-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/zmeeks-capitanito-50-v1/nvidia/tokenizer.json
zmeeks-capitanito-50-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/zmeeks-capitanito-50-v1/nvidia/flywheel_model.0.safetensors
Job zmeeks-capitanito-50-v1-mkmlizer completed after 169.31s with status: succeeded
Stopping job with name zmeeks-capitanito-50-v1-mkmlizer
Pipeline stage MKMLizer completed in 169.80s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.22s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service zmeeks-capitanito-50-v1
Waiting for inference service zmeeks-capitanito-50-v1 to be ready
Failed to get response for submission junhua024-chai-02-full-061_v1: HTTPConnectionPool(host='junhua024-chai-02-full-061-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Inference service zmeeks-capitanito-50-v1 ready after 261.26740169525146s
Pipeline stage MKMLDeployer completed in 261.81s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.582636833190918s
Received healthy response to inference request in 2.077929973602295s
Received healthy response to inference request in 2.0371363162994385s
Received healthy response to inference request in 1.9294931888580322s
5 requests
1 failed requests
5th percentile: 1.9510218143463134
10th percentile: 1.9725504398345948
20th percentile: 2.0156076908111573
30th percentile: 2.0452950477600096
40th percentile: 2.0616125106811523
50th percentile: 2.077929973602295
60th percentile: 2.279812717437744
70th percentile: 2.481695461273193
80th percentile: 6.093638086318973
90th percentile: 13.115640592575076
95th percentile: 16.626641845703123
99th percentile: 19.435442848205565
mean time: 5.752967882156372
%s, retrying in %s seconds...
Received healthy response to inference request in 1.6018147468566895s
Received healthy response to inference request in 2.063422679901123s
Received healthy response to inference request in 2.149169445037842s
Received healthy response to inference request in 1.149383783340454s
Received healthy response to inference request in 1.9477565288543701s
5 requests
0 failed requests
5th percentile: 1.239869976043701
10th percentile: 1.3303561687469483
20th percentile: 1.5113285541534425
30th percentile: 1.6710031032562256
40th percentile: 1.8093798160552979
50th percentile: 1.9477565288543701
60th percentile: 1.9940229892730712
70th percentile: 2.0402894496917723
80th percentile: 2.0805720329284667
90th percentile: 2.1148707389831545
95th percentile: 2.132020092010498
99th percentile: 2.145739574432373
mean time: 1.7823094367980956
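
The percentile and mean figures in the second StressChecker run above are consistent with linear interpolation between the sorted latencies (the default method of NumPy's `percentile`). The StressChecker's own implementation is not shown in this log, so the following is only a minimal sketch under that assumption; the names `percentile` and `summarize` are hypothetical, and the latency values are copied from the five healthy responses logged above.

```python
import math

def percentile(samples, q):
    """Linear-interpolation percentile of a non-empty list, q in [0, 100]."""
    ordered = sorted(samples)
    rank = (len(ordered) - 1) * q / 100.0   # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction

def summarize(samples, quantiles=(5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99)):
    """Return the percentiles and mean in the same layout as the log (hypothetical helper)."""
    report = {f"{q}th percentile": percentile(samples, q) for q in quantiles}
    report["mean time"] = sum(samples) / len(samples)
    return report

# Latencies (seconds) of the five healthy responses in the second run.
latencies = [
    1.6018147468566895,
    2.063422679901123,
    2.149169445037842,
    1.149383783340454,
    1.9477565288543701,
]

for label, value in summarize(latencies).items():
    print(f"{label}: {value}")
```

With only five samples, every percentile above the 75th is interpolated between the two slowest requests, so a single slow or timed-out request dominates the tail figures, as in the first run.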
Pipeline stage StressChecker completed in 40.19s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.67s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.70s
Shutdown handler de-registered
zmeeks-capitanito-50_v1 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3257.83s
Shutdown handler de-registered
zmeeks-capitanito-50_v1 status is now inactive due to auto deactivation removed underperforming models
zmeeks-capitanito-50_v1 status is now torndown due to DeploymentManager action