Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-nemo-20241010-t-5991-v115-mkmlizer
Stopping job with name chaiml-nemo-20241010-t-5991-v115-mkmlizer
%s, retrying in %s seconds...
Starting job with name chaiml-nemo-20241010-t-5991-v115-mkmlizer
Waiting for job on chaiml-nemo-20241010-t-5991-v115-mkmlizer to finish
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ _____ __ __ ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ /___/ ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ Version: 0.11.12 ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ https://mk1.ai ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ belonging to: ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ Chai Research Corp. ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-nemo-20241010-t-5991-v115-mkmlizer: Downloaded to shared memory in 30.994s
chaiml-nemo-20241010-t-5991-v115-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpwvp4j85u, device:0
chaiml-nemo-20241010-t-5991-v115-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-nemo-20241010-t-5991-v115-mkmlizer: quantized model in 36.723s
chaiml-nemo-20241010-t-5991-v115-mkmlizer: Processed model ChaiML/nemo-20241010_tier_merge_v4-albert in 67.718s
chaiml-nemo-20241010-t-5991-v115-mkmlizer: creating bucket guanaco-mkml-models
chaiml-nemo-20241010-t-5991-v115-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-nemo-20241010-t-5991-v115-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v115
chaiml-nemo-20241010-t-5991-v115-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v115/special_tokens_map.json
chaiml-nemo-20241010-t-5991-v115-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v115/config.json
chaiml-nemo-20241010-t-5991-v115-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v115/tokenizer_config.json
chaiml-nemo-20241010-t-5991-v115-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v115/tokenizer.json
chaiml-nemo-20241010-t-5991-v115-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v115/flywheel_model.0.safetensors
Job chaiml-nemo-20241010-t-5991-v115-mkmlizer completed after 94.97s with status: succeeded
Stopping job with name chaiml-nemo-20241010-t-5991-v115-mkmlizer
%s, retrying in %s seconds...
Stopping job with name chaiml-nemo-20241010-t-5991-v115-mkmlizer
%s, retrying in %s seconds...
Stopping job with name chaiml-nemo-20241010-t-5991-v115-mkmlizer
%s, retrying in %s seconds...
Starting job with name chaiml-nemo-20241010-t-5991-v115-mkmlizer
Waiting for job on chaiml-nemo-20241010-t-5991-v115-mkmlizer to finish
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ _____ __ __ ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ /___/ ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ Version: 0.11.12 ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ https://mk1.ai ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ belonging to: ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ Chai Research Corp. ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ║ ║
chaiml-nemo-20241010-t-5991-v115-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-nemo-20241010-t-5991-v115-mkmlizer: Downloaded to shared memory in 28.631s
chaiml-nemo-20241010-t-5991-v115-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpp53iox95, device:0
chaiml-nemo-20241010-t-5991-v115-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Connection pool is full, discarding connection: %s. Connection pool size: %s
chaiml-nemo-20241010-t-5991-v115-mkmlizer: quantized model in 36.864s
chaiml-nemo-20241010-t-5991-v115-mkmlizer: Processed model ChaiML/nemo-20241010_tier_merge_v4-albert in 65.495s
chaiml-nemo-20241010-t-5991-v115-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-nemo-20241010-t-5991-v115-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v115
chaiml-nemo-20241010-t-5991-v115-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v115/config.json
chaiml-nemo-20241010-t-5991-v115-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v115/special_tokens_map.json
chaiml-nemo-20241010-t-5991-v115-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v115/tokenizer_config.json
chaiml-nemo-20241010-t-5991-v115-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v115/tokenizer.json
chaiml-nemo-20241010-t-5991-v115-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v115/flywheel_model.0.safetensors
chaiml-nemo-20241010-t-5991-v115-mkmlizer:
Loading 0: 99%|█████████▉| 360/363 [00:15<00:00, 43.58it/s]
Job chaiml-nemo-20241010-t-5991-v115-mkmlizer completed after 94.27s with status: succeeded
Stopping job with name chaiml-nemo-20241010-t-5991-v115-mkmlizer
Pipeline stage MKMLizer completed in 190.97s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.19s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-nemo-20241010-t-5991-v115
Ignoring service chaiml-nemo-20241010-t-5991-v115 already deployed
Waiting for inference service chaiml-nemo-20241010-t-5991-v115 to be ready
Inference service chaiml-nemo-20241010-t-5991-v115 ready after 90.4831473827362s
Pipeline stage MKMLDeployer completed in 91.11s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.6131489276885986s
Received healthy response to inference request in 1.4751029014587402s
Received healthy response to inference request in 1.6117143630981445s
Received healthy response to inference request in 1.5740118026733398s
EOF
Received unhealthy response to inference request!
5 requests
1 failed requests
5th percentile: 1.4948846817016601
10th percentile: 1.51466646194458
20th percentile: 1.55423002243042
30th percentile: 1.5815523147583008
40th percentile: 1.5966333389282226
50th percentile: 1.6117143630981445
60th percentile: 1.6122881889343261
70th percentile: 1.6128620147705077
80th percentile: 1.6508860111236572
90th percentile: 1.7263601779937745
95th percentile: 1.764097261428833
99th percentile: 1.7942869281768798
mean time: 1.615162467956543
%s, retrying in %s seconds...
Received healthy response to inference request in 1.8939731121063232s
Received healthy response to inference request in 1.7436950206756592s
Received healthy response to inference request in 1.9768328666687012s
Received healthy response to inference request in 2.164423942565918s
Received healthy response to inference request in 1.4102115631103516s
5 requests
0 failed requests
5th percentile: 1.4769082546234131
10th percentile: 1.5436049461364747
20th percentile: 1.6769983291625976
30th percentile: 1.773750638961792
40th percentile: 1.8338618755340577
50th percentile: 1.8939731121063232
60th percentile: 1.9271170139312743
70th percentile: 1.9602609157562256
80th percentile: 2.0143510818481447
90th percentile: 2.0893875122070313
95th percentile: 2.1269057273864744
99th percentile: 2.156920299530029
mean time: 1.8378273010253907
Pipeline stage StressChecker completed in 21.92s
Shutdown handler de-registered
chaiml-nemo-20241010-t_5991_v115 status is now deployed due to DeploymentManager action
chaiml-nemo-20241010-t_5991_v115 status is now inactive due to auto deactivation removed underperforming models
chaiml-nemo-20241010-t_5991_v115 status is now torndown due to DeploymentManager action