Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-nemo-20241017-del-1301-v1-mkmlizer
Waiting for job on chaiml-nemo-20241017-del-1301-v1-mkmlizer to finish
chaiml-nemo-20241017-del-1301-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-nemo-20241017-del-1301-v1-mkmlizer: ║ _____ __ __ ║
chaiml-nemo-20241017-del-1301-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-nemo-20241017-del-1301-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-nemo-20241017-del-1301-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-nemo-20241017-del-1301-v1-mkmlizer: ║ /___/ ║
chaiml-nemo-20241017-del-1301-v1-mkmlizer: ║ ║
chaiml-nemo-20241017-del-1301-v1-mkmlizer: ║ Version: 0.11.12 ║
chaiml-nemo-20241017-del-1301-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-nemo-20241017-del-1301-v1-mkmlizer: ║ https://mk1.ai ║
chaiml-nemo-20241017-del-1301-v1-mkmlizer: ║ ║
chaiml-nemo-20241017-del-1301-v1-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-nemo-20241017-del-1301-v1-mkmlizer: ║ belonging to: ║
chaiml-nemo-20241017-del-1301-v1-mkmlizer: ║ ║
chaiml-nemo-20241017-del-1301-v1-mkmlizer: ║ Chai Research Corp. ║
chaiml-nemo-20241017-del-1301-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-nemo-20241017-del-1301-v1-mkmlizer: ║ Expiration: 2025-01-15 23:59:59 ║
chaiml-nemo-20241017-del-1301-v1-mkmlizer: ║ ║
chaiml-nemo-20241017-del-1301-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-nemo-20241017-del-1301-v1-mkmlizer: Downloaded to shared memory in 52.710s
chaiml-nemo-20241017-del-1301-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpqxyokxg4, device:0
chaiml-nemo-20241017-del-1301-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-nemo-20241017-del-1301-v1-mkmlizer: quantized model in 37.812s
chaiml-nemo-20241017-del-1301-v1-mkmlizer: Processed model ChaiML/nemo-20241017-della-remerge_v2_5merge-albert in 90.522s
chaiml-nemo-20241017-del-1301-v1-mkmlizer: creating bucket guanaco-mkml-models
chaiml-nemo-20241017-del-1301-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-nemo-20241017-del-1301-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-nemo-20241017-del-1301-v1
chaiml-nemo-20241017-del-1301-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-nemo-20241017-del-1301-v1/config.json
chaiml-nemo-20241017-del-1301-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-nemo-20241017-del-1301-v1/special_tokens_map.json
chaiml-nemo-20241017-del-1301-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-nemo-20241017-del-1301-v1/tokenizer_config.json
chaiml-nemo-20241017-del-1301-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-nemo-20241017-del-1301-v1/tokenizer.json
chaiml-nemo-20241017-del-1301-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-nemo-20241017-del-1301-v1/flywheel_model.0.safetensors
chaiml-nemo-20241017-del-1301-v1-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%| | 2/363 [00:06<18:22, 3.05s/it]
Loading 0: 2%|▏ | 6/363 [00:06<04:54, 1.21it/s]
Loading 0: 3%|▎ | 11/363 [00:06<02:09, 2.71it/s]
Loading 0: 4%|▍ | 15/363 [00:06<01:22, 4.22it/s]
Loading 0: 6%|▌ | 22/363 [00:06<00:43, 7.86it/s]
Loading 0: 7%|▋ | 27/363 [00:06<00:31, 10.72it/s]
Loading 0: 9%|▊ | 31/363 [00:06<00:24, 13.39it/s]
Loading 0: 10%|▉ | 35/363 [00:07<00:20, 16.01it/s]
Loading 0: 11%|█ | 40/363 [00:07<00:19, 16.19it/s]
Loading 0: 12%|█▏ | 44/363 [00:07<00:17, 18.71it/s]
Loading 0: 13%|█▎ | 49/363 [00:07<00:13, 23.01it/s]
Loading 0: 15%|█▍ | 53/363 [00:07<00:12, 24.87it/s]
Loading 0: 16%|█▌ | 58/363 [00:07<00:10, 28.40it/s]
Loading 0: 17%|█▋ | 62/363 [00:07<00:10, 29.12it/s]
Loading 0: 18%|█▊ | 67/363 [00:08<00:09, 32.25it/s]
Loading 0: 20%|█▉ | 71/363 [00:08<00:09, 31.54it/s]
Loading 0: 21%|██ | 76/363 [00:08<00:08, 34.81it/s]
Loading 0: 22%|██▏ | 80/363 [00:08<00:08, 33.83it/s]
Loading 0: 23%|██▎ | 85/363 [00:08<00:07, 36.15it/s]
Loading 0: 25%|██▍ | 89/363 [00:08<00:07, 34.73it/s]
Loading 0: 26%|██▌ | 94/363 [00:08<00:07, 37.07it/s]
Loading 0: 27%|██▋ | 98/363 [00:08<00:07, 35.11it/s]
Loading 0: 28%|██▊ | 103/363 [00:09<00:06, 37.51it/s]
Loading 0: 29%|██▉ | 107/363 [00:09<00:07, 35.80it/s]
Loading 0: 31%|███ | 112/363 [00:09<00:06, 36.80it/s]
Loading 0: 32%|███▏ | 116/363 [00:09<00:07, 34.45it/s]
Loading 0: 33%|███▎ | 121/363 [00:09<00:09, 25.71it/s]
Loading 0: 34%|███▍ | 124/363 [00:09<00:09, 25.40it/s]
Loading 0: 36%|███▌ | 130/363 [00:09<00:07, 31.00it/s]
Loading 0: 37%|███▋ | 134/363 [00:10<00:07, 31.26it/s]
Loading 0: 38%|███▊ | 139/363 [00:10<00:06, 34.47it/s]
Loading 0: 39%|███▉ | 143/363 [00:10<00:06, 32.59it/s]
Loading 0: 41%|████ | 148/363 [00:10<00:06, 34.66it/s]
Loading 0: 42%|████▏ | 152/363 [00:10<00:06, 33.60it/s]
Loading 0: 43%|████▎ | 157/363 [00:10<00:05, 36.08it/s]
Loading 0: 44%|████▍ | 161/363 [00:10<00:05, 34.45it/s]
Loading 0: 46%|████▌ | 166/363 [00:10<00:05, 36.96it/s]
Loading 0: 47%|████▋ | 170/363 [00:11<00:05, 35.41it/s]
Loading 0: 48%|████▊ | 175/363 [00:11<00:05, 37.40it/s]
Loading 0: 49%|████▉ | 179/363 [00:11<00:05, 34.88it/s]
Loading 0: 51%|█████ | 184/363 [00:11<00:04, 37.60it/s]
Loading 0: 52%|█████▏ | 188/363 [00:11<00:04, 35.27it/s]
Loading 0: 53%|█████▎ | 193/363 [00:11<00:04, 36.72it/s]
Loading 0: 54%|█████▍ | 197/363 [00:11<00:04, 35.19it/s]
Loading 0: 56%|█████▌ | 202/363 [00:12<00:06, 26.47it/s]
Loading 0: 57%|█████▋ | 206/363 [00:12<00:05, 27.64it/s]
Loading 0: 58%|█████▊ | 211/363 [00:12<00:04, 31.48it/s]
Loading 0: 59%|█████▉ | 215/363 [00:12<00:04, 32.70it/s]
Loading 0: 61%|██████ | 220/363 [00:12<00:03, 35.89it/s]
Loading 0: 62%|██████▏ | 224/363 [00:12<00:03, 35.58it/s]
Loading 0: 63%|██████▎ | 229/363 [00:12<00:03, 38.68it/s]
Loading 0: 64%|██████▍ | 234/363 [00:12<00:03, 39.50it/s]
Loading 0: 66%|██████▌ | 239/363 [00:13<00:03, 40.19it/s]
Loading 0: 67%|██████▋ | 244/363 [00:13<00:02, 41.59it/s]
Loading 0: 69%|██████▊ | 249/363 [00:13<00:03, 33.70it/s]
Loading 0: 71%|███████ | 256/363 [00:13<00:02, 40.41it/s]
Loading 0: 72%|███████▏ | 261/363 [00:13<00:02, 39.89it/s]
Loading 0: 73%|███████▎ | 266/363 [00:13<00:02, 39.24it/s]
Loading 0: 75%|███████▍ | 271/363 [00:13<00:02, 40.71it/s]
Loading 0: 76%|███████▌ | 276/363 [00:14<00:02, 33.92it/s]
Loading 0: 78%|███████▊ | 283/363 [00:14<00:02, 30.35it/s]
Loading 0: 79%|███████▉ | 287/363 [00:14<00:02, 30.62it/s]
Loading 0: 80%|████████ | 292/363 [00:14<00:02, 33.87it/s]
Loading 0: 82%|████████▏ | 296/363 [00:14<00:01, 34.06it/s]
Loading 0: 83%|████████▎ | 302/363 [00:14<00:01, 38.23it/s]
Loading 0: 85%|████████▍ | 307/363 [00:14<00:01, 40.29it/s]
Loading 0: 86%|████████▌ | 312/363 [00:15<00:01, 33.76it/s]
Loading 0: 88%|████████▊ | 319/363 [00:15<00:01, 41.24it/s]
Loading 0: 89%|████████▉ | 324/363 [00:15<00:00, 41.39it/s]
Loading 0: 91%|█████████ | 329/363 [00:15<00:00, 40.66it/s]
Loading 0: 92%|█████████▏| 334/363 [00:15<00:00, 41.69it/s]
Loading 0: 93%|█████████▎| 339/363 [00:15<00:00, 34.87it/s]
Loading 0: 95%|█████████▌| 346/363 [00:15<00:00, 40.87it/s]
Loading 0: 97%|█████████▋| 351/363 [00:16<00:00, 41.27it/s]
Loading 0: 98%|█████████▊| 356/363 [00:16<00:00, 41.06it/s]
Loading 0: 99%|█████████▉| 361/363 [00:16<00:00, 42.83it/s]
Job chaiml-nemo-20241017-del-1301-v1-mkmlizer completed after 114.43s with status: succeeded
Stopping job with name chaiml-nemo-20241017-del-1301-v1-mkmlizer
Pipeline stage MKMLizer completed in 115.00s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.17s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-nemo-20241017-del-1301-v1
Waiting for inference service chaiml-nemo-20241017-del-1301-v1 to be ready
Inference service chaiml-nemo-20241017-del-1301-v1 ready after 170.83061599731445s
Pipeline stage MKMLDeployer completed in 171.35s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.7677700519561768s
Received healthy response to inference request in 1.629547119140625s
Received healthy response to inference request in 1.4351117610931396s
Received healthy response to inference request in 1.4322621822357178s
Received healthy response to inference request in 1.5601201057434082s
5 requests
0 failed requests
5th percentile: 1.4328320980072022
10th percentile: 1.4334020137786865
20th percentile: 1.4345418453216552
30th percentile: 1.4601134300231933
40th percentile: 1.5101167678833007
50th percentile: 1.5601201057434082
60th percentile: 1.587890911102295
70th percentile: 1.6156617164611817
80th percentile: 1.6571917057037353
90th percentile: 1.7124808788299561
95th percentile: 1.7401254653930665
99th percentile: 1.7622411346435547
mean time: 1.5649622440338136
Pipeline stage StressChecker completed in 9.19s
Shutdown handler de-registered
chaiml-nemo-20241017-del_1301_v1 status is now deployed due to DeploymentManager action
chaiml-nemo-20241017-del_1301_v1 status is now inactive due to auto deactivation removed underperforming models
chaiml-nemo-20241017-del_1301_v1 status is now torndown due to DeploymentManager action