Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-nemo-20241010-ti-5991-v99-mkmlizer
Waiting for job on chaiml-nemo-20241010-ti-5991-v99-mkmlizer to finish
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: ║ _____ __ __ ║
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: ║ /___/ ║
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: ║ ║
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: ║ Version: 0.11.12 ║
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: ║ https://mk1.ai ║
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: ║ ║
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: ║ belonging to: ║
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: ║ ║
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: ║ Chai Research Corp. ║
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: ║ ║
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: Downloaded to shared memory in 32.599s
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpsmmk7sk2, device:0
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: quantized model in 35.442s
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: Processed model ChaiML/nemo-20241010_tier_merge_v4-albert in 68.041s
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: creating bucket guanaco-mkml-models
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-nemo-20241010-ti-5991-v99
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-nemo-20241010-ti-5991-v99/config.json
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-nemo-20241010-ti-5991-v99/special_tokens_map.json
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-nemo-20241010-ti-5991-v99/tokenizer_config.json
chaiml-nemo-20241010-ti-5991-v99-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-nemo-20241010-ti-5991-v99/tokenizer.json
chaiml-nemo-20241010-ti-5991-v99-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%| | 2/363 [00:05<18:00, 2.99s/it]
Loading 0: 2%|▏ | 6/363 [00:06<04:48, 1.24it/s]
Loading 0: 4%|▎ | 13/363 [00:06<01:42, 3.40it/s]
Loading 0: 5%|▍ | 18/363 [00:06<01:03, 5.40it/s]
Loading 0: 7%|▋ | 24/363 [00:06<00:41, 8.20it/s]
Loading 0: 9%|▉ | 32/363 [00:06<00:24, 13.34it/s]
Loading 0: 10%|█ | 38/363 [00:06<00:19, 17.03it/s]
Loading 0: 12%|█▏ | 43/363 [00:07<00:19, 16.38it/s]
Loading 0: 13%|█▎ | 49/363 [00:07<00:14, 21.18it/s]
Loading 0: 15%|█▍ | 54/363 [00:07<00:12, 24.20it/s]
Loading 0: 16%|█▋ | 59/363 [00:07<00:11, 27.05it/s]
Loading 0: 18%|█▊ | 64/363 [00:07<00:09, 30.32it/s]
Loading 0: 19%|█▉ | 69/363 [00:07<00:10, 28.48it/s]
Loading 0: 21%|██ | 76/363 [00:07<00:07, 36.28it/s]
Loading 0: 22%|██▏ | 81/363 [00:08<00:07, 39.04it/s]
Loading 0: 24%|██▎ | 86/363 [00:08<00:06, 40.49it/s]
Loading 0: 25%|██▌ | 92/363 [00:08<00:06, 40.84it/s]
Loading 0: 27%|██▋ | 97/363 [00:08<00:06, 39.16it/s]
Loading 0: 29%|██▊ | 104/363 [00:08<00:05, 44.79it/s]
Loading 0: 30%|███ | 110/363 [00:08<00:05, 44.84it/s]
Loading 0: 32%|███▏ | 115/363 [00:08<00:05, 44.22it/s]
Loading 0: 33%|███▎ | 121/363 [00:09<00:06, 34.91it/s]
Loading 0: 34%|███▍ | 125/363 [00:09<00:06, 35.40it/s]
Loading 0: 36%|███▌ | 131/363 [00:09<00:05, 40.72it/s]
Loading 0: 38%|███▊ | 137/363 [00:09<00:05, 42.14it/s]
Loading 0: 39%|███▉ | 142/363 [00:09<00:05, 42.50it/s]
Loading 0: 41%|████ | 149/363 [00:09<00:04, 47.37it/s]
Loading 0: 42%|████▏ | 154/363 [00:09<00:04, 47.79it/s]
Loading 0: 44%|████▍ | 159/363 [00:09<00:04, 41.75it/s]
Loading 0: 46%|████▌ | 167/363 [00:09<00:03, 49.85it/s]
Loading 0: 48%|████▊ | 173/363 [00:10<00:03, 48.41it/s]
Loading 0: 49%|████▉ | 179/363 [00:10<00:03, 46.21it/s]
Loading 0: 51%|█████ | 185/363 [00:10<00:03, 48.53it/s]
Loading 0: 53%|█████▎ | 191/363 [00:10<00:03, 46.05it/s]
Loading 0: 54%|█████▍ | 196/363 [00:10<00:03, 43.53it/s]
Loading 0: 56%|█████▌ | 202/363 [00:10<00:05, 32.20it/s]
Loading 0: 57%|█████▋ | 206/363 [00:11<00:04, 33.03it/s]
Loading 0: 58%|█████▊ | 212/363 [00:11<00:03, 38.47it/s]
Loading 0: 60%|██████ | 218/363 [00:11<00:03, 40.57it/s]
Loading 0: 61%|██████▏ | 223/363 [00:11<00:03, 40.64it/s]
Loading 0: 63%|██████▎ | 230/363 [00:11<00:02, 46.52it/s]
Loading 0: 65%|██████▍ | 235/363 [00:11<00:02, 47.24it/s]
Loading 0: 66%|██████▌ | 240/363 [00:11<00:03, 40.21it/s]
Loading 0: 68%|██████▊ | 248/363 [00:11<00:02, 48.87it/s]
Loading 0: 70%|██████▉ | 254/363 [00:12<00:02, 48.25it/s]
Loading 0: 72%|███████▏ | 260/363 [00:12<00:02, 48.34it/s]
Loading 0: 73%|███████▎ | 266/363 [00:12<00:01, 49.34it/s]
Loading 0: 75%|███████▍ | 272/363 [00:12<00:01, 45.79it/s]
Loading 0: 76%|███████▋ | 277/363 [00:12<00:01, 44.01it/s]
Loading 0: 78%|███████▊ | 283/363 [00:12<00:02, 35.20it/s]
Loading 0: 79%|███████▉ | 287/363 [00:12<00:02, 35.06it/s]
Loading 0: 81%|████████ | 293/363 [00:13<00:01, 39.31it/s]
Loading 0: 82%|████████▏ | 299/363 [00:13<00:01, 39.78it/s]
Loading 0: 84%|████████▎ | 304/363 [00:13<00:01, 40.90it/s]
Loading 0: 86%|████████▌ | 311/363 [00:13<00:01, 47.29it/s]
Loading 0: 87%|████████▋ | 317/363 [00:13<00:00, 46.69it/s]
Loading 0: 89%|████████▊ | 322/363 [00:13<00:00, 45.90it/s]
Loading 0: 91%|█████████ | 329/363 [00:13<00:00, 50.61it/s]
Loading 0: 92%|█████████▏| 335/363 [00:13<00:00, 49.09it/s]
Loading 0: 94%|█████████▎| 340/363 [00:14<00:00, 45.34it/s]
Loading 0: 95%|█████████▌| 346/363 [00:14<00:00, 48.85it/s]
Loading 0: 97%|█████████▋| 352/363 [00:14<00:00, 48.57it/s]
Loading 0: 98%|█████████▊| 357/363 [00:14<00:00, 42.40it/s]
Job chaiml-nemo-20241010-ti-5991-v99-mkmlizer completed after 93.64s with status: succeeded
Stopping job with name chaiml-nemo-20241010-ti-5991-v99-mkmlizer
Pipeline stage MKMLizer completed in 94.21s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-nemo-20241010-ti-5991-v99
Waiting for inference service chaiml-nemo-20241010-ti-5991-v99 to be ready
Inference service chaiml-nemo-20241010-ti-5991-v99 ready after 170.57781767845154s
Pipeline stage MKMLDeployer completed in 171.09s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.7291052341461182s
Received healthy response to inference request in 1.5865592956542969s
Received healthy response to inference request in 1.397078514099121s
Received healthy response to inference request in 1.3556666374206543s
Received healthy response to inference request in 1.8220818042755127s
5 requests
0 failed requests
5th percentile: 1.3639490127563476
10th percentile: 1.372231388092041
20th percentile: 1.3887961387634278
30th percentile: 1.4349746704101562
40th percentile: 1.5107669830322266
50th percentile: 1.5865592956542969
60th percentile: 1.6435776710510255
70th percentile: 1.7005960464477539
80th percentile: 1.7477005481719972
90th percentile: 1.784891176223755
95th percentile: 1.8034864902496337
99th percentile: 1.818362741470337
mean time: 1.5780982971191406
Pipeline stage StressChecker completed in 9.54s
Shutdown handler de-registered
chaiml-nemo-20241010-ti_5991_v99 status is now deployed due to DeploymentManager action
chaiml-nemo-20241010-ti_5991_v99 status is now inactive due to auto deactivation removed underperforming models
chaiml-nemo-20241010-ti_5991_v99 status is now torndown due to DeploymentManager action