Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
Running pipeline stage MKMLizer
Starting job with name chaiml-nemo-20241016-bre-1066-v1-mkmlizer
Waiting for job on chaiml-nemo-20241016-bre-1066-v1-mkmlizer to finish
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: ║ _____ __ __ ║
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: ║ /___/ ║
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: ║ ║
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: ║ Version: 0.11.12 ║
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: ║ https://mk1.ai ║
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: ║ ║
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: ║ belonging to: ║
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: ║ ║
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: ║ Chai Research Corp. ║
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: ║ Expiration: 2025-01-15 23:59:59 ║
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: ║ ║
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Retrying (%r) after connection broken by '%r': %s
Retrying (%r) after connection broken by '%r': %s
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: Downloaded to shared memory in 684.967s
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpnjs8m4gh, device:0
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: quantized model in 38.250s
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: Processed model ChaiML/nemo-20241016-breadcrumbs_ties-remerge_v1_5merge-albert in 723.216s
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: creating bucket guanaco-mkml-models
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-nemo-20241016-bre-1066-v1
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-nemo-20241016-bre-1066-v1/config.json
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-nemo-20241016-bre-1066-v1/special_tokens_map.json
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-nemo-20241016-bre-1066-v1/tokenizer_config.json
chaiml-nemo-20241016-bre-1066-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-nemo-20241016-bre-1066-v1/flywheel_model.0.safetensors
chaiml-nemo-20241016-bre-1066-v1-mkmlizer:
Loading 0: 99%|█████████▉| 359/363 [00:16<00:00, 35.68it/s]
Job chaiml-nemo-20241016-bre-1066-v1-mkmlizer completed after 747.16s with status: succeeded
Stopping job with name chaiml-nemo-20241016-bre-1066-v1-mkmlizer
Pipeline stage MKMLizer completed in 747.75s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.16s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-nemo-20241016-bre-1066-v1
Waiting for inference service chaiml-nemo-20241016-bre-1066-v1 to be ready
Inference service chaiml-nemo-20241016-bre-1066-v1 ready after 180.663818359375s
Pipeline stage MKMLDeployer completed in 181.20s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.9804039001464844s
Received healthy response to inference request in 1.4925658702850342s
Received healthy response to inference request in 1.4412877559661865s
Received healthy response to inference request in 1.7525925636291504s
Received healthy response to inference request in 1.384465217590332s
5 requests
0 failed requests
5th percentile: 1.395829725265503
10th percentile: 1.407194232940674
20th percentile: 1.4299232482910156
30th percentile: 1.451543378829956
40th percentile: 1.472054624557495
50th percentile: 1.4925658702850342
60th percentile: 1.5965765476226808
70th percentile: 1.700587224960327
80th percentile: 1.7981548309326172
90th percentile: 1.8892793655395508
95th percentile: 1.9348416328430176
99th percentile: 1.971291446685791
mean time: 1.6102630615234375
Pipeline stage StressChecker completed in 10.10s
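Note on the StressChecker figures: the reported percentiles and mean are consistent with linear interpolation between the sorted values of the five logged response times. The sketch below reproduces them under that assumption. The helper name `percentile` and the interpolation rule are illustrative guesses, not the StressChecker's confirmed implementation.

```python
from statistics import fmean


def percentile(samples, q):
    """Return the q-th percentile (0 <= q <= 100) using linear interpolation.

    Hypothetical helper: interpolates between the two closest ranks of the
    sorted samples, which appears to match the figures in the log.
    """
    ordered = sorted(samples)
    rank = (len(ordered) - 1) * q / 100.0      # fractional index into the sorted list
    lower = int(rank)                          # index of the sample at or below the rank
    upper = min(lower + 1, len(ordered) - 1)   # next sample, clamped to the last index
    fraction = rank - lower                    # distance between lower and upper
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


# Response times (seconds) copied from the five "healthy response" log lines.
latencies = [
    1.9804039001464844,
    1.4925658702850342,
    1.4412877559661865,
    1.7525925636291504,
    1.384465217590332,
]

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {fmean(latencies)}")
```

With only five samples, the upper percentiles are interpolated almost entirely from the two slowest requests, so they say little about tail latency.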
Shutdown handler de-registered
chaiml-nemo-20241016-bre_1066_v1 status is now deployed due to DeploymentManager action
chaiml-nemo-20241016-bre_1066_v1 status is now inactive due to auto deactivation removed underperforming models
chaiml-nemo-20241016-bre_1066_v1 status is now torndown due to DeploymentManager action