Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-nemo-20241017-bre-9372-v1-mkmlizer
Waiting for job on chaiml-nemo-20241017-bre-9372-v1-mkmlizer to finish
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: ║ _____ __ __ ║
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: ║ /___/ ║
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: ║ ║
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: ║ Version: 0.11.12 ║
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: ║ https://mk1.ai ║
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: ║ ║
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: ║ belonging to: ║
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: ║ ║
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: ║ Chai Research Corp. ║
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: ║ Expiration: 2025-01-15 23:59:59 ║
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: ║ ║
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: Downloaded to shared memory in 53.068s
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpdtlnwt48, device:0
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: quantized model in 38.728s
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: Processed model ChaiML/nemo-20241017-breadcrumbs-remerge_v2_5merge-albert in 91.796s
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: creating bucket guanaco-mkml-models
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-nemo-20241017-bre-9372-v1/config.json
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-nemo-20241017-bre-9372-v1/special_tokens_map.json
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-nemo-20241017-bre-9372-v1/tokenizer_config.json
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-nemo-20241017-bre-9372-v1/tokenizer.json
chaiml-nemo-20241017-bre-9372-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-nemo-20241017-bre-9372-v1/flywheel_model.0.safetensors
chaiml-nemo-20241017-bre-9372-v1-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%| | 2/363 [00:06<18:17, 3.04s/it]
Loading 0: 2%|▏ | 6/363 [00:06<04:53, 1.22it/s]
Loading 0: 3%|▎ | 11/363 [00:06<02:09, 2.73it/s]
Loading 0: 4%|▍ | 15/363 [00:06<01:22, 4.24it/s]
Loading 0: 6%|▌ | 22/363 [00:06<00:42, 7.93it/s]
Loading 0: 7%|▋ | 27/363 [00:06<00:31, 10.81it/s]
Loading 0: 9%|▉ | 32/363 [00:06<00:23, 14.18it/s]
Loading 0: 10%|█ | 37/363 [00:06<00:17, 18.16it/s]
Loading 0: 12%|█▏ | 42/363 [00:07<00:19, 16.49it/s]
Loading 0: 13%|█▎ | 49/363 [00:07<00:13, 23.07it/s]
Loading 0: 15%|█▍ | 54/363 [00:07<00:11, 25.92it/s]
Loading 0: 16%|█▋ | 59/363 [00:07<00:10, 27.68it/s]
Loading 0: 17%|█▋ | 63/363 [00:07<00:10, 29.56it/s]
Loading 0: 18%|█▊ | 67/363 [00:07<00:09, 30.90it/s]
Loading 0: 20%|█▉ | 71/363 [00:08<00:09, 31.30it/s]
Loading 0: 21%|██ | 76/363 [00:08<00:08, 35.23it/s]
Loading 0: 22%|██▏ | 80/363 [00:08<00:08, 35.22it/s]
Loading 0: 23%|██▎ | 85/363 [00:08<00:07, 38.91it/s]
Loading 0: 25%|██▍ | 90/363 [00:08<00:07, 38.39it/s]
Loading 0: 26%|██▌ | 95/363 [00:08<00:06, 39.44it/s]
Loading 0: 28%|██▊ | 100/363 [00:08<00:06, 41.52it/s]
Loading 0: 29%|██▉ | 105/363 [00:09<00:09, 27.83it/s]
Loading 0: 30%|███ | 110/363 [00:09<00:07, 32.09it/s]
Loading 0: 31%|███▏ | 114/363 [00:09<00:08, 30.88it/s]
Loading 0: 33%|███▎ | 121/363 [00:09<00:08, 28.86it/s]
Loading 0: 34%|███▍ | 125/363 [00:09<00:07, 29.93it/s]
Loading 0: 36%|███▌ | 130/363 [00:09<00:06, 34.04it/s]
Loading 0: 37%|███▋ | 134/363 [00:09<00:06, 34.68it/s]
Loading 0: 39%|███▊ | 140/363 [00:10<00:05, 39.33it/s]
Loading 0: 40%|███▉ | 145/363 [00:10<00:05, 41.31it/s]
Loading 0: 41%|████▏ | 150/363 [00:10<00:05, 36.13it/s]
Loading 0: 43%|████▎ | 157/363 [00:10<00:04, 43.09it/s]
Loading 0: 45%|████▍ | 162/363 [00:10<00:04, 43.12it/s]
Loading 0: 46%|████▌ | 167/363 [00:10<00:04, 43.92it/s]
Loading 0: 48%|████▊ | 173/363 [00:10<00:04, 42.01it/s]
Loading 0: 49%|████▉ | 178/363 [00:10<00:04, 40.11it/s]
Loading 0: 51%|█████ | 184/363 [00:11<00:04, 44.16it/s]
Loading 0: 52%|█████▏ | 189/363 [00:11<00:03, 44.37it/s]
Loading 0: 53%|█████▎ | 194/363 [00:11<00:03, 45.07it/s]
Loading 0: 55%|█████▌ | 200/363 [00:11<00:03, 42.45it/s]
Loading 0: 56%|█████▋ | 205/363 [00:11<00:05, 28.90it/s]
Loading 0: 58%|█████▊ | 211/363 [00:11<00:04, 33.36it/s]
Loading 0: 59%|█████▉ | 215/363 [00:11<00:04, 32.91it/s]
Loading 0: 61%|██████ | 220/363 [00:12<00:04, 35.30it/s]
Loading 0: 62%|██████▏ | 224/363 [00:12<00:04, 33.53it/s]
Loading 0: 63%|██████▎ | 228/363 [00:12<00:03, 34.16it/s]
Loading 0: 64%|██████▍ | 232/363 [00:12<00:04, 30.98it/s]
Loading 0: 65%|██████▌ | 236/363 [00:12<00:03, 32.48it/s]
Loading 0: 66%|██████▌ | 240/363 [00:12<00:04, 29.43it/s]
Loading 0: 67%|██████▋ | 245/363 [00:12<00:03, 32.92it/s]
Loading 0: 69%|██████▊ | 249/363 [00:13<00:03, 29.99it/s]
Loading 0: 70%|██████▉ | 254/363 [00:13<00:03, 34.14it/s]
Loading 0: 71%|███████ | 258/363 [00:13<00:03, 30.81it/s]
Loading 0: 72%|███████▏ | 263/363 [00:13<00:02, 34.17it/s]
Loading 0: 74%|███████▎ | 267/363 [00:13<00:03, 30.71it/s]
Loading 0: 75%|███████▍ | 272/363 [00:13<00:02, 33.29it/s]
Loading 0: 76%|███████▌ | 276/363 [00:13<00:02, 29.95it/s]
Loading 0: 77%|███████▋ | 281/363 [00:14<00:02, 33.14it/s]
Loading 0: 79%|███████▊ | 285/363 [00:14<00:03, 20.11it/s]
Loading 0: 80%|███████▉ | 290/363 [00:14<00:02, 24.33it/s]
Loading 0: 81%|████████ | 294/363 [00:14<00:02, 24.30it/s]
Loading 0: 82%|████████▏ | 299/363 [00:14<00:02, 28.74it/s]
Loading 0: 83%|████████▎ | 303/363 [00:15<00:02, 27.04it/s]
Loading 0: 85%|████████▍ | 308/363 [00:15<00:01, 31.26it/s]
Loading 0: 86%|████████▌ | 312/363 [00:15<00:01, 29.48it/s]
Loading 0: 87%|████████▋ | 317/363 [00:15<00:01, 33.22it/s]
Loading 0: 88%|████████▊ | 321/363 [00:15<00:01, 29.80it/s]
Loading 0: 90%|████████▉ | 326/363 [00:15<00:01, 33.16it/s]
Loading 0: 91%|█████████ | 330/363 [00:15<00:01, 30.66it/s]
Loading 0: 92%|█████████▏| 335/363 [00:15<00:00, 34.74it/s]
Loading 0: 93%|█████████▎| 339/363 [00:16<00:00, 31.98it/s]
Loading 0: 95%|█████████▌| 346/363 [00:16<00:00, 38.97it/s]
Loading 0: 97%|█████████▋| 351/363 [00:16<00:00, 40.34it/s]
Loading 0: 98%|█████████▊| 356/363 [00:16<00:00, 41.87it/s]
Loading 0: 100%|█████████▉| 362/363 [00:16<00:00, 40.28it/s]
Job chaiml-nemo-20241017-bre-9372-v1-mkmlizer completed after 135.63s with status: succeeded
Stopping job with name chaiml-nemo-20241017-bre-9372-v1-mkmlizer
Pipeline stage MKMLizer completed in 136.17s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.19s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-nemo-20241017-bre-9372-v1
Waiting for inference service chaiml-nemo-20241017-bre-9372-v1 to be ready
Inference service chaiml-nemo-20241017-bre-9372-v1 ready after 180.66426277160645s
Pipeline stage MKMLDeployer completed in 181.25s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.1668052673339844s
Received healthy response to inference request in 1.7370216846466064s
Received healthy response to inference request in 1.97835111618042s
Received healthy response to inference request in 1.750448226928711s
Received healthy response to inference request in 1.487224817276001s
5 requests
0 failed requests
5th percentile: 1.537184190750122
10th percentile: 1.587143564224243
20th percentile: 1.6870623111724854
30th percentile: 1.7397069931030273
40th percentile: 1.7450776100158691
50th percentile: 1.750448226928711
60th percentile: 1.8416093826293944
70th percentile: 1.9327705383300782
80th percentile: 2.016041946411133
90th percentile: 2.0914236068725587
95th percentile: 2.1291144371032713
99th percentile: 2.159267101287842
mean time: 1.8239702224731444
Pipeline stage StressChecker completed in 10.47s
Shutdown handler de-registered
chaiml-nemo-20241017-bre_9372_v1 status is now deployed due to DeploymentManager action
chaiml-nemo-20241017-bre_9372_v1 status is now inactive due to auto deactivation removed underperforming models
chaiml-nemo-20241017-bre_9372_v1 status is now torndown due to DeploymentManager action