Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-nemo-20241010-t-5991-v172-mkmlizer
Waiting for job on chaiml-nemo-20241010-t-5991-v172-mkmlizer to finish
chaiml-nemo-20241010-t-5991-v172-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-nemo-20241010-t-5991-v172-mkmlizer: ║ _____ __ __ ║
chaiml-nemo-20241010-t-5991-v172-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-nemo-20241010-t-5991-v172-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-nemo-20241010-t-5991-v172-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-nemo-20241010-t-5991-v172-mkmlizer: ║ /___/ ║
chaiml-nemo-20241010-t-5991-v172-mkmlizer: ║ ║
chaiml-nemo-20241010-t-5991-v172-mkmlizer: ║ Version: 0.11.12 ║
chaiml-nemo-20241010-t-5991-v172-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-nemo-20241010-t-5991-v172-mkmlizer: ║ https://mk1.ai ║
chaiml-nemo-20241010-t-5991-v172-mkmlizer: ║ ║
chaiml-nemo-20241010-t-5991-v172-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-nemo-20241010-t-5991-v172-mkmlizer: ║ belonging to: ║
chaiml-nemo-20241010-t-5991-v172-mkmlizer: ║ ║
chaiml-nemo-20241010-t-5991-v172-mkmlizer: ║ Chai Research Corp. ║
chaiml-nemo-20241010-t-5991-v172-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-nemo-20241010-t-5991-v172-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
chaiml-nemo-20241010-t-5991-v172-mkmlizer: ║ ║
chaiml-nemo-20241010-t-5991-v172-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-nemo-20241010-t-5991-v172-mkmlizer: Downloaded to shared memory in 38.361s
chaiml-nemo-20241010-t-5991-v172-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpeqckiukp, device:0
chaiml-nemo-20241010-t-5991-v172-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-nemo-20241010-t-5991-v172-mkmlizer: quantized model in 36.589s
chaiml-nemo-20241010-t-5991-v172-mkmlizer: Processed model ChaiML/nemo-20241010_tier_merge_v4-albert in 74.950s
chaiml-nemo-20241010-t-5991-v172-mkmlizer: creating bucket guanaco-mkml-models
chaiml-nemo-20241010-t-5991-v172-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-nemo-20241010-t-5991-v172-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v172
chaiml-nemo-20241010-t-5991-v172-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v172/config.json
chaiml-nemo-20241010-t-5991-v172-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v172/special_tokens_map.json
chaiml-nemo-20241010-t-5991-v172-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v172/tokenizer_config.json
chaiml-nemo-20241010-t-5991-v172-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v172/tokenizer.json
chaiml-nemo-20241010-t-5991-v172-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%| | 2/363 [00:06<18:27, 3.07s/it]
Loading 0: 2%|▏ | 6/363 [00:06<04:56, 1.20it/s]
Loading 0: 3%|▎ | 11/363 [00:06<02:10, 2.70it/s]
Loading 0: 4%|▍ | 15/363 [00:06<01:22, 4.20it/s]
Loading 0: 6%|▋ | 23/363 [00:06<00:40, 8.32it/s]
Loading 0: 8%|▊ | 29/363 [00:06<00:28, 11.74it/s]
Loading 0: 9%|▉ | 34/363 [00:06<00:21, 15.00it/s]
Loading 0: 11%|█ | 40/363 [00:07<00:20, 16.15it/s]
Loading 0: 12%|█▏ | 44/363 [00:07<00:17, 18.63it/s]
Loading 0: 14%|█▍ | 50/363 [00:07<00:12, 24.17it/s]
Loading 0: 15%|█▌ | 56/363 [00:07<00:10, 28.43it/s]
Loading 0: 17%|█▋ | 61/363 [00:07<00:09, 31.15it/s]
Loading 0: 19%|█▊ | 68/363 [00:07<00:07, 37.24it/s]
Loading 0: 20%|██ | 74/363 [00:08<00:07, 39.10it/s]
Loading 0: 22%|██▏ | 79/363 [00:08<00:07, 37.86it/s]
Loading 0: 23%|██▎ | 85/363 [00:08<00:06, 42.48it/s]
Loading 0: 25%|██▍ | 90/363 [00:08<00:06, 42.10it/s]
Loading 0: 26%|██▌ | 95/363 [00:08<00:06, 43.39it/s]
Loading 0: 28%|██▊ | 101/363 [00:08<00:05, 43.79it/s]
Loading 0: 29%|██▉ | 106/363 [00:08<00:05, 44.07it/s]
Loading 0: 31%|███ | 113/363 [00:08<00:05, 48.25it/s]
Loading 0: 33%|███▎ | 119/363 [00:08<00:05, 46.68it/s]
Loading 0: 34%|███▍ | 124/363 [00:09<00:07, 30.76it/s]
Loading 0: 36%|███▌ | 130/363 [00:09<00:06, 35.00it/s]
Loading 0: 37%|███▋ | 135/363 [00:09<00:06, 36.18it/s]
Loading 0: 39%|███▊ | 140/363 [00:09<00:05, 38.18it/s]
Loading 0: 40%|████ | 146/363 [00:09<00:05, 38.96it/s]
Loading 0: 42%|████▏ | 151/363 [00:09<00:05, 39.75it/s]
Loading 0: 43%|████▎ | 157/363 [00:10<00:04, 44.54it/s]
Loading 0: 45%|████▍ | 162/363 [00:10<00:04, 43.96it/s]
Loading 0: 46%|████▋ | 168/363 [00:10<00:04, 40.84it/s]
Loading 0: 48%|████▊ | 176/363 [00:10<00:03, 47.62it/s]
Loading 0: 50%|████▉ | 181/363 [00:10<00:03, 47.97it/s]
Loading 0: 51%|█████ | 186/363 [00:10<00:04, 39.08it/s]
Loading 0: 53%|█████▎ | 193/363 [00:10<00:03, 43.91it/s]
Loading 0: 55%|█████▍ | 198/363 [00:10<00:03, 41.96it/s]
Loading 0: 56%|█████▌ | 203/363 [00:11<00:05, 28.46it/s]
Loading 0: 57%|█████▋ | 207/363 [00:11<00:05, 30.17it/s]
Loading 0: 58%|█████▊ | 212/363 [00:11<00:04, 33.64it/s]
Loading 0: 60%|██████ | 218/363 [00:11<00:04, 35.79it/s]
Loading 0: 61%|██████ | 222/363 [00:11<00:04, 33.35it/s]
Loading 0: 63%|██████▎ | 229/363 [00:11<00:03, 41.16it/s]
Loading 0: 64%|██████▍ | 234/363 [00:12<00:03, 41.37it/s]
Loading 0: 66%|██████▌ | 239/363 [00:12<00:02, 42.43it/s]
Loading 0: 67%|██████▋ | 245/363 [00:12<00:02, 42.60it/s]
Loading 0: 69%|██████▉ | 250/363 [00:12<00:02, 42.89it/s]
Loading 0: 71%|███████ | 257/363 [00:12<00:02, 48.20it/s]
Loading 0: 72%|███████▏ | 263/363 [00:12<00:02, 46.55it/s]
Loading 0: 74%|███████▍ | 268/363 [00:12<00:02, 45.26it/s]
Loading 0: 76%|███████▌ | 275/363 [00:12<00:01, 50.21it/s]
Loading 0: 77%|███████▋ | 281/363 [00:13<00:01, 46.86it/s]
Loading 0: 79%|███████▉ | 286/363 [00:13<00:02, 30.50it/s]
Loading 0: 81%|████████ | 293/363 [00:13<00:01, 36.52it/s]
Loading 0: 82%|████████▏ | 299/363 [00:13<00:01, 38.28it/s]
Loading 0: 84%|████████▎ | 304/363 [00:13<00:01, 39.43it/s]
Loading 0: 86%|████████▌ | 311/363 [00:13<00:01, 44.90it/s]
Loading 0: 87%|████████▋ | 317/363 [00:13<00:01, 44.21it/s]
Loading 0: 89%|████████▊ | 322/363 [00:14<00:00, 44.22it/s]
Loading 0: 91%|█████████ | 329/363 [00:14<00:00, 48.80it/s]
Loading 0: 92%|█████████▏| 335/363 [00:14<00:00, 42.01it/s]
Loading 0: 94%|█████████▎| 340/363 [00:14<00:00, 40.59it/s]
Loading 0: 95%|█████████▌| 346/363 [00:14<00:00, 44.48it/s]
Loading 0: 97%|█████████▋| 351/363 [00:14<00:00, 44.79it/s]
Loading 0: 98%|█████████▊| 356/363 [00:14<00:00, 45.89it/s]
Loading 0: 100%|█████████▉| 362/363 [00:15<00:00, 44.63it/s]
Job chaiml-nemo-20241010-t-5991-v172-mkmlizer completed after 104.33s with status: succeeded
Stopping job with name chaiml-nemo-20241010-t-5991-v172-mkmlizer
Pipeline stage MKMLizer completed in 104.82s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.16s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-nemo-20241010-t-5991-v172
Waiting for inference service chaiml-nemo-20241010-t-5991-v172 to be ready
Inference service chaiml-nemo-20241010-t-5991-v172 ready after 170.6086461544037s
Pipeline stage MKMLDeployer completed in 171.09s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.0225789546966553s
Received healthy response to inference request in 1.4021704196929932s
Received healthy response to inference request in 1.6970329284667969s
Received healthy response to inference request in 1.4566245079040527s
5 requests
1 failed requests
5th percentile: 1.4130612373352052
10th percentile: 1.423952054977417
20th percentile: 1.4457336902618407
30th percentile: 1.5047061920166016
40th percentile: 1.6008695602416991
50th percentile: 1.6970329284667969
60th percentile: 1.8272513389587401
70th percentile: 1.9574697494506836
80th percentile: 5.653392362594608
90th percentile: 12.915019178390505
95th percentile: 16.545832586288448
99th percentile: 19.45048331260681
mean time: 5.35101056098938
%s, retrying in %s seconds...
Received healthy response to inference request in 1.4271633625030518s
Received healthy response to inference request in 1.6678686141967773s
Received healthy response to inference request in 1.4991576671600342s
Received healthy response to inference request in 1.3729021549224854s
Received healthy response to inference request in 1.666154146194458s
5 requests
0 failed requests
5th percentile: 1.3837543964385985
10th percentile: 1.394606637954712
20th percentile: 1.4163111209869386
30th percentile: 1.4415622234344483
40th percentile: 1.4703599452972411
50th percentile: 1.4991576671600342
60th percentile: 1.5659562587738036
70th percentile: 1.6327548503875733
80th percentile: 1.666497039794922
90th percentile: 1.6671828269958495
95th percentile: 1.6675257205963134
99th percentile: 1.6678000354766847
mean time: 1.5266491889953613
Pipeline stage StressChecker completed in 37.60s
Shutdown handler de-registered
chaiml-nemo-20241010-t_5991_v172 status is now deployed due to DeploymentManager action
chaiml-nemo-20241010-t_5991_v172 status is now inactive due to auto deactivation removed underperforming models
chaiml-nemo-20241010-t_5991_v172 status is now torndown due to DeploymentManager action