Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-nemo-chai-5merge-ties-v26-mkmlizer
Waiting for job on chaiml-nemo-chai-5merge-ties-v26-mkmlizer to finish
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: ║ _____ __ __ ║
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: ║ /___/ ║
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: ║ ║
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: ║ Version: 0.11.12 ║
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: ║ https://mk1.ai ║
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: ║ ║
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: ║ belonging to: ║
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: ║ ║
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: ║ Chai Research Corp. ║
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: ║ Expiration: 2025-01-15 23:59:59 ║
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: ║ ║
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: Downloaded to shared memory in 46.080s
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpmra9dx8f, device:0
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Retrying (%r) after connection broken by '%r': %s
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: quantized model in 36.834s
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: Processed model ChaiML/nemo-chai-5merge-ties in 82.914s
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: creating bucket guanaco-mkml-models
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-nemo-chai-5merge-ties-v26
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-nemo-chai-5merge-ties-v26/config.json
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-nemo-chai-5merge-ties-v26/special_tokens_map.json
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-nemo-chai-5merge-ties-v26/tokenizer_config.json
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-nemo-chai-5merge-ties-v26/tokenizer.json
chaiml-nemo-chai-5merge-ties-v26-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-nemo-chai-5merge-ties-v26/flywheel_model.0.safetensors
chaiml-nemo-chai-5merge-ties-v26-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%| | 2/363 [00:06<19:18, 3.21s/it]
Loading 0: 2%|▏ | 6/363 [00:06<05:08, 1.16it/s]
Loading 0: 4%|▎ | 13/363 [00:06<01:49, 3.19it/s]
Loading 0: 5%|▍ | 17/363 [00:06<01:14, 4.65it/s]
Loading 0: 6%|▋ | 23/363 [00:06<00:44, 7.63it/s]
Loading 0: 8%|▊ | 29/363 [00:07<00:30, 11.11it/s]
Loading 0: 9%|▉ | 34/363 [00:07<00:22, 14.42it/s]
Loading 0: 11%|█ | 40/363 [00:07<00:19, 16.60it/s]
Loading 0: 12%|█▏ | 44/363 [00:07<00:16, 18.86it/s]
Loading 0: 13%|█▎ | 49/363 [00:07<00:13, 22.69it/s]
Loading 0: 15%|█▍ | 53/363 [00:07<00:12, 24.60it/s]
Loading 0: 16%|█▌ | 58/363 [00:07<00:10, 29.27it/s]
Loading 0: 17%|█▋ | 62/363 [00:08<00:09, 30.79it/s]
Loading 0: 19%|█▊ | 68/363 [00:08<00:08, 36.19it/s]
Loading 0: 20%|██ | 74/363 [00:08<00:07, 37.40it/s]
Loading 0: 22%|██▏ | 79/363 [00:08<00:07, 37.46it/s]
Loading 0: 24%|██▎ | 86/363 [00:08<00:06, 42.92it/s]
Loading 0: 25%|██▌ | 92/363 [00:08<00:06, 42.59it/s]
Loading 0: 27%|██▋ | 97/363 [00:08<00:06, 40.60it/s]
Loading 0: 29%|██▊ | 104/363 [00:08<00:05, 45.59it/s]
Loading 0: 30%|███ | 109/363 [00:09<00:05, 46.43it/s]
Loading 0: 31%|███▏ | 114/363 [00:09<00:06, 39.86it/s]
Loading 0: 33%|███▎ | 121/363 [00:09<00:07, 33.93it/s]
Loading 0: 34%|███▍ | 125/363 [00:09<00:07, 33.56it/s]
Loading 0: 36%|███▌ | 131/363 [00:09<00:06, 38.19it/s]
Loading 0: 38%|███▊ | 137/363 [00:09<00:05, 39.30it/s]
Loading 0: 39%|███▉ | 142/363 [00:09<00:05, 39.49it/s]
Loading 0: 41%|████ | 149/363 [00:10<00:04, 44.50it/s]
Loading 0: 42%|████▏ | 154/363 [00:10<00:04, 44.74it/s]
Loading 0: 44%|████▍ | 159/363 [00:10<00:05, 36.45it/s]
Loading 0: 46%|████▌ | 167/363 [00:10<00:04, 44.56it/s]
Loading 0: 48%|████▊ | 173/363 [00:10<00:04, 44.14it/s]
Loading 0: 49%|████▉ | 178/363 [00:10<00:04, 43.58it/s]
Loading 0: 51%|█████ | 185/363 [00:10<00:03, 48.18it/s]
Loading 0: 52%|█████▏ | 190/363 [00:11<00:03, 48.07it/s]
Loading 0: 54%|█████▎ | 195/363 [00:11<00:04, 39.34it/s]
Loading 0: 56%|█████▌ | 202/363 [00:11<00:04, 32.67it/s]
Loading 0: 57%|█████▋ | 206/363 [00:11<00:04, 32.95it/s]
Loading 0: 58%|█████▊ | 212/363 [00:11<00:03, 37.80it/s]
Loading 0: 60%|█████▉ | 217/363 [00:11<00:03, 40.25it/s]
Loading 0: 61%|██████ | 222/363 [00:11<00:03, 35.78it/s]
Loading 0: 63%|██████▎ | 230/363 [00:12<00:03, 44.14it/s]
Loading 0: 65%|██████▌ | 236/363 [00:12<00:02, 44.27it/s]
Loading 0: 66%|██████▋ | 241/363 [00:12<00:02, 43.13it/s]
Loading 0: 68%|██████▊ | 248/363 [00:12<00:02, 47.15it/s]
Loading 0: 70%|██████▉ | 253/363 [00:12<00:02, 46.75it/s]
Loading 0: 71%|███████ | 258/363 [00:12<00:02, 36.48it/s]
Loading 0: 73%|███████▎ | 266/363 [00:12<00:02, 44.36it/s]
Loading 0: 75%|███████▍ | 271/363 [00:13<00:02, 45.36it/s]
Loading 0: 76%|███████▌ | 276/363 [00:13<00:02, 40.04it/s]
Loading 0: 78%|███████▊ | 283/363 [00:13<00:02, 34.57it/s]
Loading 0: 79%|███████▉ | 287/363 [00:13<00:02, 34.01it/s]
Loading 0: 80%|████████ | 292/363 [00:13<00:01, 37.21it/s]
Loading 0: 82%|████████▏ | 297/363 [00:13<00:01, 39.09it/s]
Loading 0: 83%|████████▎ | 302/363 [00:13<00:01, 40.88it/s]
Loading 0: 85%|████████▍ | 308/363 [00:14<00:01, 40.83it/s]
Loading 0: 86%|████████▌ | 313/363 [00:14<00:01, 41.17it/s]
Loading 0: 88%|████████▊ | 320/363 [00:14<00:00, 46.30it/s]
Loading 0: 90%|████████▉ | 325/363 [00:14<00:00, 46.77it/s]
Loading 0: 91%|█████████ | 330/363 [00:14<00:00, 38.51it/s]
Loading 0: 93%|█████████▎| 337/363 [00:14<00:00, 45.01it/s]
Loading 0: 94%|█████████▍| 342/363 [00:14<00:00, 43.86it/s]
Loading 0: 96%|█████████▌| 347/363 [00:14<00:00, 44.05it/s]
Loading 0: 97%|█████████▋| 353/363 [00:15<00:00, 42.78it/s]
Loading 0: 99%|█████████▊| 358/363 [00:15<00:00, 42.96it/s]
Job chaiml-nemo-chai-5merge-ties-v26-mkmlizer completed after 103.89s with status: succeeded
Stopping job with name chaiml-nemo-chai-5merge-ties-v26-mkmlizer
Pipeline stage MKMLizer completed in 104.85s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.16s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-nemo-chai-5merge-ties-v26
Waiting for inference service chaiml-nemo-chai-5merge-ties-v26 to be ready
Inference service chaiml-nemo-chai-5merge-ties-v26 ready after 180.63562035560608s
Pipeline stage MKMLDeployer completed in 181.83s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.2715883255004883s
Received healthy response to inference request in 1.934605598449707s
Received healthy response to inference request in 1.3962664604187012s
Received healthy response to inference request in 1.5006928443908691s
5 requests
1 failed requests
5th percentile: 1.4171517372131348
10th percentile: 1.4380370140075684
20th percentile: 1.4798075675964355
30th percentile: 1.5874753952026368
40th percentile: 1.761040496826172
50th percentile: 1.934605598449707
60th percentile: 2.0693986892700194
70th percentile: 2.204191780090332
80th percentile: 5.849067544937137
90th percentile: 13.004025983810426
95th percentile: 16.58150520324707
99th percentile: 19.443488578796387
mean time: 5.452427530288697
%s, retrying in %s seconds...
Received healthy response to inference request in 1.646742820739746s
Received healthy response to inference request in 1.5329513549804688s
Received healthy response to inference request in 1.5422229766845703s
Received healthy response to inference request in 1.7549519538879395s
Received healthy response to inference request in 1.796255111694336s
5 requests
0 failed requests
5th percentile: 1.534805679321289
10th percentile: 1.5366600036621094
20th percentile: 1.54036865234375
30th percentile: 1.5631269454956054
40th percentile: 1.6049348831176757
50th percentile: 1.646742820739746
60th percentile: 1.6900264739990234
70th percentile: 1.7333101272583007
80th percentile: 1.7632125854492187
90th percentile: 1.7797338485717773
95th percentile: 1.7879944801330567
99th percentile: 1.7946029853820802
mean time: 1.654624843597412
Pipeline stage StressChecker completed in 38.67s
Shutdown handler de-registered
chaiml-nemo-chai-5merge-ties_v26 status is now deployed due to DeploymentManager action
chaiml-nemo-chai-5merge-ties_v26 status is now inactive due to auto deactivation removed underperforming models
chaiml-nemo-chai-5merge-ties_v26 status is now torndown due to DeploymentManager action