Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-nemo-20241010-t-5991-v111-mkmlizer
Waiting for job on chaiml-nemo-20241010-t-5991-v111-mkmlizer to finish
chaiml-nemo-20241010-t-5991-v111-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-nemo-20241010-t-5991-v111-mkmlizer: ║ _____ __ __ ║
chaiml-nemo-20241010-t-5991-v111-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-nemo-20241010-t-5991-v111-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-nemo-20241010-t-5991-v111-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-nemo-20241010-t-5991-v111-mkmlizer: ║ /___/ ║
chaiml-nemo-20241010-t-5991-v111-mkmlizer: ║ ║
chaiml-nemo-20241010-t-5991-v111-mkmlizer: ║ Version: 0.11.12 ║
chaiml-nemo-20241010-t-5991-v111-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-nemo-20241010-t-5991-v111-mkmlizer: ║ https://mk1.ai ║
chaiml-nemo-20241010-t-5991-v111-mkmlizer: ║ ║
chaiml-nemo-20241010-t-5991-v111-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-nemo-20241010-t-5991-v111-mkmlizer: ║ belonging to: ║
chaiml-nemo-20241010-t-5991-v111-mkmlizer: ║ ║
chaiml-nemo-20241010-t-5991-v111-mkmlizer: ║ Chai Research Corp. ║
chaiml-nemo-20241010-t-5991-v111-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-nemo-20241010-t-5991-v111-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
chaiml-nemo-20241010-t-5991-v111-mkmlizer: ║ ║
chaiml-nemo-20241010-t-5991-v111-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-nemo-20241010-t-5991-v111-mkmlizer: Downloaded to shared memory in 27.645s
chaiml-nemo-20241010-t-5991-v111-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmppdf3eyj7, device:0
chaiml-nemo-20241010-t-5991-v111-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-nemo-20241010-t-5991-v111-mkmlizer: quantized model in 36.524s
chaiml-nemo-20241010-t-5991-v111-mkmlizer: Processed model ChaiML/nemo-20241010_tier_merge_v4-albert in 64.169s
chaiml-nemo-20241010-t-5991-v111-mkmlizer: creating bucket guanaco-mkml-models
chaiml-nemo-20241010-t-5991-v111-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-nemo-20241010-t-5991-v111-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v111
chaiml-nemo-20241010-t-5991-v111-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v111/special_tokens_map.json
chaiml-nemo-20241010-t-5991-v111-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v111/config.json
chaiml-nemo-20241010-t-5991-v111-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v111/tokenizer_config.json
chaiml-nemo-20241010-t-5991-v111-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-nemo-20241010-t-5991-v111/tokenizer.json
chaiml-nemo-20241010-t-5991-v111-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%| | 2/363 [00:06<18:26, 3.06s/it]
Loading 0: 2%|▏ | 6/363 [00:06<04:55, 1.21it/s]
Loading 0: 3%|▎ | 11/363 [00:06<02:09, 2.71it/s]
Loading 0: 4%|▍ | 15/363 [00:06<01:22, 4.24it/s]
Loading 0: 6%|▋ | 23/363 [00:06<00:40, 8.40it/s]
Loading 0: 8%|▊ | 29/363 [00:06<00:28, 11.78it/s]
Loading 0: 9%|▉ | 34/363 [00:06<00:21, 15.06it/s]
Loading 0: 11%|█ | 40/363 [00:07<00:19, 16.69it/s]
Loading 0: 12%|█▏ | 44/363 [00:07<00:16, 19.06it/s]
Loading 0: 14%|█▍ | 50/363 [00:07<00:12, 24.29it/s]
Loading 0: 15%|█▌ | 55/363 [00:07<00:10, 28.24it/s]
Loading 0: 17%|█▋ | 60/363 [00:07<00:10, 28.31it/s]
Loading 0: 19%|█▊ | 68/363 [00:07<00:08, 36.79it/s]
Loading 0: 20%|██ | 74/363 [00:08<00:07, 38.06it/s]
Loading 0: 22%|██▏ | 79/363 [00:08<00:07, 38.25it/s]
Loading 0: 24%|██▎ | 86/363 [00:08<00:06, 43.55it/s]
Loading 0: 25%|██▌ | 92/363 [00:08<00:06, 42.93it/s]
Loading 0: 27%|██▋ | 97/363 [00:08<00:06, 42.00it/s]
Loading 0: 29%|██▊ | 104/363 [00:08<00:05, 47.12it/s]
Loading 0: 30%|███ | 110/363 [00:08<00:05, 45.75it/s]
Loading 0: 32%|███▏ | 115/363 [00:08<00:05, 44.18it/s]
Loading 0: 33%|███▎ | 121/363 [00:09<00:07, 31.83it/s]
Loading 0: 34%|███▍ | 125/363 [00:09<00:07, 31.92it/s]
Loading 0: 36%|███▌ | 130/363 [00:09<00:06, 35.38it/s]
Loading 0: 37%|███▋ | 134/363 [00:09<00:06, 35.20it/s]
Loading 0: 38%|███▊ | 139/363 [00:09<00:05, 38.07it/s]
Loading 0: 40%|███▉ | 144/363 [00:09<00:05, 39.03it/s]
Loading 0: 41%|████ | 149/363 [00:09<00:05, 40.81it/s]
Loading 0: 42%|████▏ | 154/363 [00:09<00:04, 42.48it/s]
Loading 0: 44%|████▍ | 159/363 [00:10<00:05, 36.09it/s]
Loading 0: 46%|████▌ | 167/363 [00:10<00:04, 44.24it/s]
Loading 0: 47%|████▋ | 172/363 [00:10<00:04, 45.31it/s]
Loading 0: 49%|████▉ | 177/363 [00:10<00:04, 37.79it/s]
Loading 0: 51%|█████ | 185/363 [00:10<00:03, 45.11it/s]
Loading 0: 53%|█████▎ | 191/363 [00:10<00:03, 43.06it/s]
Loading 0: 54%|█████▍ | 196/363 [00:10<00:03, 42.28it/s]
Loading 0: 56%|█████▌ | 202/363 [00:11<00:05, 31.74it/s]
Loading 0: 57%|█████▋ | 206/363 [00:11<00:04, 32.21it/s]
Loading 0: 58%|█████▊ | 212/363 [00:11<00:04, 36.76it/s]
Loading 0: 60%|█████▉ | 217/363 [00:11<00:03, 39.37it/s]
Loading 0: 61%|██████ | 222/363 [00:11<00:04, 33.69it/s]
Loading 0: 63%|██████▎ | 230/363 [00:11<00:03, 42.29it/s]
Loading 0: 65%|██████▌ | 236/363 [00:12<00:03, 41.21it/s]
Loading 0: 66%|██████▋ | 241/363 [00:12<00:03, 40.41it/s]
Loading 0: 68%|██████▊ | 248/363 [00:12<00:02, 45.47it/s]
Loading 0: 70%|██████▉ | 254/363 [00:12<00:02, 44.77it/s]
Loading 0: 71%|███████▏ | 259/363 [00:12<00:02, 44.25it/s]
Loading 0: 73%|███████▎ | 266/363 [00:12<00:02, 48.48it/s]
Loading 0: 75%|███████▍ | 272/363 [00:12<00:01, 45.91it/s]
Loading 0: 76%|███████▋ | 277/363 [00:12<00:01, 44.35it/s]
Loading 0: 78%|███████▊ | 283/363 [00:13<00:02, 33.57it/s]
Loading 0: 79%|███████▉ | 287/363 [00:13<00:02, 34.11it/s]
Loading 0: 81%|████████ | 293/363 [00:13<00:01, 38.98it/s]
Loading 0: 82%|████████▏ | 299/363 [00:13<00:01, 40.11it/s]
Loading 0: 84%|████████▎ | 304/363 [00:13<00:01, 40.27it/s]
Loading 0: 86%|████████▌ | 311/363 [00:13<00:01, 45.85it/s]
Loading 0: 87%|████████▋ | 317/363 [00:14<00:01, 45.12it/s]
Loading 0: 89%|████████▊ | 322/363 [00:14<00:00, 42.97it/s]
Loading 0: 91%|█████████ | 329/363 [00:14<00:00, 47.55it/s]
Loading 0: 92%|█████████▏| 335/363 [00:14<00:00, 46.04it/s]
Loading 0: 94%|█████████▎| 340/363 [00:14<00:00, 44.63it/s]
Loading 0: 96%|█████████▌| 347/363 [00:14<00:00, 48.76it/s]
Loading 0: 97%|█████████▋| 353/363 [00:14<00:00, 46.72it/s]
Loading 0: 99%|█████████▊| 358/363 [00:14<00:00, 45.73it/s]
Job chaiml-nemo-20241010-t-5991-v111-mkmlizer completed after 93.83s with status: succeeded
Stopping job with name chaiml-nemo-20241010-t-5991-v111-mkmlizer
Pipeline stage MKMLizer completed in 94.82s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.21s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-nemo-20241010-t-5991-v111
Waiting for inference service chaiml-nemo-20241010-t-5991-v111 to be ready
Failed to get response for submission zonemercy-lexical-nemov8_5966_v2: ('http://zonemercy-lexical-nemov8-5966-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'read tcp 127.0.0.1:45882->127.0.0.1:8080: read: connection reset by peer\n')
Inference service chaiml-nemo-20241010-t-5991-v111 ready after 180.8278365135193s
Pipeline stage MKMLDeployer completed in 181.41s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.7536933422088623s
Received healthy response to inference request in 1.2574212551116943s
Received healthy response to inference request in 1.5262537002563477s
Received healthy response to inference request in 1.6296429634094238s
Received healthy response to inference request in 1.411642074584961s
5 requests
0 failed requests
5th percentile: 1.2882654190063476
10th percentile: 1.3191095829010009
20th percentile: 1.3807979106903077
30th percentile: 1.4345643997192383
40th percentile: 1.4804090499877929
50th percentile: 1.5262537002563477
60th percentile: 1.5676094055175782
70th percentile: 1.6089651107788085
80th percentile: 1.6544530391693115
90th percentile: 1.704073190689087
95th percentile: 1.7288832664489746
99th percentile: 1.7487313270568847
mean time: 1.5157306671142579
Pipeline stage StressChecker completed in 8.89s
Shutdown handler de-registered
chaiml-nemo-20241010-t_5991_v111 status is now deployed due to DeploymentManager action
chaiml-nemo-20241010-t_5991_v111 status is now inactive due to auto deactivation removed underperforming models
chaiml-nemo-20241010-t_5991_v111 status is now torndown due to DeploymentManager action