Running pipeline stage MKMLizer
Starting job with name zonemercy-acute-nemo-v1-4488-v10-mkmlizer
Waiting for job on zonemercy-acute-nemo-v1-4488-v10-mkmlizer to finish
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: ║ _____ __ __ ║
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: ║ /___/ ║
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: ║ ║
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: ║ Version: 0.9.11 ║
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: ║ https://mk1.ai ║
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: ║ ║
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: ║ The license key for the current software has been verified as ║
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: ║ belonging to: ║
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: ║ ║
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: ║ Chai Research Corp. ║
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: ║ ║
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Failed to get response for submission undi95-meta-llama-3-70b_6209_v19: ('http://undi95-meta-llama-3-70b-6209-v19-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: Downloaded to shared memory in 84.838s
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmps8i5h77b, device:0
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: Saving flywheel model at /dev/shm/model_cache
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: quantized model in 42.450s
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: Processed model zonemercy/Acute-Nemo-v1-1e5ep1 in 127.288s
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: creating bucket guanaco-mkml-models
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/zonemercy-acute-nemo-v1-4488-v10/config.json
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/zonemercy-acute-nemo-v1-4488-v10/special_tokens_map.json
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/zonemercy-acute-nemo-v1-4488-v10/tokenizer_config.json
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/zonemercy-acute-nemo-v1-4488-v10/tokenizer.json
zonemercy-acute-nemo-v1-4488-v10-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/zonemercy-acute-nemo-v1-4488-v10/flywheel_model.0.safetensors
zonemercy-acute-nemo-v1-4488-v10-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 94%|█████████▎| 340/363 [00:13<00:00, 27.84it/s]
Loading 0: 95%|█████████▍| 344/363 [00:20<00:09, 1.96it/s]
Loading 0: 96%|█████████▌| 348/363 [00:20<00:05, 2.63it/s]
Loading 0: 97%|█████████▋| 353/363 [00:20<00:02, 3.81it/s]
Loading 0: 98%|█████████▊| 357/363 [00:20<00:01, 4.91it/s]
Loading 0: 100%|█████████▉| 362/363 [00:20<00:00, 7.00it/s]
Job zonemercy-acute-nemo-v1-4488-v10-mkmlizer completed after 155.95s with status: succeeded
Stopping job with name zonemercy-acute-nemo-v1-4488-v10-mkmlizer
Pipeline stage MKMLizer completed in 156.94s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.08s
Running pipeline stage ISVCDeployer
Creating inference service zonemercy-acute-nemo-v1-4488-v10
Waiting for inference service zonemercy-acute-nemo-v1-4488-v10 to be ready
Failed to get response for submission undi95-meta-llama-3-70b_6209_v19: ('http://undi95-meta-llama-3-70b-6209-v19-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission undi95-meta-llama-3-70b_6209_v19: ('http://undi95-meta-llama-3-70b-6209-v19-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission chaiml-llama-8b-pairwis_8189_v14: ('http://chaiml-llama-8b-pairwis-8189-v14-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission undi95-meta-llama-3-70b_6209_v19: ('http://undi95-meta-llama-3-70b-6209-v19-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission undi95-meta-llama-3-70b_6209_v19: ('http://undi95-meta-llama-3-70b-6209-v19-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission undi95-meta-llama-3-70b_6209_v19: ('http://undi95-meta-llama-3-70b-6209-v19-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission undi95-meta-llama-3-70b_6209_v19: ('http://undi95-meta-llama-3-70b-6209-v19-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission chaiml-llama-8b-pairwis_8189_v14: ('http://chaiml-llama-8b-pairwis-8189-v14-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission undi95-meta-llama-3-70b_6209_v19: ('http://undi95-meta-llama-3-70b-6209-v19-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission undi95-meta-llama-3-70b_6209_v19: ('http://undi95-meta-llama-3-70b-6209-v19-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission chaiml-llama-8b-pairwis_8189_v14: ('http://chaiml-llama-8b-pairwis-8189-v14-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission chaiml-llama-8b-pairwis_8189_v14: ('http://chaiml-llama-8b-pairwis-8189-v14-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission chaiml-llama-8b-pairwis_8189_v14: ('http://chaiml-llama-8b-pairwis-8189-v14-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission jellywibble-lora-120k-pr_2801_v2: ('http://jellywibble-lora-120k-pr-2801-v2-predictor-default.tenant-chaiml-guanaco.knative.ord1.coreweave.cloud/v1/models/GPT-J-6B-lit-v2:predict', 'read tcp 127.0.0.1:57312->127.0.0.1:8080: read: connection reset by peer\n')
Failed to get response for submission undi95-meta-llama-3-70b_6209_v19: ('http://undi95-meta-llama-3-70b-6209-v19-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'activator request timeout')
Failed to get response for submission chaiml-llama-8b-pairwis_8189_v14: ('http://chaiml-llama-8b-pairwis-8189-v14-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Inference service zonemercy-acute-nemo-v1-4488-v10 ready after 241.48556303977966s
Pipeline stage ISVCDeployer completed in 242.73s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.917360544204712s
Received healthy response to inference request in 2.657965660095215s
Received healthy response to inference request in 2.638561248779297s
Received healthy response to inference request in 2.73050594329834s
Received healthy response to inference request in 2.3256471157073975s
5 requests
0 failed requests
5th percentile: 2.3882299423217774
10th percentile: 2.4508127689361574
20th percentile: 2.575978422164917
30th percentile: 2.6424421310424804
40th percentile: 2.650203895568848
50th percentile: 2.657965660095215
60th percentile: 2.686981773376465
70th percentile: 2.715997886657715
80th percentile: 2.7678768634796143
90th percentile: 2.842618703842163
95th percentile: 2.8799896240234375
99th percentile: 2.909886360168457
mean time: 2.6540081024169924
Pipeline stage StressChecker completed in 14.03s
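
Note on the StressChecker figures above: the log does not say how the percentiles are computed. The sketch below assumes linear interpolation between the sorted samples, which appears to reproduce the reported values from the five logged latencies. The function name `percentile` is hypothetical and is not taken from the pipeline's code.

```python
def percentile(samples, q):
    """Return the q-th percentile (0-100) of samples.

    Assumption: linear interpolation between the two nearest ranks.
    The pipeline's actual method is not shown in the log.
    """
    ordered = sorted(samples)
    if not ordered:
        raise ValueError("need at least one sample")
    # Fractional position of the percentile within the sorted list.
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


# The five healthy response times reported by StressChecker, in seconds.
latencies = [
    2.917360544204712,
    2.657965660095215,
    2.638561248779297,
    2.73050594329834,
    2.3256471157073975,
]

for q in (5, 50, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples, every percentile other than the median is an interpolation between two neighbouring measurements, so the tail figures (95th, 99th) mostly reflect the single slowest request.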
zonemercy-acute-nemo-v1_4488_v10 status is now deployed due to DeploymentManager action
zonemercy-acute-nemo-v1_4488_v10 status is now inactive due to auto deactivation removed underperforming models
zonemercy-acute-nemo-v1_4488_v10 status is now torndown due to DeploymentManager action