Running pipeline stage MKMLizer
Starting job with name zonemercy-acute-nemo-v0-5087-v3-mkmlizer
Waiting for job on zonemercy-acute-nemo-v0-5087-v3-mkmlizer to finish
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: ║ _____ __ __ ║
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: ║ /___/ ║
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: ║ ║
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: ║ Version: 0.9.11 ║
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: ║ https://mk1.ai ║
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: ║ ║
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: ║ The license key for the current software has been verified as ║
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: ║ belonging to: ║
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: ║ ║
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: ║ Chai Research Corp. ║
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: ║ ║
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Failed to get response for submission function_fahor_2024-08-15: no entry with id "function_fahor_2024-08-15" found on database!
Failed to get response for submission zonemercy-cogent-nemo-v2-5e6_v13: ('http://zonemercy-cogent-nemo-v2-5e6-v13-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission zonemercy-cogent-nemo-v2-5e6_v13: ('http://zonemercy-cogent-nemo-v2-5e6-v13-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission function_fahor_2024-08-15: no entry with id "function_fahor_2024-08-15" found on database!
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: Downloaded to shared memory in 65.156s
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpcvjrmw8c, device:0
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Failed to get response for submission zonemercy-cogent-nemo-v2-5e6_v14: ('http://zonemercy-cogent-nemo-v2-5e6-v14-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission function_fahor_2024-08-15: no entry with id "function_fahor_2024-08-15" found on database!
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: quantized model in 41.507s
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: Processed model zonemercy/Acute-Nemo-v0-5e6ep1 in 106.663s
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: creating bucket guanaco-mkml-models
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/zonemercy-acute-nemo-v0-5087-v3
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/zonemercy-acute-nemo-v0-5087-v3/config.json
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/zonemercy-acute-nemo-v0-5087-v3/special_tokens_map.json
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/zonemercy-acute-nemo-v0-5087-v3/tokenizer.json
zonemercy-acute-nemo-v0-5087-v3-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/zonemercy-acute-nemo-v0-5087-v3/flywheel_model.0.safetensors
zonemercy-acute-nemo-v0-5087-v3-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%|▏ | 5/363 [00:00<00:16, 22.18it/s]
Loading 0: 3%|▎ | 10/363 [00:00<00:12, 28.38it/s]
Loading 0: 4%|▍ | 14/363 [00:00<00:13, 25.20it/s]
Loading 0: 6%|▌ | 21/363 [00:00<00:09, 35.97it/s]
Loading 0: 7%|▋ | 26/363 [00:01<00:15, 21.56it/s]
Loading 0: 9%|▊ | 31/363 [00:01<00:12, 26.20it/s]
Loading 0: 10%|▉ | 35/363 [00:01<00:12, 27.28it/s]
Loading 0: 11%|█ | 39/363 [00:01<00:11, 27.87it/s]
Loading 0: 12%|█▏ | 43/363 [00:01<00:11, 26.73it/s]
Loading 0: 13%|█▎ | 48/363 [00:01<00:10, 29.28it/s]
Loading 0: 14%|█▍ | 52/363 [00:01<00:11, 27.74it/s]
Loading 0: 15%|█▌ | 56/363 [00:02<00:11, 27.61it/s]
Loading 0: 17%|█▋ | 61/363 [00:02<00:12, 24.86it/s]
Loading 0: 18%|█▊ | 64/363 [00:02<00:13, 21.83it/s]
Loading 0: 19%|█▉ | 69/363 [00:02<00:10, 27.00it/s]
Loading 0: 20%|██ | 73/363 [00:02<00:11, 26.35it/s]
Loading 0: 21%|██ | 77/363 [00:02<00:12, 23.51it/s]
Loading 0: 23%|██▎ | 82/363 [00:03<00:09, 28.49it/s]
Loading 0: 24%|██▎ | 86/363 [00:03<00:10, 25.42it/s]
Loading 0: 26%|██▌ | 93/363 [00:03<00:08, 31.16it/s]
Loading 0: 27%|██▋ | 97/363 [00:03<00:09, 29.32it/s]
Loading 0: 28%|██▊ | 101/363 [00:03<00:11, 23.33it/s]
Loading 0: 29%|██▊ | 104/363 [00:04<00:12, 20.77it/s]
Loading 0: 31%|███ | 111/363 [00:04<00:09, 27.25it/s]
Loading 0: 32%|███▏ | 115/363 [00:04<00:09, 26.79it/s]
Loading 0: 33%|███▎ | 120/363 [00:04<00:08, 29.55it/s]
Loading 0: 34%|███▍ | 124/363 [00:04<00:08, 28.36it/s]
Loading 0: 36%|███▌ | 129/363 [00:04<00:07, 30.54it/s]
Loading 0: 37%|███▋ | 133/363 [00:04<00:07, 28.93it/s]
Loading 0: 38%|███▊ | 137/363 [00:05<00:07, 28.37it/s]
Loading 0: 39%|███▉ | 142/363 [00:05<00:08, 25.66it/s]
Loading 0: 40%|███▉ | 145/363 [00:05<00:09, 24.16it/s]
Loading 0: 41%|████ | 149/363 [00:05<00:09, 22.62it/s]
Loading 0: 43%|████▎ | 156/363 [00:05<00:07, 28.99it/s]
Loading 0: 44%|████▍ | 160/363 [00:05<00:07, 27.82it/s]
Loading 0: 45%|████▌ | 165/363 [00:06<00:06, 29.59it/s]
Loading 0: 47%|████▋ | 169/363 [00:06<00:06, 28.07it/s]
Loading 0: 48%|████▊ | 174/363 [00:06<00:06, 29.70it/s]
Loading 0: 49%|████▉ | 178/363 [00:06<00:06, 28.26it/s]
Loading 0: 50%|█████ | 182/363 [00:06<00:07, 22.87it/s]
Loading 0: 51%|█████ | 185/363 [00:07<00:08, 20.40it/s]
Loading 0: 53%|█████▎ | 192/363 [00:07<00:06, 27.03it/s]
Loading 0: 54%|█████▎ | 195/363 [00:07<00:06, 25.04it/s]
Loading 0: 55%|█████▍ | 199/363 [00:07<00:05, 27.92it/s]
Loading 0: 56%|█████▌ | 203/363 [00:07<00:06, 24.55it/s]
Loading 0: 58%|█████▊ | 210/363 [00:07<00:04, 30.78it/s]
Loading 0: 59%|█████▉ | 214/363 [00:08<00:05, 28.80it/s]
Loading 0: 60%|██████ | 218/363 [00:08<00:05, 28.39it/s]
Loading 0: 61%|██████▏ | 223/363 [00:08<00:05, 24.39it/s]
Loading 0: 62%|██████▏ | 226/363 [00:08<00:05, 22.97it/s]
Loading 0: 63%|██████▎ | 230/363 [00:08<00:06, 21.88it/s]
Loading 0: 65%|██████▍ | 235/363 [00:08<00:04, 26.70it/s]
Loading 0: 66%|██████▌ | 239/363 [00:09<00:05, 24.41it/s]
Loading 0: 68%|██████▊ | 246/363 [00:09<00:03, 30.32it/s]
Loading 0: 69%|██████▉ | 250/363 [00:09<00:03, 28.99it/s]
Loading 0: 70%|███████ | 255/363 [00:09<00:03, 30.79it/s]
Loading 0: 71%|███████▏ | 259/363 [00:09<00:03, 29.32it/s]
Loading 0: 72%|███████▏ | 263/363 [00:09<00:04, 23.10it/s]
Loading 0: 73%|███████▎ | 266/363 [00:10<00:04, 20.61it/s]
Loading 0: 75%|███████▌ | 273/363 [00:10<00:03, 27.45it/s]
Loading 0: 76%|███████▋ | 277/363 [00:10<00:03, 27.30it/s]
Loading 0: 78%|███████▊ | 282/363 [00:10<00:02, 30.08it/s]
Loading 0: 79%|███████▉ | 286/363 [00:10<00:02, 29.02it/s]
Loading 0: 80%|████████ | 291/363 [00:10<00:02, 31.19it/s]
Loading 0: 81%|████████▏ | 295/363 [00:11<00:02, 29.16it/s]
Loading 0: 82%|████████▏ | 299/363 [00:11<00:02, 29.12it/s]
Loading 0: 84%|████████▎ | 304/363 [00:11<00:02, 25.44it/s]
Loading 0: 85%|████████▍ | 307/363 [00:11<00:02, 23.87it/s]
Loading 0: 86%|████████▌ | 311/363 [00:11<00:02, 22.51it/s]
Loading 0: 88%|████████▊ | 318/363 [00:11<00:01, 28.98it/s]
Loading 0: 89%|████████▊ | 322/363 [00:12<00:01, 28.16it/s]
Loading 0: 90%|█████████ | 327/363 [00:12<00:01, 30.90it/s]
Loading 0: 91%|█████████ | 331/363 [00:12<00:01, 29.98it/s]
Loading 0: 93%|█████████▎| 336/363 [00:12<00:00, 32.15it/s]
Loading 0: 94%|█████████▎| 340/363 [00:12<00:00, 29.80it/s]
Loading 0: 95%|█████████▍| 344/363 [00:19<00:09, 1.97it/s]
Loading 0: 96%|█████████▌| 348/363 [00:19<00:05, 2.65it/s]
Loading 0: 97%|█████████▋| 353/363 [00:19<00:02, 3.84it/s]
Loading 0: 98%|█████████▊| 357/363 [00:20<00:01, 4.96it/s]
Job zonemercy-acute-nemo-v0-5087-v3-mkmlizer completed after 125.92s with status: succeeded
Stopping job with name zonemercy-acute-nemo-v0-5087-v3-mkmlizer
Pipeline stage MKMLizer completed in 126.94s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.10s
Running pipeline stage ISVCDeployer
Creating inference service zonemercy-acute-nemo-v0-5087-v3
Waiting for inference service zonemercy-acute-nemo-v0-5087-v3 to be ready
Failed to get response for submission zonemercy-cogent-nemo-v2-5e6_v13: ('http://zonemercy-cogent-nemo-v2-5e6-v13-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission function_fahor_2024-08-15: no entry with id "function_fahor_2024-08-15" found on database!
Failed to get response for submission function_fahor_2024-08-15: no entry with id "function_fahor_2024-08-15" found on database!
Failed to get response for submission zonemercy-cogent-nemo-v2-5e6_v14: ('http://zonemercy-cogent-nemo-v2-5e6-v14-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission function_fahor_2024-08-15: no entry with id "function_fahor_2024-08-15" found on database!
Failed to get response for submission function_fahor_2024-08-15: no entry with id "function_fahor_2024-08-15" found on database!
Failed to get response for submission zonemercy-cogent-nemo-v2-5e6_v14: ('http://zonemercy-cogent-nemo-v2-5e6-v14-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission chaiml-elo-alignment-run-3_v34: ('http://chaiml-elo-alignment-run-3-v34-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission chaiml-elo-alignment-run-3_v34: ('http://chaiml-elo-alignment-run-3-v34-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission zonemercy-cogent-nemo-v2-5e6_v13: ('http://zonemercy-cogent-nemo-v2-5e6-v13-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission function_fahor_2024-08-15: no entry with id "function_fahor_2024-08-15" found on database!
Failed to get response for submission chaiml-elo-alignment-run-3_v34: ('http://chaiml-elo-alignment-run-3-v34-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission function_fahor_2024-08-15: no entry with id "function_fahor_2024-08-15" found on database!
Failed to get response for submission function_fahor_2024-08-15: no entry with id "function_fahor_2024-08-15" found on database!
Failed to get response for submission chaiml-elo-alignment-run-3_v34: ('http://chaiml-elo-alignment-run-3-v34-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Inference service zonemercy-acute-nemo-v0-5087-v3 ready after 241.67638516426086s
Pipeline stage ISVCDeployer completed in 243.48s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.4728732109069824s
Failed to get response for submission zonemercy-cogent-nemo-v2-5e6_v14: ('http://zonemercy-cogent-nemo-v2-5e6-v14-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Received healthy response to inference request in 2.053493022918701s
Received healthy response to inference request in 2.557473659515381s
Received healthy response to inference request in 3.5845117568969727s
Received healthy response to inference request in 1.940204381942749s
5 requests
0 failed requests
5th percentile: 1.9628621101379395
10th percentile: 1.9855198383331298
20th percentile: 2.0308352947235107
30th percentile: 2.1373690605163573
40th percentile: 2.30512113571167
50th percentile: 2.4728732109069824
60th percentile: 2.506713390350342
70th percentile: 2.540553569793701
80th percentile: 2.762881278991699
90th percentile: 3.173696517944336
95th percentile: 3.3791041374206543
99th percentile: 3.543430233001709
mean time: 2.5217112064361573
Pipeline stage StressChecker completed in 13.35s
zonemercy-acute-nemo-v0-_5087_v3 status is now deployed due to DeploymentManager action
zonemercy-acute-nemo-v0-_5087_v3 status is now inactive due to auto deactivation removed underperforming models
zonemercy-acute-nemo-v0-_5087_v3 status is now torndown due to DeploymentManager action