Running pipeline stage MKMLizer
Starting job with name zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer
Waiting for job on zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer to finish
Stopping job with name zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer
%s, retrying in %s seconds...
Starting job with name zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer
Waiting for job on zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer to finish
Failed to get response for submission zonemercy-acute-nemo-v1_4488_v11: ('http://zonemercy-acute-nemo-v1-4488-v11-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'read tcp 127.0.0.1:50540->127.0.0.1:8080: read: connection reset by peer\n')
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: ║ _____ __ __ ║
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: ║ /___/ ║
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: ║ ║
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: ║ Version: 0.9.11 ║
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: ║ https://mk1.ai ║
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: ║ ║
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: ║ The license key for the current software has been verified as ║
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: ║ belonging to: ║
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: ║ ║
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: ║ Chai Research Corp. ║
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: ║ ║
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Failed to get response for submission zonemercy-acute-nemo-v1_4488_v11: ('http://zonemercy-acute-nemo-v1-4488-v11-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'read tcp 127.0.0.1:50616->127.0.0.1:8080: read: connection reset by peer\n')
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: Downloaded to shared memory in 55.553s
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmprl9ptt3t, device:0
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Failed to get response for submission function_dorob_2024-08-17: ('http://zonemercy-lexical-nemo-1518-v12-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'read tcp 127.0.0.1:45916->127.0.0.1:8080: read: connection reset by peer\n')
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: quantized model in 42.831s
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: Processed model zonemercy/Cogent-Nemo-v2-5e6 in 98.384s
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/zonemercy-cogent-nemo-v2-5e6-v18
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/zonemercy-cogent-nemo-v2-5e6-v18/special_tokens_map.json
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/zonemercy-cogent-nemo-v2-5e6-v18/config.json
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/zonemercy-cogent-nemo-v2-5e6-v18/tokenizer_config.json
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/zonemercy-cogent-nemo-v2-5e6-v18/tokenizer.json
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/zonemercy-cogent-nemo-v2-5e6-v18/flywheel_model.0.safetensors
zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 100%|█████████▉| 362/363 [00:20<00:00, 6.98it/s]
Job zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer completed after 126.43s with status: succeeded
Stopping job with name zonemercy-cogent-nemo-v2-5e6-v18-mkmlizer
Pipeline stage MKMLizer completed in 128.06s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.09s
Running pipeline stage ISVCDeployer
Creating inference service zonemercy-cogent-nemo-v2-5e6-v18
Waiting for inference service zonemercy-cogent-nemo-v2-5e6-v18 to be ready
Failed to get response for submission zonemercy-acute-nemo-v1_4488_v10: ('http://zonemercy-acute-nemo-v1-4488-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission zonemercy-acute-nemo-v1_4488_v10: ('http://zonemercy-acute-nemo-v1-4488-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'read tcp 127.0.0.1:48698->127.0.0.1:8080: read: connection reset by peer\n')
Failed to get response for submission undi95-meta-llama-3-70b_6209_v19: ('http://undi95-meta-llama-3-70b-6209-v19-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission zonemercy-acute-nemo-v1_4488_v10: ('http://zonemercy-acute-nemo-v1-4488-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission undi95-meta-llama-3-70b_6209_v19: ('http://undi95-meta-llama-3-70b-6209-v19-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Inference service zonemercy-cogent-nemo-v2-5e6-v18 ready after 241.63526844978333s
Pipeline stage ISVCDeployer completed in 242.87s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.5072991847991943s
Received healthy response to inference request in 2.0346500873565674s
Received healthy response to inference request in 2.05572772026062s
Received healthy response to inference request in 2.0312798023223877s
Received healthy response to inference request in 2.079730272293091s
5 requests
0 failed requests
5th percentile: 2.0319538593292235
10th percentile: 2.0326279163360597
20th percentile: 2.0339760303497316
30th percentile: 2.0388656139373778
40th percentile: 2.047296667098999
50th percentile: 2.05572772026062
60th percentile: 2.0653287410736083
70th percentile: 2.0749297618865965
80th percentile: 2.1652440547943117
90th percentile: 2.336271619796753
95th percentile: 2.4217854022979735
99th percentile: 2.49019642829895
mean time: 2.1417374134063722
Pipeline stage StressChecker completed in 11.46s
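
Note on the StressChecker summary above: the reported percentiles are consistent with linear interpolation between sorted samples (the default behaviour of NumPy's percentile function). The log does not show the StressChecker's actual implementation, so the following is only a sketch under that assumption. The helper name `percentile` and the variable `latencies` are hypothetical; the five latency values are copied from the "Received healthy response" lines.

```python
from statistics import mean


def percentile(samples, q):
    """Linearly interpolated percentile for q in [0, 100] (assumed method)."""
    ordered = sorted(samples)
    # Fractional position of the requested percentile in the sorted list.
    pos = (len(ordered) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(ordered) - 1)
    frac = pos - lo
    return ordered[lo] + (ordered[hi] - ordered[lo]) * frac


# Latencies in seconds, taken from the five healthy responses in the log.
latencies = [
    2.5072991847991943,
    2.0346500873565674,
    2.05572772026062,
    2.0312798023223877,
    2.079730272293091,
]

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean(latencies)}")
```

With only five samples, the upper percentiles are dominated by the single slowest request (about 2.51s), so the 95th and 99th percentile figures should be read with caution.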
zonemercy-cogent-nemo-v2-5e6_v18 status is now deployed due to DeploymentManager action
zonemercy-cogent-nemo-v2-5e6_v18 status is now inactive due to auto deactivation removed underperforming models
zonemercy-cogent-nemo-v2-5e6_v18 status is now torndown due to DeploymentManager action
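
Note on stage timings: the "Pipeline stage ... completed in ...s" lines can be extracted to see where the run spent its time. The following is a minimal sketch, not part of the pipeline itself; the regular expression and the function name `stage_durations` are assumptions made for illustration, and the input lines are copied from this log.

```python
import re

# Matches lines such as "Pipeline stage MKMLizer completed in 128.06s".
STAGE_DONE = re.compile(r"^Pipeline stage (\w+) completed in ([\d.]+)s$")


def stage_durations(lines):
    """Return a {stage name: seconds} mapping for completed stages."""
    durations = {}
    for line in lines:
        match = STAGE_DONE.match(line.strip())
        if match:
            durations[match.group(1)] = float(match.group(2))
    return durations


log_lines = [
    "Pipeline stage MKMLizer completed in 128.06s",
    "Pipeline stage MKMLKubeTemplater completed in 0.09s",
    "Pipeline stage ISVCDeployer completed in 242.87s",
    "Pipeline stage StressChecker completed in 11.46s",
]

durations = stage_durations(log_lines)
total = sum(durations.values())
for name, secs in durations.items():
    print(f"{name}: {secs:.2f}s ({secs / total:.1%})")
print(f"total: {total:.2f}s")
```

In this run, ISVCDeployer accounts for the largest share of the time, followed by MKMLizer.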