Running pipeline stage MKMLizer
Starting job with name zonemercy-lexical-nemo-1518-v19-mkmlizer
Waiting for job on zonemercy-lexical-nemo-1518-v19-mkmlizer to finish
zonemercy-lexical-nemo-1518-v19-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
zonemercy-lexical-nemo-1518-v19-mkmlizer: ║ _____ __ __ ║
zonemercy-lexical-nemo-1518-v19-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
zonemercy-lexical-nemo-1518-v19-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
zonemercy-lexical-nemo-1518-v19-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
zonemercy-lexical-nemo-1518-v19-mkmlizer: ║ /___/ ║
zonemercy-lexical-nemo-1518-v19-mkmlizer: ║ ║
zonemercy-lexical-nemo-1518-v19-mkmlizer: ║ Version: 0.9.11 ║
zonemercy-lexical-nemo-1518-v19-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
zonemercy-lexical-nemo-1518-v19-mkmlizer: ║ https://mk1.ai ║
zonemercy-lexical-nemo-1518-v19-mkmlizer: ║ ║
zonemercy-lexical-nemo-1518-v19-mkmlizer: ║ The license key for the current software has been verified as ║
zonemercy-lexical-nemo-1518-v19-mkmlizer: ║ belonging to: ║
zonemercy-lexical-nemo-1518-v19-mkmlizer: ║ ║
zonemercy-lexical-nemo-1518-v19-mkmlizer: ║ Chai Research Corp. ║
zonemercy-lexical-nemo-1518-v19-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
zonemercy-lexical-nemo-1518-v19-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
zonemercy-lexical-nemo-1518-v19-mkmlizer: ║ ║
zonemercy-lexical-nemo-1518-v19-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
zonemercy-lexical-nemo-1518-v19-mkmlizer: Downloaded to shared memory in 57.120s
zonemercy-lexical-nemo-1518-v19-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpo_0ti74d, device:0
zonemercy-lexical-nemo-1518-v19-mkmlizer: Saving flywheel model at /dev/shm/model_cache
zonemercy-lexical-nemo-1518-v19-mkmlizer: quantized model in 40.330s
zonemercy-lexical-nemo-1518-v19-mkmlizer: Processed model zonemercy/Lexical-Nemo-v4-1k1e5 in 97.450s
zonemercy-lexical-nemo-1518-v19-mkmlizer: creating bucket guanaco-mkml-models
zonemercy-lexical-nemo-1518-v19-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
zonemercy-lexical-nemo-1518-v19-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/zonemercy-lexical-nemo-1518-v19
zonemercy-lexical-nemo-1518-v19-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/zonemercy-lexical-nemo-1518-v19/config.json
zonemercy-lexical-nemo-1518-v19-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/zonemercy-lexical-nemo-1518-v19/special_tokens_map.json
zonemercy-lexical-nemo-1518-v19-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/zonemercy-lexical-nemo-1518-v19/tokenizer_config.json
zonemercy-lexical-nemo-1518-v19-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/zonemercy-lexical-nemo-1518-v19/tokenizer.json
zonemercy-lexical-nemo-1518-v19-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/zonemercy-lexical-nemo-1518-v19/flywheel_model.0.safetensors
zonemercy-lexical-nemo-1518-v19-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%|▏ | 5/363 [00:00<00:16, 22.36it/s]
Loading 0: 3%|▎ | 10/363 [00:00<00:12, 28.19it/s]
Loading 0: 4%|▍ | 14/363 [00:00<00:13, 25.07it/s]
Loading 0: 6%|▌ | 21/363 [00:00<00:09, 35.82it/s]
Loading 0: 7%|▋ | 26/363 [00:01<00:15, 21.40it/s]
Loading 0: 9%|▊ | 31/363 [00:01<00:12, 26.14it/s]
Loading 0: 10%|▉ | 35/363 [00:01<00:11, 27.60it/s]
Loading 0: 11%|█ | 39/363 [00:01<00:11, 29.14it/s]
Loading 0: 12%|█▏ | 43/363 [00:01<00:11, 28.30it/s]
Loading 0: 13%|█▎ | 48/363 [00:01<00:10, 31.25it/s]
Loading 0: 14%|█▍ | 52/363 [00:01<00:10, 29.65it/s]
Loading 0: 15%|█▌ | 56/363 [00:01<00:10, 29.97it/s]
Loading 0: 17%|█▋ | 61/363 [00:02<00:11, 25.23it/s]
Loading 0: 18%|█▊ | 64/363 [00:02<00:13, 22.33it/s]
Loading 0: 20%|█▉ | 71/363 [00:02<00:09, 29.23it/s]
Loading 0: 21%|██ | 75/363 [00:02<00:10, 28.52it/s]
Loading 0: 22%|██▏ | 79/363 [00:02<00:10, 27.37it/s]
Loading 0: 23%|██▎ | 84/363 [00:03<00:09, 29.95it/s]
Loading 0: 24%|██▍ | 88/363 [00:03<00:09, 28.76it/s]
Loading 0: 26%|██▌ | 93/363 [00:03<00:08, 31.08it/s]
Loading 0: 27%|██▋ | 97/363 [00:03<00:09, 29.55it/s]
Loading 0: 28%|██▊ | 101/363 [00:03<00:11, 23.76it/s]
Loading 0: 29%|██▊ | 104/363 [00:03<00:12, 21.22it/s]
Loading 0: 30%|███ | 109/363 [00:04<00:09, 26.66it/s]
Loading 0: 31%|███ | 113/363 [00:04<00:10, 24.44it/s]
Loading 0: 33%|███▎ | 120/363 [00:04<00:07, 31.19it/s]
Loading 0: 34%|███▍ | 124/363 [00:04<00:08, 29.65it/s]
Loading 0: 36%|███▌ | 129/363 [00:04<00:07, 31.47it/s]
Loading 0: 37%|███▋ | 133/363 [00:04<00:07, 29.36it/s]
Loading 0: 38%|███▊ | 137/363 [00:04<00:07, 28.90it/s]
Loading 0: 39%|███▉ | 142/363 [00:05<00:08, 25.08it/s]
Loading 0: 40%|███▉ | 145/363 [00:05<00:09, 23.88it/s]
Loading 0: 41%|████ | 149/363 [00:05<00:09, 22.91it/s]
Loading 0: 43%|████▎ | 156/363 [00:05<00:06, 29.67it/s]
Loading 0: 44%|████▍ | 160/363 [00:05<00:07, 28.78it/s]
Loading 0: 45%|████▌ | 165/363 [00:05<00:06, 30.75it/s]
Loading 0: 47%|████▋ | 169/363 [00:06<00:06, 29.29it/s]
Loading 0: 48%|████▊ | 174/363 [00:06<00:05, 31.64it/s]
Loading 0: 49%|████▉ | 178/363 [00:06<00:06, 29.84it/s]
Loading 0: 50%|█████ | 182/363 [00:06<00:07, 23.49it/s]
Loading 0: 51%|█████ | 185/363 [00:06<00:08, 21.26it/s]
Loading 0: 53%|█████▎ | 192/363 [00:06<00:06, 28.24it/s]
Loading 0: 54%|█████▍ | 196/363 [00:07<00:06, 27.75it/s]
Loading 0: 55%|█████▌ | 201/363 [00:07<00:05, 30.42it/s]
Loading 0: 56%|█████▋ | 205/363 [00:07<00:05, 29.09it/s]
Loading 0: 58%|█████▊ | 210/363 [00:07<00:04, 31.49it/s]
Loading 0: 59%|█████▉ | 214/363 [00:07<00:04, 29.84it/s]
Loading 0: 60%|██████ | 218/363 [00:07<00:04, 29.66it/s]
Loading 0: 61%|██████▏ | 223/363 [00:08<00:05, 25.53it/s]
Loading 0: 62%|██████▏ | 226/363 [00:08<00:05, 24.15it/s]
Loading 0: 63%|██████▎ | 230/363 [00:08<00:05, 23.03it/s]
Loading 0: 65%|██████▌ | 237/363 [00:08<00:04, 29.88it/s]
Loading 0: 66%|██████▋ | 241/363 [00:08<00:04, 28.87it/s]
Loading 0: 68%|██████▊ | 246/363 [00:08<00:03, 31.36it/s]
Loading 0: 69%|██████▉ | 250/363 [00:09<00:03, 29.98it/s]
Loading 0: 70%|███████ | 255/363 [00:09<00:03, 31.92it/s]
Loading 0: 71%|███████▏ | 259/363 [00:09<00:03, 29.72it/s]
Loading 0: 72%|███████▏ | 263/363 [00:09<00:04, 24.04it/s]
Loading 0: 73%|███████▎ | 266/363 [00:09<00:04, 21.32it/s]
Loading 0: 75%|███████▌ | 273/363 [00:09<00:03, 28.12it/s]
Loading 0: 76%|███████▋ | 277/363 [00:10<00:03, 27.51it/s]
Loading 0: 78%|███████▊ | 282/363 [00:10<00:02, 29.68it/s]
Loading 0: 79%|███████▉ | 286/363 [00:10<00:02, 28.56it/s]
Loading 0: 80%|████████ | 291/363 [00:10<00:02, 30.31it/s]
Loading 0: 81%|████████▏ | 295/363 [00:10<00:02, 28.99it/s]
Loading 0: 82%|████████▏ | 299/363 [00:10<00:02, 28.75it/s]
Loading 0: 84%|████████▎ | 304/363 [00:11<00:02, 25.09it/s]
Loading 0: 85%|████████▍ | 307/363 [00:11<00:02, 23.89it/s]
Loading 0: 86%|████████▌ | 311/363 [00:11<00:02, 22.98it/s]
Loading 0: 88%|████████▊ | 318/363 [00:11<00:01, 29.80it/s]
Loading 0: 89%|████████▊ | 322/363 [00:11<00:01, 28.80it/s]
Loading 0: 90%|█████████ | 327/363 [00:11<00:01, 31.27it/s]
Loading 0: 91%|█████████ | 331/363 [00:11<00:01, 29.79it/s]
Loading 0: 93%|█████████▎| 336/363 [00:12<00:00, 32.07it/s]
Loading 0: 94%|█████████▎| 340/363 [00:12<00:00, 30.32it/s]
Loading 0: 95%|█████████▍| 344/363 [00:19<00:09, 1.99it/s]
Loading 0: 96%|█████████▌| 348/363 [00:19<00:05, 2.67it/s]
Loading 0: 97%|█████████▋| 353/363 [00:19<00:02, 3.87it/s]
Loading 0: 98%|█████████▊| 357/363 [00:19<00:01, 5.02it/s]
Failed to get response for submission blend_gases_2024-08-15: ('http://zonemercy-lexical-nemov8-5966-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'read tcp 127.0.0.1:34664->127.0.0.1:8080: read: connection reset by peer\n')
Job zonemercy-lexical-nemo-1518-v19-mkmlizer completed after 125.83s with status: succeeded
Stopping job with name zonemercy-lexical-nemo-1518-v19-mkmlizer
Pipeline stage MKMLizer completed in 126.61s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.09s
Running pipeline stage ISVCDeployer
Creating inference service zonemercy-lexical-nemo-1518-v19
Waiting for inference service zonemercy-lexical-nemo-1518-v19 to be ready
Failed to get response for submission function_fahor_2024-08-15: no entry with id "function_fahor_2024-08-15" found on database!
Failed to get response for submission function_fahor_2024-08-15: no entry with id "function_fahor_2024-08-15" found on database!
Inference service zonemercy-lexical-nemo-1518-v19 ready after 221.40770173072815s
Pipeline stage ISVCDeployer completed in 223.12s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2906715869903564s
Received healthy response to inference request in 2.7498793601989746s
Received healthy response to inference request in 1.7737512588500977s
Received healthy response to inference request in 1.7469151020050049s
Received healthy response to inference request in 1.6830976009368896s
5 requests
0 failed requests
5th percentile: 1.6958611011505127
10th percentile: 1.7086246013641357
20th percentile: 1.7341516017913818
30th percentile: 1.7522823333740234
40th percentile: 1.7630167961120606
50th percentile: 1.7737512588500977
60th percentile: 1.980519390106201
70th percentile: 2.1872875213623044
80th percentile: 2.38251314163208
90th percentile: 2.566196250915527
95th percentile: 2.658037805557251
99th percentile: 2.73151104927063
mean time: 2.0488629817962645
Pipeline stage StressChecker completed in 11.24s
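
Note on the StressChecker summary above: the log does not show how the percentiles are computed. The sketch below assumes linear interpolation between sorted samples, the default rule of numpy.percentile, and applies it to the five latencies reported above. The helper name `percentile` is hypothetical and is not taken from the pipeline's code. Under that assumption the results should agree with the logged percentiles and mean to within floating-point rounding.

```python
import statistics

# Latencies in seconds, copied from the five
# "Received healthy response to inference request" lines in the log.
latencies = [
    2.2906715869903564,
    2.7498793601989746,
    1.7737512588500977,
    1.7469151020050049,
    1.6830976009368896,
]


def percentile(values, q):
    """Percentile by linear interpolation between the two nearest ranks.

    Assumption: this mirrors the default rule of numpy.percentile; the
    StressChecker's real implementation is not visible in the log.
    """
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0    # fractional index into the sorted list
    lower = int(pos)                        # rank at or below pos
    upper = min(lower + 1, len(ordered) - 1)
    frac = pos - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


if __name__ == "__main__":
    print(f"{len(latencies)} requests")
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {statistics.fmean(latencies)}")
```

With only five samples, every percentile above the 75th is interpolated between the two slowest requests, so the 90th to 99th percentile figures mostly reflect the single 2.75 s response.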
zonemercy-lexical-nemo-_1518_v19 status is now deployed due to DeploymentManager action
zonemercy-lexical-nemo-_1518_v19 status is now inactive due to auto deactivation removed underperforming models
zonemercy-lexical-nemo-_1518_v19 status is now torndown due to DeploymentManager action