Running pipeline stage MKMLizer
Starting job with name trace2333-ultra1w-dol1w-6284-v5-mkmlizer
Waiting for job on trace2333-ultra1w-dol1w-6284-v5-mkmlizer to finish
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: ║ _____ __ __ ║
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: ║ /___/ ║
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: ║ ║
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: ║ Version: 0.10.1 ║
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: ║ https://mk1.ai ║
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: ║ ║
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: ║ The license key for the current software has been verified as ║
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: ║ belonging to: ║
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: ║ ║
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: ║ Chai Research Corp. ║
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: ║ ║
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Failed to get response for submission blend_berib_2024-08-16: ('http://zonemercy-lexical-nemo-1518-v18-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'EOF\n')
Failed to get response for submission blend_jenes_2024-08-16: ('http://zonemercy-lexical-nemo-1518-v18-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'EOF\n')
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: Downloaded to shared memory in 69.720s
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp_qmbcoto, device:0
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Failed to get response for submission blend_remul_2024-08-22: ('http://zonemercy-lexical-nemo-1518-v18-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: quantized model in 28.476s
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: Processed model Trace2333/ultra1w_dol1w_fd2w_r32a16_qkvo_epoch6_v2 in 98.196s
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: creating bucket guanaco-mkml-models
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/trace2333-ultra1w-dol1w-6284-v5
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/trace2333-ultra1w-dol1w-6284-v5/config.json
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/trace2333-ultra1w-dol1w-6284-v5/special_tokens_map.json
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/trace2333-ultra1w-dol1w-6284-v5/tokenizer_config.json
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/trace2333-ultra1w-dol1w-6284-v5/tokenizer.json
trace2333-ultra1w-dol1w-6284-v5-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/trace2333-ultra1w-dol1w-6284-v5/flywheel_model.0.safetensors
trace2333-ultra1w-dol1w-6284-v5-mkmlizer:
Loading 0: 0%| | 0/291 [00:00<?, ?it/s]
Loading 0: 2%|▏ | 5/291 [00:00<00:09, 28.81it/s]
Loading 0: 4%|▍ | 12/291 [00:00<00:07, 35.80it/s]
Loading 0: 5%|▌ | 16/291 [00:00<00:08, 32.84it/s]
Loading 0: 7%|▋ | 21/291 [00:00<00:07, 36.65it/s]
Loading 0: 9%|▊ | 25/291 [00:00<00:07, 34.73it/s]
Loading 0: 11%|█ | 31/291 [00:00<00:06, 40.98it/s]
Loading 0: 12%|█▏ | 36/291 [00:01<00:10, 25.10it/s]
Loading 0: 14%|█▍ | 41/291 [00:01<00:09, 26.50it/s]
Loading 0: 16%|█▋ | 48/291 [00:01<00:07, 33.36it/s]
Loading 0: 18%|█▊ | 52/291 [00:01<00:07, 32.91it/s]
Loading 0: 20%|█▉ | 57/291 [00:01<00:06, 34.99it/s]
Loading 0: 21%|██ | 61/291 [00:01<00:06, 33.43it/s]
Loading 0: 23%|██▎ | 66/291 [00:01<00:06, 36.02it/s]
Loading 0: 24%|██▍ | 70/291 [00:02<00:06, 34.66it/s]
Loading 0: 25%|██▌ | 74/291 [00:02<00:06, 34.60it/s]
Loading 0: 27%|██▋ | 78/291 [00:02<00:06, 34.72it/s]
Loading 0: 28%|██▊ | 82/291 [00:02<00:08, 23.57it/s]
Loading 0: 29%|██▉ | 85/291 [00:02<00:08, 24.50it/s]
Loading 0: 31%|███ | 90/291 [00:02<00:06, 29.11it/s]
Loading 0: 32%|███▏ | 94/291 [00:02<00:06, 29.42it/s]
Loading 0: 34%|███▍ | 99/291 [00:03<00:05, 32.34it/s]
Loading 0: 35%|███▌ | 103/291 [00:03<00:05, 32.26it/s]
Loading 0: 37%|███▋ | 108/291 [00:03<00:05, 35.41it/s]
Loading 0: 38%|███▊ | 112/291 [00:03<00:05, 34.08it/s]
Loading 0: 40%|███▉ | 116/291 [00:03<00:05, 33.59it/s]
Loading 0: 42%|████▏ | 122/291 [00:03<00:04, 37.88it/s]
Loading 0: 44%|████▎ | 127/291 [00:03<00:04, 35.49it/s]
Loading 0: 46%|████▌ | 133/291 [00:04<00:05, 30.45it/s]
Loading 0: 47%|████▋ | 137/291 [00:04<00:05, 30.60it/s]
Loading 0: 48%|████▊ | 141/291 [00:04<00:05, 28.69it/s]
Loading 0: 51%|█████ | 147/291 [00:04<00:04, 33.42it/s]
Loading 0: 52%|█████▏ | 151/291 [00:04<00:04, 32.09it/s]
Loading 0: 54%|█████▎ | 156/291 [00:04<00:03, 34.52it/s]
Loading 0: 55%|█████▍ | 160/291 [00:04<00:03, 33.80it/s]
Loading 0: 57%|█████▋ | 165/291 [00:05<00:03, 36.71it/s]
Loading 0: 58%|█████▊ | 169/291 [00:05<00:03, 34.61it/s]
Loading 0: 60%|█████▉ | 174/291 [00:05<00:03, 36.42it/s]
Loading 0: 61%|██████ | 178/291 [00:05<00:03, 34.76it/s]
Loading 0: 63%|██████▎ | 184/291 [00:05<00:02, 40.17it/s]
Loading 0: 65%|██████▍ | 189/291 [00:05<00:04, 24.16it/s]
Loading 0: 67%|██████▋ | 194/291 [00:06<00:03, 26.17it/s]
Loading 0: 69%|██████▉ | 201/291 [00:06<00:02, 32.89it/s]
Loading 0: 70%|███████ | 205/291 [00:06<00:02, 32.68it/s]
Loading 0: 72%|███████▏ | 210/291 [00:06<00:02, 35.53it/s]
Loading 0: 74%|███████▎ | 214/291 [00:06<00:02, 34.57it/s]
Loading 0: 75%|███████▌ | 219/291 [00:06<00:01, 37.23it/s]
Loading 0: 77%|███████▋ | 223/291 [00:06<00:01, 35.29it/s]
Loading 0: 78%|███████▊ | 227/291 [00:06<00:01, 35.12it/s]
Loading 0: 79%|███████▉ | 231/291 [00:07<00:01, 34.79it/s]
Loading 0: 81%|████████ | 235/291 [00:07<00:02, 26.19it/s]
Loading 0: 82%|████████▏ | 239/291 [00:07<00:01, 26.17it/s]
Loading 0: 85%|████████▍ | 246/291 [00:07<00:01, 33.95it/s]
Loading 0: 86%|████████▌ | 250/291 [00:07<00:01, 33.36it/s]
Loading 0: 88%|████████▊ | 255/291 [00:07<00:01, 35.71it/s]
Loading 0: 89%|████████▉ | 259/291 [00:07<00:00, 33.97it/s]
Loading 0: 91%|█████████ | 264/291 [00:08<00:00, 36.21it/s]
Loading 0: 92%|█████████▏| 268/291 [00:08<00:00, 34.56it/s]
Loading 0: 94%|█████████▍| 273/291 [00:08<00:00, 36.93it/s]
Loading 0: 95%|█████████▌| 277/291 [00:08<00:00, 33.97it/s]
Loading 0: 97%|█████████▋| 281/291 [00:08<00:00, 34.15it/s]
Loading 0: 98%|█████████▊| 286/291 [00:14<00:01, 2.63it/s]
Loading 0: 99%|█████████▉| 289/291 [00:14<00:00, 3.28it/s]
Job trace2333-ultra1w-dol1w-6284-v5-mkmlizer completed after 118.95s with status: succeeded
Stopping job with name trace2333-ultra1w-dol1w-6284-v5-mkmlizer
Pipeline stage MKMLizer completed in 120.00s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.11s
Running pipeline stage ISVCDeployer
Creating inference service trace2333-ultra1w-dol1w-6284-v5
Waiting for inference service trace2333-ultra1w-dol1w-6284-v5 to be ready
Failed to get response for submission blend_jidor_2024-08-22: ('http://zonemercy-lexical-nemo-1518-v18-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'EOF\n')
Failed to get response for submission blend_jidor_2024-08-22: ('http://zonemercy-lexical-nemo-1518-v18-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'EOF\n')
Failed to get response for submission blend_berib_2024-08-16: ('http://zonemercy-graft-cogent-v-7573-v6-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'activator request timeout')
Failed to get response for submission blend_remul_2024-08-22: ('http://zonemercy-graft-cogent-v-7573-v6-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'EOF\n')
Failed to get response for submission blend_susol_2024-08-22: ('http://zonemercy-lexical-nemo-1518-v18-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'EOF\n')
Failed to get response for submission blend_fedek_2024-08-24: ('http://zonemercy-lexical-nemov8-5966-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission blend_susol_2024-08-22: ('http://zonemercy-lexical-nemo-1518-v18-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'EOF\n')
Failed to get response for submission blend_dedat_2024-08-16: ('http://zonemercy-graft-cogent-v-7573-v6-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'EOF\n')
Inference service trace2333-ultra1w-dol1w-6284-v5 ready after 303.6803517341614s
Pipeline stage ISVCDeployer completed in 305.15s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.131520986557007s
Received healthy response to inference request in 2.6886305809020996s
Received healthy response to inference request in 1.437669038772583s
Received healthy response to inference request in 1.3287513256072998s
Received healthy response to inference request in 6.492115020751953s
5 requests
0 failed requests
5th percentile: 1.3505348682403564
10th percentile: 1.372318410873413
20th percentile: 1.4158854961395264
30th percentile: 1.5764394283294678
40th percentile: 1.8539802074432374
50th percentile: 2.131520986557007
60th percentile: 2.354364824295044
70th percentile: 2.577208662033081
80th percentile: 3.449327468872071
90th percentile: 4.970721244812012
95th percentile: 5.7314181327819815
99th percentile: 6.339975643157959
mean time: 2.8157373905181884
Pipeline stage StressChecker completed in 14.87s
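
The StressChecker figures above can be reproduced from the five logged latencies. The log does not say how the stage computes them, so the following is only a minimal sketch under one assumption: that percentiles use linear interpolation between sorted samples (the default behaviour of `numpy.percentile`). The function names `percentile` and `summarize` are hypothetical and do not come from the pipeline.

```python
import math

# Latencies in seconds, copied from the five
# "Received healthy response to inference request" lines.
latencies = [
    2.131520986557007,
    2.6886305809020996,
    1.437669038772583,
    1.3287513256072998,
    6.492115020751953,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) of values.

    Assumption: linear interpolation between the two nearest ranks.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into ordered
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def summarize(values, quantiles=(5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99)):
    """Build a report with the same labels the log uses."""
    report = {f"{q}th percentile": percentile(values, q) for q in quantiles}
    report["mean time"] = sum(values) / len(values)
    return report


if __name__ == "__main__":
    for label, value in summarize(latencies).items():
        print(f"{label}: {value}")
```

With only five samples, every percentile above the 75th is interpolated towards the single 6.49s outlier, so the upper percentiles and the mean mostly reflect that one request rather than typical latency.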
trace2333-ultra1w-dol1w-_6284_v5 status is now deployed due to DeploymentManager action
trace2333-ultra1w-dol1w-_6284_v5 status is now inactive due to auto deactivation removed underperforming models
trace2333-ultra1w-dol1w-_6284_v5 status is now torndown due to DeploymentManager action