Running pipeline stage MKMLizer
Starting job with name sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer
Waiting for job on sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer to finish
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: ║ _____ __ __ ║
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: ║ /___/ ║
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: ║ ║
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: ║ Version: 0.9.11 ║
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: ║ https://mk1.ai ║
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: ║ ║
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: ║ The license key for the current software has been verified as ║
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: ║ belonging to: ║
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: ║ ║
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: ║ Chai Research Corp. ║
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: ║ ║
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: Downloaded to shared memory in 29.700s
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmplocj0oge, device:0
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: quantized model in 25.852s
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: Processed model Sao10K/L3.1-8B-Stheno-v3.4.1 in 55.553s
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: creating bucket guanaco-mkml-models
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/sao10k-l3-1-8b-stheno-v3-4-1-v1/config.json
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/sao10k-l3-1-8b-stheno-v3-4-1-v1/special_tokens_map.json
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/sao10k-l3-1-8b-stheno-v3-4-1-v1/tokenizer_config.json
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/sao10k-l3-1-8b-stheno-v3-4-1-v1/tokenizer.json
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/sao10k-l3-1-8b-stheno-v3-4-1-v1/flywheel_model.0.safetensors
sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer:
Loading 0: 0%| | 0/291 [00:00<?, ?it/s]
Loading 0: 98%|█████████▊| 286/291 [00:11<00:00, 45.07it/s]
Job sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer completed after 74.07s with status: succeeded
Stopping job with name sao10k-l3-1-8b-stheno-v3-4-1-v1-mkmlizer
Pipeline stage MKMLizer completed in 74.98s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.11s
Running pipeline stage ISVCDeployer
Creating inference service sao10k-l3-1-8b-stheno-v3-4-1-v1
Waiting for inference service sao10k-l3-1-8b-stheno-v3-4-1-v1 to be ready
Inference service sao10k-l3-1-8b-stheno-v3-4-1-v1 ready after 251.7501573562622s
Pipeline stage ISVCDeployer completed in 252.53s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.1029586791992188s
Received healthy response to inference request in 2.3346664905548096s
Received healthy response to inference request in 1.4554905891418457s
Received healthy response to inference request in 1.388002634048462s
Received healthy response to inference request in 1.8436658382415771s
5 requests
0 failed requests
5th percentile: 1.4015002250671387
10th percentile: 1.4149978160858154
20th percentile: 1.441992998123169
30th percentile: 1.533125638961792
40th percentile: 1.6883957386016846
50th percentile: 1.8436658382415771
60th percentile: 1.9473829746246338
70th percentile: 2.0511001110076905
80th percentile: 2.149300241470337
90th percentile: 2.2419833660125734
95th percentile: 2.2883249282836915
99th percentile: 2.325398178100586
mean time: 1.8249568462371826
Pipeline stage StressChecker completed in 9.77s
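Note on the StressChecker statistics: the percentile values above are consistent with linear interpolation between the five sorted response times, which is the default method of `numpy.percentile`. That is an inference from the numbers, not something the log states. The sketch below reproduces the figures under that assumption. The `percentile` helper and the `latencies` list are illustrative names and are not part of the pipeline.

```python
from statistics import fmean


def percentile(samples, q):
    """Return the q-th percentile (0 <= q <= 100) using linear interpolation.

    Hypothetical helper: assumes the same interpolation rule as the default
    of numpy.percentile; the pipeline's actual implementation is not shown
    in the log.
    """
    ordered = sorted(samples)
    # Fractional position of the requested percentile in the sorted data.
    pos = (len(ordered) - 1) * q / 100.0
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    frac = pos - lower
    # Interpolate between the two neighbouring observations.
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


# The five healthy response times (seconds) reported by StressChecker.
latencies = [
    2.1029586791992188,
    2.3346664905548096,
    1.4554905891418457,
    1.388002634048462,
    1.8436658382415771,
]

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {fmean(latencies)}")
```

With only five samples, every percentile above the 75th is interpolated between the two slowest requests, so the tail figures say little about real tail latency.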
sao10k-l3-1-8b-stheno-v3-4-1_v1 status is now deployed due to DeploymentManager action
sao10k-l3-1-8b-stheno-v3-4-1_v1 status is now inactive due to auto deactivation removed underperforming models
sao10k-l3-1-8b-stheno-v3-4-1_v1 status is now torndown due to DeploymentManager action