Running pipeline stage MKMLizer
Starting job with name trace2333-ultra4w-dol4w-2453-v1-mkmlizer
Waiting for job on trace2333-ultra4w-dol4w-2453-v1-mkmlizer to finish
Stopping job with name trace2333-ultra4w-dol4w-2453-v1-mkmlizer
%s, retrying in %s seconds...
Starting job with name trace2333-ultra4w-dol4w-2453-v1-mkmlizer
Waiting for job on trace2333-ultra4w-dol4w-2453-v1-mkmlizer to finish
Connection pool is full, discarding connection: %s. Connection pool size: %s
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: ║ _____ __ __ ║
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: ║ /___/ ║
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: ║ ║
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: ║ Version: 0.10.1 ║
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: ║ https://mk1.ai ║
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: ║ ║
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: ║ The license key for the current software has been verified as ║
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: ║ belonging to: ║
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: ║ ║
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: ║ Chai Research Corp. ║
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: ║ ║
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Connection pool is full, discarding connection: %s. Connection pool size: %s
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: Downloaded to shared memory in 91.676s
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpeinlrgmj, device:0
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: quantized model in 28.806s
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: Processed model Trace2333/ultra4w_dol4w_fd5w_r32a16_qkvo_epoch3_v3 in 120.482s
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: creating bucket guanaco-mkml-models
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/trace2333-ultra4w-dol4w-2453-v1
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/trace2333-ultra4w-dol4w-2453-v1/config.json
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/trace2333-ultra4w-dol4w-2453-v1/special_tokens_map.json
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/trace2333-ultra4w-dol4w-2453-v1/tokenizer_config.json
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/trace2333-ultra4w-dol4w-2453-v1/tokenizer.json
trace2333-ultra4w-dol4w-2453-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/trace2333-ultra4w-dol4w-2453-v1/flywheel_model.0.safetensors
trace2333-ultra4w-dol4w-2453-v1-mkmlizer:
Loading 0: 0%| | 0/291 [00:00<?, ?it/s]
Loading 0: 2%|▏ | 5/291 [00:00<00:10, 26.54it/s]
Loading 0: 4%|▍ | 12/291 [00:00<00:07, 36.50it/s]
Loading 0: 5%|▌ | 16/291 [00:00<00:08, 33.98it/s]
Loading 0: 7%|▋ | 21/291 [00:00<00:07, 37.00it/s]
Loading 0: 9%|▊ | 25/291 [00:00<00:07, 34.40it/s]
Loading 0: 10%|█ | 30/291 [00:00<00:06, 38.13it/s]
Loading 0: 12%|█▏ | 34/291 [00:01<00:10, 24.88it/s]
Loading 0: 13%|█▎ | 38/291 [00:01<00:09, 26.49it/s]
Loading 0: 14%|█▍ | 42/291 [00:01<00:09, 25.50it/s]
Loading 0: 16%|█▋ | 48/291 [00:01<00:07, 30.95it/s]
Loading 0: 18%|█▊ | 52/291 [00:01<00:07, 30.93it/s]
Loading 0: 20%|█▉ | 57/291 [00:01<00:06, 33.66it/s]
Loading 0: 21%|██ | 61/291 [00:01<00:06, 33.10it/s]
Loading 0: 23%|██▎ | 66/291 [00:02<00:06, 36.11it/s]
Loading 0: 24%|██▍ | 70/291 [00:02<00:06, 34.55it/s]
Loading 0: 25%|██▌ | 74/291 [00:02<00:06, 34.82it/s]
Loading 0: 27%|██▋ | 78/291 [00:02<00:06, 34.88it/s]
Loading 0: 28%|██▊ | 82/291 [00:02<00:08, 23.77it/s]
Loading 0: 30%|██▉ | 86/291 [00:02<00:07, 26.57it/s]
Loading 0: 31%|███ | 90/291 [00:02<00:07, 28.30it/s]
Loading 0: 32%|███▏ | 94/291 [00:03<00:06, 28.76it/s]
Loading 0: 34%|███▍ | 99/291 [00:03<00:05, 32.50it/s]
Loading 0: 35%|███▌ | 103/291 [00:03<00:05, 31.85it/s]
Loading 0: 37%|███▋ | 108/291 [00:03<00:05, 34.77it/s]
Loading 0: 38%|███▊ | 112/291 [00:03<00:05, 33.51it/s]
Loading 0: 40%|███▉ | 116/291 [00:03<00:05, 33.87it/s]
Loading 0: 42%|████▏ | 122/291 [00:03<00:04, 38.43it/s]
Loading 0: 44%|████▎ | 127/291 [00:03<00:04, 36.44it/s]
Loading 0: 46%|████▌ | 133/291 [00:04<00:05, 31.10it/s]
Loading 0: 47%|████▋ | 137/291 [00:04<00:04, 31.19it/s]
Loading 0: 48%|████▊ | 141/291 [00:04<00:05, 28.87it/s]
Loading 0: 51%|█████ | 147/291 [00:04<00:04, 33.25it/s]
Loading 0: 52%|█████▏ | 151/291 [00:04<00:04, 32.00it/s]
Loading 0: 54%|█████▎ | 156/291 [00:04<00:03, 33.84it/s]
Loading 0: 55%|█████▍ | 160/291 [00:05<00:03, 32.96it/s]
Loading 0: 57%|█████▋ | 165/291 [00:05<00:03, 35.87it/s]
Loading 0: 58%|█████▊ | 169/291 [00:05<00:03, 34.49it/s]
Loading 0: 60%|█████▉ | 174/291 [00:05<00:03, 36.64it/s]
Loading 0: 61%|██████ | 178/291 [00:05<00:03, 34.85it/s]
Loading 0: 63%|██████▎ | 184/291 [00:05<00:02, 40.49it/s]
Loading 0: 65%|██████▍ | 189/291 [00:06<00:04, 24.34it/s]
Loading 0: 67%|██████▋ | 194/291 [00:06<00:03, 25.75it/s]
Loading 0: 69%|██████▉ | 201/291 [00:06<00:02, 32.35it/s]
Loading 0: 70%|███████ | 205/291 [00:06<00:02, 31.92it/s]
Loading 0: 72%|███████▏ | 210/291 [00:06<00:02, 34.36it/s]
Loading 0: 74%|███████▎ | 214/291 [00:06<00:02, 33.20it/s]
Loading 0: 75%|███████▌ | 219/291 [00:06<00:02, 35.86it/s]
Loading 0: 77%|███████▋ | 223/291 [00:06<00:01, 34.23it/s]
Loading 0: 78%|███████▊ | 227/291 [00:07<00:01, 34.05it/s]
Loading 0: 79%|███████▉ | 231/291 [00:07<00:01, 34.13it/s]
Loading 0: 81%|████████ | 235/291 [00:07<00:02, 25.27it/s]
Loading 0: 82%|████████▏ | 239/291 [00:07<00:02, 25.14it/s]
Loading 0: 85%|████████▍ | 246/291 [00:07<00:01, 32.71it/s]
Loading 0: 86%|████████▌ | 250/291 [00:07<00:01, 32.30it/s]
Loading 0: 88%|████████▊ | 255/291 [00:07<00:01, 35.02it/s]
Loading 0: 89%|████████▉ | 259/291 [00:08<00:00, 33.94it/s]
Loading 0: 91%|█████████ | 264/291 [00:08<00:00, 36.57it/s]
Loading 0: 92%|█████████▏| 268/291 [00:08<00:00, 35.19it/s]
Loading 0: 94%|█████████▍| 273/291 [00:08<00:00, 37.44it/s]
Loading 0: 95%|█████████▌| 277/291 [00:08<00:00, 34.22it/s]
Loading 0: 97%|█████████▋| 281/291 [00:08<00:00, 34.33it/s]
Loading 0: 98%|█████████▊| 286/291 [00:14<00:01, 2.61it/s]
Loading 0: 99%|█████████▉| 289/291 [00:14<00:00, 3.25it/s]
Job trace2333-ultra4w-dol4w-2453-v1-mkmlizer completed after 147.36s with status: succeeded
Stopping job with name trace2333-ultra4w-dol4w-2453-v1-mkmlizer
Pipeline stage MKMLizer completed in 149.74s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.10s
Running pipeline stage ISVCDeployer
Creating inference service trace2333-ultra4w-dol4w-2453-v1
Waiting for inference service trace2333-ultra4w-dol4w-2453-v1 to be ready
Connection pool is full, discarding connection: %s. Connection pool size: %s
Inference service trace2333-ultra4w-dol4w-2453-v1 ready after 170.6598093509674s
Pipeline stage ISVCDeployer completed in 174.26s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.0815622806549072s
Received healthy response to inference request in 3.149104118347168s
Received healthy response to inference request in 2.409116744995117s
Received healthy response to inference request in 1.7414839267730713s
Received healthy response to inference request in 1.5126276016235352s
5 requests
0 failed requests
5th percentile: 1.5583988666534423
10th percentile: 1.6041701316833497
20th percentile: 1.6957126617431642
30th percentile: 1.8094995975494386
40th percentile: 1.945530939102173
50th percentile: 2.0815622806549072
60th percentile: 2.212584066390991
70th percentile: 2.3436058521270753
80th percentile: 2.5571142196655274
90th percentile: 2.8531091690063475
95th percentile: 3.0011066436767577
99th percentile: 3.119504623413086
mean time: 2.1787789344787596
Pipeline stage StressChecker completed in 12.71s
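Note on the StressChecker summary: the percentile and mean figures reported for this stage can be reproduced from the five logged response times by linear interpolation between sorted samples. The sketch below is an assumption about how the numbers were derived, not the StressChecker implementation; the `percentile` helper and the `latencies` list are hypothetical names introduced for illustration.

```python
import math

# Response times in seconds, copied from the five healthy responses in this log.
latencies = [
    2.0815622806549072,
    3.149104118347168,
    2.409116744995117,
    1.7414839267730713,
    1.5126276016235352,
]


def percentile(samples, p):
    """Return the p-th percentile (0-100) by linear interpolation.

    Hypothetical helper: the rank is (n - 1) * p / 100 over the sorted
    samples, and the result is interpolated between the two nearest values.
    """
    ordered = sorted(samples)
    rank = (len(ordered) - 1) * p / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


for p in (5, 50, 95, 99):
    print(f"{p}th percentile: {percentile(latencies, p)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples, every percentile above the 75th is interpolated between the two slowest requests, so the 95th and 99th percentile values mostly reflect the single 3.149 s response.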
trace2333-ultra4w-dol4w-_2453_v1 status is now deployed due to DeploymentManager action
trace2333-ultra4w-dol4w-_2453_v1 status is now inactive due to auto deactivation removed underperforming models
trace2333-ultra4w-dol4w-_2453_v1 status is now torndown due to DeploymentManager action