Running pipeline stage MKMLizer
Starting job with name trace2333-ultra4w-dol4w-2313-v1-mkmlizer
Waiting for job on trace2333-ultra4w-dol4w-2313-v1-mkmlizer to finish
Stopping job with name trace2333-ultra4w-dol4w-2313-v1-mkmlizer
%s, retrying in %s seconds...
Starting job with name trace2333-ultra4w-dol4w-2313-v1-mkmlizer
Waiting for job on trace2333-ultra4w-dol4w-2313-v1-mkmlizer to finish
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: ║ _____ __ __ ║
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: ║ /___/ ║
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: ║ ║
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: ║ Version: 0.10.1 ║
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: ║ https://mk1.ai ║
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: ║ ║
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: ║ The license key for the current software has been verified as ║
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: ║ belonging to: ║
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: ║ ║
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: ║ Chai Research Corp. ║
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: ║ ║
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: Downloaded to shared memory in 70.016s
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp84rhmhaq, device:0
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Connection pool is full, discarding connection: %s. Connection pool size: %s
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: quantized model in 29.195s
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: Processed model Trace2333/ultra4w_dol4w_fd5w_r32a16_qkvo_epoch3_v1 in 99.211s
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/trace2333-ultra4w-dol4w-2313-v1
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/trace2333-ultra4w-dol4w-2313-v1/config.json
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/trace2333-ultra4w-dol4w-2313-v1/special_tokens_map.json
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/trace2333-ultra4w-dol4w-2313-v1/tokenizer_config.json
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/trace2333-ultra4w-dol4w-2313-v1/tokenizer.json
trace2333-ultra4w-dol4w-2313-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/trace2333-ultra4w-dol4w-2313-v1/flywheel_model.0.safetensors
trace2333-ultra4w-dol4w-2313-v1-mkmlizer:
Loading 0: 0%| | 0/291 [00:00<?, ?it/s]
Loading 0: 2%|▏ | 5/291 [00:00<00:11, 25.80it/s]
Loading 0: 4%|▍ | 12/291 [00:00<00:07, 34.96it/s]
Loading 0: 5%|▌ | 16/291 [00:00<00:08, 32.18it/s]
Loading 0: 7%|▋ | 21/291 [00:00<00:07, 34.89it/s]
Loading 0: 9%|▊ | 25/291 [00:00<00:08, 32.73it/s]
Loading 0: 10%|█ | 30/291 [00:00<00:07, 37.02it/s]
Loading 0: 12%|█▏ | 34/291 [00:01<00:10, 24.10it/s]
Loading 0: 13%|█▎ | 38/291 [00:01<00:09, 25.61it/s]
Loading 0: 14%|█▍ | 42/291 [00:01<00:10, 24.50it/s]
Loading 0: 16%|█▋ | 48/291 [00:01<00:08, 29.80it/s]
Loading 0: 18%|█▊ | 52/291 [00:01<00:08, 29.43it/s]
Loading 0: 20%|█▉ | 57/291 [00:01<00:07, 31.78it/s]
Loading 0: 21%|██ | 61/291 [00:02<00:07, 31.13it/s]
Loading 0: 23%|██▎ | 66/291 [00:02<00:06, 34.09it/s]
Loading 0: 24%|██▍ | 70/291 [00:02<00:06, 32.63it/s]
Loading 0: 25%|██▌ | 74/291 [00:02<00:06, 32.67it/s]
Loading 0: 27%|██▋ | 78/291 [00:02<00:06, 33.25it/s]
Loading 0: 28%|██▊ | 82/291 [00:02<00:09, 22.99it/s]
Loading 0: 29%|██▉ | 85/291 [00:02<00:08, 23.76it/s]
Loading 0: 31%|███ | 90/291 [00:03<00:07, 27.67it/s]
Loading 0: 32%|███▏ | 94/291 [00:03<00:07, 27.47it/s]
Loading 0: 34%|███▍ | 99/291 [00:03<00:06, 31.05it/s]
Loading 0: 35%|███▌ | 103/291 [00:03<00:06, 30.87it/s]
Loading 0: 37%|███▋ | 108/291 [00:03<00:05, 33.45it/s]
Loading 0: 38%|███▊ | 112/291 [00:03<00:05, 32.29it/s]
Loading 0: 40%|███▉ | 116/291 [00:03<00:05, 32.64it/s]
Loading 0: 42%|████▏ | 122/291 [00:03<00:04, 36.60it/s]
Loading 0: 44%|████▎ | 127/291 [00:04<00:04, 34.79it/s]
Loading 0: 46%|████▌ | 133/291 [00:04<00:05, 29.04it/s]
Loading 0: 47%|████▋ | 137/291 [00:04<00:05, 29.13it/s]
Loading 0: 48%|████▊ | 141/291 [00:04<00:05, 27.62it/s]
Loading 0: 50%|████▉ | 145/291 [00:04<00:04, 30.07it/s]
Loading 0: 51%|█████ | 149/291 [00:04<00:05, 28.32it/s]
Loading 0: 54%|█████▎ | 156/291 [00:05<00:03, 35.27it/s]
Loading 0: 55%|█████▍ | 160/291 [00:05<00:03, 33.99it/s]
Loading 0: 57%|█████▋ | 165/291 [00:05<00:03, 35.85it/s]
Loading 0: 58%|█████▊ | 169/291 [00:05<00:03, 34.48it/s]
Loading 0: 60%|█████▉ | 174/291 [00:05<00:03, 36.61it/s]
Loading 0: 61%|██████ | 178/291 [00:05<00:03, 34.37it/s]
Loading 0: 63%|██████▎ | 183/291 [00:05<00:02, 37.53it/s]
Loading 0: 64%|██████▍ | 187/291 [00:06<00:03, 26.12it/s]
Loading 0: 66%|██████▌ | 191/291 [00:06<00:03, 27.19it/s]
Loading 0: 67%|██████▋ | 195/291 [00:06<00:03, 25.83it/s]
Loading 0: 69%|██████▉ | 201/291 [00:06<00:02, 31.21it/s]
Loading 0: 70%|███████ | 205/291 [00:06<00:02, 31.14it/s]
Loading 0: 72%|███████▏ | 210/291 [00:06<00:02, 34.32it/s]
Loading 0: 74%|███████▎ | 214/291 [00:06<00:02, 33.55it/s]
Loading 0: 75%|███████▌ | 219/291 [00:07<00:01, 36.58it/s]
Loading 0: 77%|███████▋ | 223/291 [00:07<00:02, 33.54it/s]
Loading 0: 78%|███████▊ | 227/291 [00:07<00:01, 33.88it/s]
Loading 0: 79%|███████▉ | 231/291 [00:07<00:01, 34.04it/s]
Loading 0: 81%|████████ | 235/291 [00:07<00:02, 25.10it/s]
Loading 0: 82%|████████▏ | 239/291 [00:07<00:02, 25.16it/s]
Loading 0: 85%|████████▍ | 246/291 [00:07<00:01, 32.43it/s]
Loading 0: 86%|████████▌ | 250/291 [00:08<00:01, 32.21it/s]
Loading 0: 88%|████████▊ | 255/291 [00:08<00:01, 35.33it/s]
Loading 0: 89%|████████▉ | 259/291 [00:08<00:00, 34.32it/s]
Loading 0: 91%|█████████ | 264/291 [00:08<00:00, 36.72it/s]
Loading 0: 92%|█████████▏| 268/291 [00:08<00:00, 35.04it/s]
Loading 0: 94%|█████████▍| 273/291 [00:08<00:00, 37.71it/s]
Loading 0: 95%|█████████▌| 277/291 [00:08<00:00, 35.91it/s]
Loading 0: 97%|█████████▋| 281/291 [00:08<00:00, 35.56it/s]
Loading 0: 98%|█████████▊| 286/291 [00:14<00:01, 2.62it/s]
Loading 0: 99%|█████████▉| 289/291 [00:14<00:00, 3.26it/s]
Job trace2333-ultra4w-dol4w-2313-v1-mkmlizer completed after 117.78s with status: succeeded
Stopping job with name trace2333-ultra4w-dol4w-2313-v1-mkmlizer
Pipeline stage MKMLizer completed in 119.60s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.37s
Running pipeline stage ISVCDeployer
Creating inference service trace2333-ultra4w-dol4w-2313-v1
Waiting for inference service trace2333-ultra4w-dol4w-2313-v1 to be ready
Failed to get response for submission blend_jerun_2024-08-22: ('http://neversleep-noromaid-v0-8068-v150-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '{"error":"ValueError : [TypeError(\\"\'numpy.int64\' object is not iterable\\"), TypeError(\'vars() argument must have __dict__ attribute\')]"}')
Inference service trace2333-ultra4w-dol4w-2313-v1 ready after 171.5172836780548s
Pipeline stage ISVCDeployer completed in 172.33s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.173095703125s
Received healthy response to inference request in 1.4886798858642578s
Received healthy response to inference request in 1.9198296070098877s
Received healthy response to inference request in 1.4931299686431885s
Received healthy response to inference request in 1.7080259323120117s
5 requests
0 failed requests
5th percentile: 1.489569902420044
10th percentile: 1.49045991897583
20th percentile: 1.4922399520874023
30th percentile: 1.536109161376953
40th percentile: 1.6220675468444825
50th percentile: 1.7080259323120117
60th percentile: 1.792747402191162
70th percentile: 1.8774688720703125
80th percentile: 1.9704828262329102
90th percentile: 2.071789264678955
95th percentile: 2.1224424839019775
99th percentile: 2.1629650592803955
mean time: 1.7565522193908691
Pipeline stage StressChecker completed in 9.56s
trace2333-ultra4w-dol4w-_2313_v1 status is now deployed due to DeploymentManager action
trace2333-ultra4w-dol4w-_2313_v1 status is now inactive due to auto deactivation removed underperforming models
trace2333-ultra4w-dol4w-_2313_v1 status is now torndown due to DeploymentManager action