run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name bbchicago-brt-v1-15-with-6838-v1-mkmlizer
Waiting for job on bbchicago-brt-v1-15-with-6838-v1-mkmlizer to finish
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: ║ ║
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: ║ Version: 0.10.1 ║
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: ║ https://mk1.ai ║
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: ║ ║
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: ║ The license key for the current software has been verified as ║
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: ║ belonging to: ║
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: ║ ║
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: ║ Chai Research Corp. ║
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: ║ ║
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: Downloaded to shared memory in 34.965s
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpzat99z69, device:0
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: quantized model in 26.031s
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: Processed model BBChicago/Brt_v1.15_with_113k_DPO_s3000 in 60.996s
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: creating bucket guanaco-mkml-models
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/bbchicago-brt-v1-15-with-6838-v1
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/bbchicago-brt-v1-15-with-6838-v1/config.json
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/bbchicago-brt-v1-15-with-6838-v1/special_tokens_map.json
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/bbchicago-brt-v1-15-with-6838-v1/tokenizer_config.json
bbchicago-brt-v1-15-with-6838-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/bbchicago-brt-v1-15-with-6838-v1/tokenizer.json
bbchicago-brt-v1-15-with-6838-v1-mkmlizer:
Loading 0: 99%|█████████▉| 288/291 [00:11<00:00, 3.13it/s]
Job bbchicago-brt-v1-15-with-6838-v1-mkmlizer completed after 85.41s with status: succeeded
Stopping job with name bbchicago-brt-v1-15-with-6838-v1-mkmlizer
Pipeline stage MKMLizer completed in 86.82s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.28s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service bbchicago-brt-v1-15-with-6838-v1
Waiting for inference service bbchicago-brt-v1-15-with-6838-v1 to be ready
Inference service bbchicago-brt-v1-15-with-6838-v1 ready after 140.83861684799194s
Pipeline stage MKMLDeployer completed in 141.20s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.118959903717041s
Received healthy response to inference request in 1.8467707633972168s
Received healthy response to inference request in 1.731069564819336s
Received healthy response to inference request in 1.3939754962921143s
Received healthy response to inference request in 1.7228736877441406s
5 requests
0 failed requests
5th percentile: 1.4597551345825195
10th percentile: 1.5255347728729247
20th percentile: 1.6570940494537354
30th percentile: 1.7245128631591797
40th percentile: 1.7277912139892577
50th percentile: 1.731069564819336
60th percentile: 1.7773500442504884
70th percentile: 1.8236305236816406
80th percentile: 2.101208591461182
90th percentile: 2.6100842475891115
95th percentile: 2.864522075653076
99th percentile: 3.068072338104248
mean time: 1.9627298831939697
Pipeline stage StressChecker completed in 11.60s
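Note: the StressChecker summary above can be reproduced from the five logged response times. The sketch below assumes the percentiles use linear interpolation between sorted samples (the same convention as NumPy's default); the log does not state how StressChecker actually computes them, and the names `latencies`, `percentile`, and `summarize` are illustrative only.

```python
# Response times in seconds, copied from the five
# "Received healthy response to inference request" lines in this log.
latencies = [
    3.118959903717041,
    1.8467707633972168,
    1.731069564819336,
    1.3939754962921143,
    1.7228736877441406,
]


def percentile(values, q):
    """Return the q-th percentile using linear interpolation.

    Assumption: this interpolation rule is a guess at what the
    pipeline does; it happens to match the logged figures.
    """
    ordered = sorted(values)
    position = (len(ordered) - 1) * q / 100.0  # fractional index
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = position - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def summarize(values):
    """Build the same set of statistics the log reports."""
    stats = {q: percentile(values, q)
             for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99)}
    stats["mean"] = sum(values) / len(values)
    return stats


if __name__ == "__main__":
    for key, value in summarize(latencies).items():
        label = f"{key}th percentile" if key != "mean" else "mean time"
        print(f"{label}: {value}")
```

With only five samples, every percentile above the 75th is interpolated between the two slowest requests, so the single 3.12 s response accounts for the high tail values.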
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
starting trigger_guanaco_pipeline %s
Pipeline stage TriggerMKMLProfilingPipeline completed in 5.08s
bbchicago-brt-v1-15-with_6838_v1 status is now deployed due to DeploymentManager action
registered shutdown handler
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Skipping teardown as no inference service was successfully deployed
Pipeline stage MKMLProfilerDeleter completed in 0.10s
run pipeline stage %s
Running pipeline stage MKMLProfilerTemplater
Pipeline stage MKMLProfilerTemplater completed in 0.10s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeployer
Creating inference service bbchicago-brt-v1-15-with-6838-v1-profiler
Waiting for inference service bbchicago-brt-v1-15-with-6838-v1-profiler to be ready
Inference service bbchicago-brt-v1-15-with-6838-v1-profiler ready after 140.32835483551025s
Pipeline stage MKMLProfilerDeployer completed in 140.71s
run pipeline stage %s
Running pipeline stage MKMLProfilerRunner
kubectl cp /code/guanaco/guanaco_inference_services/src/inference_scripts tenant-chaiml-guanaco/bbchicago-brt-v1-15-0a1f620d6a2c31490bbb8c7e62087b58-deplof8j4j:/code/chaiverse_profiler_1725427131 --namespace tenant-chaiml-guanaco
kubectl exec -it bbchicago-brt-v1-15-0a1f620d6a2c31490bbb8c7e62087b58-deplof8j4j --namespace tenant-chaiml-guanaco -- sh -c 'cd /code/chaiverse_profiler_1725427131 && python profiles.py profile --best_of_n 16 --auto_batch 5 --batches 1,5,10,15,20,25,30,35,40,45,50,55,60,65,70,75,80,85,90,95,100,105,110,115,120,125,130,135,140,145,150,155,160,165,170,175,180,185,190,195 --samples 200 --input_tokens 512 --output_tokens 64 --summary /code/chaiverse_profiler_1725427131/summary.json'
Received SIGINT, running shutdown handler
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Checking if service bbchicago-brt-v1-15-with-6838-v1-profiler is running
Tearing down inference service bbchicago-brt-v1-15-with-6838-v1-profiler
Service bbchicago-brt-v1-15-with-6838-v1-profiler has been torndown
Pipeline stage MKMLProfilerDeleter completed in 1.83s
de-registered shutdown handler
bbchicago-brt-v1-15-with_6838_v1 status is now inactive due to auto deactivation removed underperforming models
bbchicago-brt-v1-15-with_6838_v1 status is now torndown due to DeploymentManager action