Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name cycy233-modelv-fusion-s-10816-v1-mkmlizer
Waiting for job on cycy233-modelv-fusion-s-10816-v1-mkmlizer to finish
cycy233-modelv-fusion-s-10816-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
cycy233-modelv-fusion-s-10816-v1-mkmlizer: ║ _____ __ __ ║
cycy233-modelv-fusion-s-10816-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
cycy233-modelv-fusion-s-10816-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
cycy233-modelv-fusion-s-10816-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
cycy233-modelv-fusion-s-10816-v1-mkmlizer: ║ /___/ ║
cycy233-modelv-fusion-s-10816-v1-mkmlizer: ║ ║
cycy233-modelv-fusion-s-10816-v1-mkmlizer: ║ Version: 0.12.8 ║
cycy233-modelv-fusion-s-10816-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
cycy233-modelv-fusion-s-10816-v1-mkmlizer: ║ https://mk1.ai ║
cycy233-modelv-fusion-s-10816-v1-mkmlizer: ║ ║
cycy233-modelv-fusion-s-10816-v1-mkmlizer: ║ The license key for the current software has been verified as ║
cycy233-modelv-fusion-s-10816-v1-mkmlizer: ║ belonging to: ║
cycy233-modelv-fusion-s-10816-v1-mkmlizer: ║ ║
cycy233-modelv-fusion-s-10816-v1-mkmlizer: ║ Chai Research Corp. ║
cycy233-modelv-fusion-s-10816-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
cycy233-modelv-fusion-s-10816-v1-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
cycy233-modelv-fusion-s-10816-v1-mkmlizer: ║ ║
cycy233-modelv-fusion-s-10816-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
cycy233-modelv-fusion-s-10816-v1-mkmlizer: Downloaded to shared memory in 38.566s
cycy233-modelv-fusion-s-10816-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmppvuzcrnj, device:0
cycy233-modelv-fusion-s-10816-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
cycy233-modelv-fusion-s-10816-v1-mkmlizer: quantized model in 37.739s
cycy233-modelv-fusion-s-10816-v1-mkmlizer: Processed model cycy233/modelV_fusion_step9000 in 76.305s
cycy233-modelv-fusion-s-10816-v1-mkmlizer: creating bucket guanaco-mkml-models
cycy233-modelv-fusion-s-10816-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
cycy233-modelv-fusion-s-10816-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/cycy233-modelv-fusion-s-10816-v1
cycy233-modelv-fusion-s-10816-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/cycy233-modelv-fusion-s-10816-v1/special_tokens_map.json
cycy233-modelv-fusion-s-10816-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/cycy233-modelv-fusion-s-10816-v1/config.json
cycy233-modelv-fusion-s-10816-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/cycy233-modelv-fusion-s-10816-v1/tokenizer_config.json
cycy233-modelv-fusion-s-10816-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/cycy233-modelv-fusion-s-10816-v1/tokenizer.json
cycy233-modelv-fusion-s-10816-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/cycy233-modelv-fusion-s-10816-v1/flywheel_model.0.safetensors
cycy233-modelv-fusion-s-10816-v1-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%|▏ | 5/363 [00:00<00:11, 30.21it/s]
Loading 0: 4%|▎ | 13/363 [00:00<00:06, 50.59it/s]
Loading 0: 5%|▌ | 19/363 [00:00<00:07, 45.96it/s]
Loading 0: 7%|▋ | 24/363 [00:00<00:07, 44.08it/s]
Loading 0: 9%|▊ | 31/363 [00:00<00:06, 49.81it/s]
Loading 0: 10%|█ | 37/363 [00:00<00:07, 45.52it/s]
Loading 0: 12%|█▏ | 42/363 [00:00<00:07, 43.80it/s]
Loading 0: 13%|█▎ | 49/363 [00:01<00:06, 49.26it/s]
Loading 0: 15%|█▌ | 55/363 [00:01<00:07, 43.92it/s]
Loading 0: 17%|█▋ | 60/363 [00:01<00:06, 44.11it/s]
Loading 0: 18%|█▊ | 65/363 [00:01<00:10, 28.84it/s]
Loading 0: 20%|█▉ | 71/363 [00:01<00:08, 34.26it/s]
Loading 0: 21%|██ | 76/363 [00:01<00:08, 35.22it/s]
Loading 0: 22%|██▏ | 81/363 [00:02<00:07, 37.45it/s]
Loading 0: 24%|██▎ | 86/363 [00:02<00:06, 39.77it/s]
Loading 0: 25%|██▌ | 91/363 [00:02<00:08, 32.50it/s]
Loading 0: 27%|██▋ | 98/363 [00:02<00:06, 38.87it/s]
Loading 0: 28%|██▊ | 103/363 [00:02<00:06, 38.55it/s]
Loading 0: 30%|██▉ | 108/363 [00:02<00:06, 40.08it/s]
Loading 0: 31%|███ | 113/363 [00:02<00:07, 34.09it/s]
Loading 0: 33%|███▎ | 118/363 [00:03<00:07, 34.08it/s]
Loading 0: 34%|███▍ | 123/363 [00:03<00:06, 36.61it/s]
Loading 0: 35%|███▍ | 127/363 [00:03<00:07, 32.84it/s]
Loading 0: 36%|███▋ | 132/363 [00:03<00:06, 36.60it/s]
Loading 0: 37%|███▋ | 136/363 [00:03<00:07, 32.35it/s]
Loading 0: 39%|███▉ | 141/363 [00:03<00:06, 36.04it/s]
Loading 0: 40%|███▉ | 145/363 [00:04<00:09, 24.08it/s]
Loading 0: 41%|████ | 149/363 [00:04<00:08, 24.39it/s]
Loading 0: 43%|████▎ | 156/363 [00:04<00:06, 31.60it/s]
Loading 0: 44%|████▍ | 160/363 [00:04<00:06, 31.77it/s]
Loading 0: 45%|████▌ | 165/363 [00:04<00:05, 34.55it/s]
Loading 0: 47%|████▋ | 169/363 [00:04<00:05, 33.71it/s]
Loading 0: 48%|████▊ | 174/363 [00:04<00:05, 35.52it/s]
Loading 0: 49%|████▉ | 178/363 [00:04<00:05, 34.01it/s]
Loading 0: 50%|█████ | 183/363 [00:05<00:05, 35.82it/s]
Loading 0: 52%|█████▏ | 187/363 [00:05<00:05, 34.55it/s]
Loading 0: 53%|█████▎ | 192/363 [00:05<00:04, 37.19it/s]
Loading 0: 54%|█████▍ | 196/363 [00:05<00:04, 35.20it/s]
Loading 0: 55%|█████▌ | 201/363 [00:05<00:04, 38.09it/s]
Loading 0: 56%|█████▋ | 205/363 [00:05<00:04, 36.42it/s]
Loading 0: 58%|█████▊ | 210/363 [00:05<00:03, 38.73it/s]
Loading 0: 59%|█████▉ | 214/363 [00:05<00:04, 36.07it/s]
Loading 0: 60%|██████ | 218/363 [00:06<00:04, 35.36it/s]
Loading 0: 61%|██████▏ | 223/363 [00:06<00:05, 26.83it/s]
Loading 0: 63%|██████▎ | 227/363 [00:06<00:04, 27.51it/s]
Loading 0: 64%|██████▎ | 231/363 [00:06<00:04, 27.61it/s]
Loading 0: 65%|██████▌ | 237/363 [00:06<00:03, 33.45it/s]
Loading 0: 66%|██████▋ | 241/363 [00:06<00:03, 33.66it/s]
Loading 0: 68%|██████▊ | 246/363 [00:06<00:03, 36.64it/s]
Loading 0: 69%|██████▉ | 250/363 [00:07<00:03, 34.84it/s]
Loading 0: 70%|███████ | 255/363 [00:07<00:02, 38.18it/s]
Loading 0: 71%|███████▏ | 259/363 [00:07<00:02, 38.01it/s]
Loading 0: 73%|███████▎ | 265/363 [00:07<00:02, 42.17it/s]
Loading 0: 75%|███████▍ | 271/363 [00:07<00:02, 42.16it/s]
Loading 0: 76%|███████▌ | 276/363 [00:07<00:02, 41.19it/s]
Loading 0: 78%|███████▊ | 283/363 [00:07<00:01, 46.24it/s]
Loading 0: 80%|███████▉ | 289/363 [00:07<00:01, 44.63it/s]
Loading 0: 81%|████████ | 294/363 [00:08<00:01, 41.98it/s]
Loading 0: 83%|████████▎ | 300/363 [00:08<00:01, 46.21it/s]
Loading 0: 84%|████████▍ | 305/363 [00:15<00:22, 2.53it/s]
Loading 0: 85%|████████▌ | 309/363 [00:15<00:16, 3.25it/s]
Loading 0: 86%|████████▌ | 313/363 [00:15<00:11, 4.20it/s]
Loading 0: 88%|████████▊ | 320/363 [00:15<00:06, 6.59it/s]
Loading 0: 90%|████████▉ | 325/363 [00:15<00:04, 8.72it/s]
Loading 0: 91%|█████████ | 330/363 [00:15<00:03, 10.78it/s]
Loading 0: 93%|█████████▎| 338/363 [00:15<00:01, 16.22it/s]
Loading 0: 95%|█████████▍| 344/363 [00:16<00:00, 19.53it/s]
Loading 0: 96%|█████████▌| 349/363 [00:16<00:00, 22.24it/s]
Loading 0: 98%|█████████▊| 356/363 [00:16<00:00, 28.30it/s]
Loading 0: 100%|█████████▉| 362/363 [00:16<00:00, 30.38it/s]
Job cycy233-modelv-fusion-s-10816-v1-mkmlizer completed after 104.27s with status: succeeded
Stopping job with name cycy233-modelv-fusion-s-10816-v1-mkmlizer
Pipeline stage MKMLizer completed in 105.51s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service cycy233-modelv-fusion-s-10816-v1
Waiting for inference service cycy233-modelv-fusion-s-10816-v1 to be ready
Inference service cycy233-modelv-fusion-s-10816-v1 ready after 120.505788564682s
Pipeline stage MKMLDeployer completed in 121.03s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.496598482131958s
Received healthy response to inference request in 1.3944745063781738s
Received healthy response to inference request in 1.639159917831421s
Received healthy response to inference request in 1.5794732570648193s
Received healthy response to inference request in 1.7887818813323975s
5 requests
0 failed requests
5th percentile: 1.431474256515503
10th percentile: 1.4684740066528321
20th percentile: 1.5424735069274902
30th percentile: 1.5914105892181396
40th percentile: 1.6152852535247804
50th percentile: 1.639159917831421
60th percentile: 1.6990087032318115
70th percentile: 1.7588574886322021
80th percentile: 1.9303452014923097
90th percentile: 2.213471841812134
95th percentile: 2.3550351619720455
99th percentile: 2.4682858180999756
mean time: 1.7796976089477539
Pipeline stage StressChecker completed in 10.16s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.73s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.71s
Shutdown handler de-registered
cycy233-modelv-fusion-s_10816_v1 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Skipping teardown as no inference service was successfully deployed
Pipeline stage MKMLProfilerDeleter completed in 0.11s
run pipeline stage %s
Running pipeline stage MKMLProfilerTemplater
Pipeline stage MKMLProfilerTemplater completed in 0.11s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeployer
Creating inference service cycy233-modelv-fusion-s-10816-v1-profiler
Waiting for inference service cycy233-modelv-fusion-s-10816-v1-profiler to be ready
Inference service cycy233-modelv-fusion-s-10816-v1-profiler ready after 110.63234281539917s
Pipeline stage MKMLProfilerDeployer completed in 111.17s
run pipeline stage %s
Running pipeline stage MKMLProfilerRunner
kubectl cp /code/guanaco/guanaco_inference_services/src/inference_scripts tenant-chaiml-guanaco/cycy233-modelv-fusio57bfa385b81b07386fdeae48389686a8-deplo4tvqg:/code/chaiverse_profiler_1746929293 --namespace tenant-chaiml-guanaco
kubectl exec -it cycy233-modelv-fusio57bfa385b81b07386fdeae48389686a8-deplo4tvqg --namespace tenant-chaiml-guanaco -- sh -c 'cd /code/chaiverse_profiler_1746929293 && python profiles.py profile --best_of_n 8 --auto_batch 5 --batches 1,5,10,15,20,25,30,35,40,45,50,55,60,65,70,75,80,85,90,95,100,105,110,115,120,125,130,135,140,145,150,155,160,165,170,175,180,185,190,195 --samples 200 --input_tokens 1024 --output_tokens 64 --summary /code/chaiverse_profiler_1746929293/summary.json'
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2527.25s
Shutdown handler de-registered
cycy233-modelv-fusion-s_10816_v1 status is now inactive due to auto deactivation removed underperforming models
cycy233-modelv-fusion-s_10816_v1 status is now torndown due to DeploymentManager action