Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name riverise-alighment-0906-v1-mkmlizer
Waiting for job on riverise-alighment-0906-v1-mkmlizer to finish
riverise-alighment-0906-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
riverise-alighment-0906-v1-mkmlizer: ║ _____ __ __ ║
riverise-alighment-0906-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
riverise-alighment-0906-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
riverise-alighment-0906-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
riverise-alighment-0906-v1-mkmlizer: ║ /___/ ║
riverise-alighment-0906-v1-mkmlizer: ║ ║
riverise-alighment-0906-v1-mkmlizer: ║ Version: 0.10.1 ║
riverise-alighment-0906-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
riverise-alighment-0906-v1-mkmlizer: ║ https://mk1.ai ║
riverise-alighment-0906-v1-mkmlizer: ║ ║
riverise-alighment-0906-v1-mkmlizer: ║ The license key for the current software has been verified as ║
riverise-alighment-0906-v1-mkmlizer: ║ belonging to: ║
riverise-alighment-0906-v1-mkmlizer: ║ ║
riverise-alighment-0906-v1-mkmlizer: ║ Chai Research Corp. ║
riverise-alighment-0906-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
riverise-alighment-0906-v1-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
riverise-alighment-0906-v1-mkmlizer: ║ ║
riverise-alighment-0906-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Connection pool is full, discarding connection: %s. Connection pool size: %s
riverise-alighment-0906-v1-mkmlizer: Downloaded to shared memory in 34.572s
riverise-alighment-0906-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp3w3nqct0, device:0
riverise-alighment-0906-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
riverise-alighment-0906-v1-mkmlizer: quantized model in 25.633s
riverise-alighment-0906-v1-mkmlizer: Processed model Riverise/alighment_0906 in 60.205s
riverise-alighment-0906-v1-mkmlizer: creating bucket guanaco-mkml-models
riverise-alighment-0906-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
riverise-alighment-0906-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/riverise-alighment-0906-v1
riverise-alighment-0906-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/riverise-alighment-0906-v1/special_tokens_map.json
riverise-alighment-0906-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/riverise-alighment-0906-v1/config.json
riverise-alighment-0906-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/riverise-alighment-0906-v1/tokenizer_config.json
riverise-alighment-0906-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/riverise-alighment-0906-v1/tokenizer.json
riverise-alighment-0906-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/riverise-alighment-0906-v1/flywheel_model.0.safetensors
riverise-alighment-0906-v1-mkmlizer:
Loading 0: 0%| | 0/291 [00:00<?, ?it/s]
Loading 0: 2%|▏ | 7/291 [00:00<00:05, 52.07it/s]
Loading 0: 8%|▊ | 22/291 [00:00<00:03, 78.33it/s]
Loading 0: 11%|█ | 31/291 [00:00<00:03, 78.95it/s]
Loading 0: 14%|█▍ | 42/291 [00:00<00:02, 88.82it/s]
Loading 0: 18%|█▊ | 52/291 [00:00<00:02, 81.55it/s]
Loading 0: 21%|██ | 61/291 [00:00<00:02, 81.96it/s]
Loading 0: 24%|██▍ | 70/291 [00:00<00:02, 73.99it/s]
Loading 0: 27%|██▋ | 79/291 [00:01<00:02, 78.20it/s]
Loading 0: 30%|███ | 88/291 [00:02<00:09, 21.20it/s]
Loading 0: 35%|███▌ | 103/291 [00:02<00:05, 31.47it/s]
Loading 0: 38%|███▊ | 112/291 [00:02<00:04, 37.03it/s]
Loading 0: 42%|████▏ | 121/291 [00:02<00:03, 43.96it/s]
Loading 0: 45%|████▍ | 130/291 [00:02<00:03, 51.03it/s]
Loading 0: 49%|████▉ | 142/291 [00:02<00:02, 58.81it/s]
Loading 0: 52%|█████▏ | 151/291 [00:02<00:02, 62.87it/s]
Loading 0: 55%|█████▍ | 160/291 [00:03<00:02, 64.90it/s]
Loading 0: 58%|█████▊ | 169/291 [00:03<00:01, 67.23it/s]
Loading 0: 63%|██████▎ | 184/291 [00:03<00:01, 78.52it/s]
Loading 0: 66%|██████▋ | 193/291 [00:04<00:03, 24.55it/s]
Loading 0: 69%|██████▉ | 202/291 [00:04<00:02, 30.51it/s]
Loading 0: 73%|███████▎ | 211/291 [00:04<00:02, 37.29it/s]
Loading 0: 76%|███████▌ | 220/291 [00:04<00:01, 44.69it/s]
Loading 0: 79%|███████▊ | 229/291 [00:04<00:01, 51.66it/s]
Loading 0: 82%|████████▏ | 240/291 [00:04<00:00, 62.63it/s]
Loading 0: 86%|████████▌ | 250/291 [00:05<00:00, 63.13it/s]
Loading 0: 89%|████████▉ | 259/291 [00:05<00:00, 68.79it/s]
Loading 0: 92%|█████████▏| 268/291 [00:05<00:00, 69.71it/s]
Loading 0: 95%|█████████▌| 277/291 [00:05<00:00, 71.74it/s]
Loading 0: 99%|█████████▊| 287/291 [00:05<00:00, 42.86it/s]
Job riverise-alighment-0906-v1-mkmlizer completed after 84.64s with status: succeeded
Stopping job with name riverise-alighment-0906-v1-mkmlizer
Pipeline stage MKMLizer completed in 86.35s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.09s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service riverise-alighment-0906-v1
Waiting for inference service riverise-alighment-0906-v1 to be ready
Inference service riverise-alighment-0906-v1 ready after 141.53224205970764s
Pipeline stage MKMLDeployer completed in 142.15s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.9558467864990234s
Received healthy response to inference request in 2.4215850830078125s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 1.556781530380249s
Received healthy response to inference request in 1.7771425247192383s
Received healthy response to inference request in 1.666693925857544s
5 requests
0 failed requests
5th percentile: 1.578764009475708
10th percentile: 1.600746488571167
20th percentile: 1.644711446762085
30th percentile: 1.6887836456298828
40th percentile: 1.7329630851745605
50th percentile: 1.7771425247192383
60th percentile: 1.8486242294311523
70th percentile: 1.9201059341430664
80th percentile: 2.048994445800781
90th percentile: 2.235289764404297
95th percentile: 2.3284374237060548
99th percentile: 2.402955551147461
mean time: 1.8756099700927735
Pipeline stage StressChecker completed in 10.90s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 4.83s
Shutdown handler de-registered
riverise-alighment-0906_v1 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Skipping teardown as no inference service was successfully deployed
Pipeline stage MKMLProfilerDeleter completed in 0.11s
run pipeline stage %s
Running pipeline stage MKMLProfilerTemplater
Pipeline stage MKMLProfilerTemplater completed in 0.10s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeployer
Creating inference service riverise-alighment-0906-v1-profiler
Waiting for inference service riverise-alighment-0906-v1-profiler to be ready
Inference service riverise-alighment-0906-v1-profiler ready after 150.3379716873169s
Pipeline stage MKMLProfilerDeployer completed in 150.69s
run pipeline stage %s
Running pipeline stage MKMLProfilerRunner
kubectl cp /code/guanaco/guanaco_inference_services/src/inference_scripts tenant-chaiml-guanaco/riverise-alighment-0906-v1-profiler-predictor-00001-deployvk4qd:/code/chaiverse_profiler_1725873027 --namespace tenant-chaiml-guanaco
kubectl exec -it riverise-alighment-0906-v1-profiler-predictor-00001-deployvk4qd --namespace tenant-chaiml-guanaco -- sh -c 'cd /code/chaiverse_profiler_1725873027 && python profiles.py profile --best_of_n 16 --auto_batch 5 --batches 1,5,10,15,20,25,30,35,40,45,50,55,60,65,70,75,80,85,90,95,100,105,110,115,120,125,130,135,140,145,150,155,160,165,170,175,180,185,190,195 --samples 200 --input_tokens 512 --output_tokens 64 --summary /code/chaiverse_profiler_1725873027/summary.json'
kubectl exec -it riverise-alighment-0906-v1-profiler-predictor-00001-deployvk4qd --namespace tenant-chaiml-guanaco -- bash -c 'cat /code/chaiverse_profiler_1725873027/summary.json'
Pipeline stage MKMLProfilerRunner completed in 822.45s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Checking if service riverise-alighment-0906-v1-profiler is running
Tearing down inference service riverise-alighment-0906-v1-profiler
Service riverise-alighment-0906-v1-profiler has been torndown
Pipeline stage MKMLProfilerDeleter completed in 1.59s
Shutdown handler de-registered
riverise-alighment-0906_v1 status is now inactive due to auto deactivation removed underperforming models
riverise-alighment-0906_v1 status is now torndown due to DeploymentManager action