Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name mistralai-mistral-nemo-9330-v164-mkmlizer
Waiting for job on mistralai-mistral-nemo-9330-v164-mkmlizer to finish
mistralai-mistral-nemo-9330-v164-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
mistralai-mistral-nemo-9330-v164-mkmlizer: ║ _____ __ __ ║
mistralai-mistral-nemo-9330-v164-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
mistralai-mistral-nemo-9330-v164-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
mistralai-mistral-nemo-9330-v164-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
mistralai-mistral-nemo-9330-v164-mkmlizer: ║ /___/ ║
mistralai-mistral-nemo-9330-v164-mkmlizer: ║ ║
mistralai-mistral-nemo-9330-v164-mkmlizer: ║ Version: 0.11.12 ║
mistralai-mistral-nemo-9330-v164-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
mistralai-mistral-nemo-9330-v164-mkmlizer: ║ https://mk1.ai ║
mistralai-mistral-nemo-9330-v164-mkmlizer: ║ ║
mistralai-mistral-nemo-9330-v164-mkmlizer: ║ The license key for the current software has been verified as ║
mistralai-mistral-nemo-9330-v164-mkmlizer: ║ belonging to: ║
mistralai-mistral-nemo-9330-v164-mkmlizer: ║ ║
mistralai-mistral-nemo-9330-v164-mkmlizer: ║ Chai Research Corp. ║
mistralai-mistral-nemo-9330-v164-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
mistralai-mistral-nemo-9330-v164-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
mistralai-mistral-nemo-9330-v164-mkmlizer: ║ ║
mistralai-mistral-nemo-9330-v164-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
mistralai-mistral-nemo-9330-v164-mkmlizer: Downloaded to shared memory in 47.313s
mistralai-mistral-nemo-9330-v164-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpi9coawxk, device:0
mistralai-mistral-nemo-9330-v164-mkmlizer: Saving flywheel model at /dev/shm/model_cache
mistralai-mistral-nemo-9330-v164-mkmlizer: quantized model in 37.361s
mistralai-mistral-nemo-9330-v164-mkmlizer: Processed model mistralai/Mistral-Nemo-Instruct-2407 in 84.675s
mistralai-mistral-nemo-9330-v164-mkmlizer: creating bucket guanaco-mkml-models
mistralai-mistral-nemo-9330-v164-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
mistralai-mistral-nemo-9330-v164-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v164
mistralai-mistral-nemo-9330-v164-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v164/config.json
mistralai-mistral-nemo-9330-v164-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v164/special_tokens_map.json
mistralai-mistral-nemo-9330-v164-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v164/tokenizer_config.json
mistralai-mistral-nemo-9330-v164-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v164/tokenizer.json
mistralai-mistral-nemo-9330-v164-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v164/flywheel_model.0.safetensors
Job mistralai-mistral-nemo-9330-v164-mkmlizer completed after 114.81s with status: succeeded
Stopping job with name mistralai-mistral-nemo-9330-v164-mkmlizer
Pipeline stage MKMLizer completed in 115.32s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.18s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service mistralai-mistral-nemo-9330-v164
Waiting for inference service mistralai-mistral-nemo-9330-v164 to be ready
Inference service mistralai-mistral-nemo-9330-v164 ready after 151.03543853759766s
Pipeline stage MKMLDeployer completed in 151.54s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.152400016784668s
Received healthy response to inference request in 1.8251824378967285s
Received healthy response to inference request in 1.4937877655029297s
Received healthy response to inference request in 1.429908037185669s
Received healthy response to inference request in 1.7930872440338135s
5 requests
0 failed requests
5th percentile: 1.4426839828491211
10th percentile: 1.4554599285125733
20th percentile: 1.4810118198394775
30th percentile: 1.5536476612091064
40th percentile: 1.67336745262146
50th percentile: 1.7930872440338135
60th percentile: 1.8059253215789794
70th percentile: 1.8187633991241454
80th percentile: 1.8906259536743164
90th percentile: 2.021512985229492
95th percentile: 2.08695650100708
99th percentile: 2.1393113136291504
mean time: 1.7388731002807618
Pipeline stage StressChecker completed in 10.26s
Shutdown handler de-registered
mistralai-mistral-nemo_9330_v164 status is now deployed due to DeploymentManager action
mistralai-mistral-nemo_9330_v164 status is now inactive due to auto deactivation removed underperforming models
mistralai-mistral-nemo_9330_v164 status is now torndown due to DeploymentManager action