Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name mistralai-mistral-nem-93303-v611-uploader
Waiting for job on mistralai-mistral-nem-93303-v611-uploader to finish
mistralai-mistral-nem-93303-v611-uploader: Using quantization_mode: fp8
mistralai-mistral-nem-93303-v611-uploader: Checking if ChaiML/Mistral-Nemo-Instruct-2407-FP8 already exists in ChaiML
mistralai-mistral-nem-93303-v611-uploader: Model already exists. Downloading to /dev/shm/model_output...
mistralai-mistral-nem-93303-v611-uploader: Downloading snapshot of ChaiML/Mistral-Nemo-Instruct-2407-FP8...
mistralai-mistral-nem-93303-v611-uploader: Downloaded in 8.753s
mistralai-mistral-nem-93303-v611-uploader: Processed model mistralai/Mistral-Nemo-Instruct-2407 in 12.172s
mistralai-mistral-nem-93303-v611-uploader: creating bucket guanaco-vllm-models
mistralai-mistral-nem-93303-v611-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
mistralai-mistral-nem-93303-v611-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
mistralai-mistral-nem-93303-v611-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
mistralai-mistral-nem-93303-v611-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
mistralai-mistral-nem-93303-v611-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
mistralai-mistral-nem-93303-v611-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
mistralai-mistral-nem-93303-v611-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
mistralai-mistral-nem-93303-v611-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
mistralai-mistral-nem-93303-v611-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
mistralai-mistral-nem-93303-v611-uploader: if re.search("-\.", bucket, re.UNICODE):
mistralai-mistral-nem-93303-v611-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
mistralai-mistral-nem-93303-v611-uploader: if re.search("\.\.", bucket, re.UNICODE):
mistralai-mistral-nem-93303-v611-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
mistralai-mistral-nem-93303-v611-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
mistralai-mistral-nem-93303-v611-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
mistralai-mistral-nem-93303-v611-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
mistralai-mistral-nem-93303-v611-uploader: Bucket 's3://guanaco-vllm-models/' created
mistralai-mistral-nem-93303-v611-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v611/default
mistralai-mistral-nem-93303-v611-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v611/default/recipe.yaml
mistralai-mistral-nem-93303-v611-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v611/default/chat_template.jinja
mistralai-mistral-nem-93303-v611-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v611/default/special_tokens_map.json
mistralai-mistral-nem-93303-v611-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v611/default/config.json
mistralai-mistral-nem-93303-v611-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v611/default/generation_config.json
mistralai-mistral-nem-93303-v611-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v611/default/.gitattributes
mistralai-mistral-nem-93303-v611-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v611/default/model.safetensors.index.json
mistralai-mistral-nem-93303-v611-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v611/default/tokenizer_config.json
mistralai-mistral-nem-93303-v611-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v611/default/tokenizer.json
mistralai-mistral-nem-93303-v611-uploader: cp /dev/shm/model_output/model-00003-of-00003.safetensors s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v611/default/model-00003-of-00003.safetensors
mistralai-mistral-nem-93303-v611-uploader: cp /dev/shm/model_output/model-00001-of-00003.safetensors s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v611/default/model-00001-of-00003.safetensors
mistralai-mistral-nem-93303-v611-uploader: cp /dev/shm/model_output/model-00002-of-00003.safetensors s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v611/default/model-00002-of-00003.safetensors
Job mistralai-mistral-nem-93303-v611-uploader completed after 52.27s with status: succeeded
Stopping job with name mistralai-mistral-nem-93303-v611-uploader
Pipeline stage VLLMUploader completed in 52.70s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.78s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service mistralai-mistral-nem-93303-v611
Waiting for inference service mistralai-mistral-nem-93303-v611 to be ready
2026-03-20T23:07:22.450512+00:00 monitor updated for mistralai-mistral-nem_93303_v611
2026-03-20T23:08:22.548810+00:00 monitor updated for mistralai-mistral-nem_93303_v611
2026-03-20T23:09:22.641911+00:00 monitor updated for mistralai-mistral-nem_93303_v611
Inference service mistralai-mistral-nem-93303-v611 ready after 160.37225556373596s
Pipeline stage VLLMDeployer completed in 160.83s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.356667995452881s
Received healthy response to inference request in 2.3475565910339355s
Received healthy response to inference request in 2.3053231239318848s
Received healthy response to inference request in 2.40688419342041s
Received healthy response to inference request in 2.29186749458313s
Received healthy response to inference request in 2.256836175918579s
Received healthy response to inference request in 2.239605188369751s
Received healthy response to inference request in 2.2847208976745605s
Received healthy response to inference request in 2.2430953979492188s
Received healthy response to inference request in 2.2522871494293213s
2026-03-20T23:10:22.744325+00:00 monitor updated for mistralai-mistral-nem_93303_v611
Received healthy response to inference request in 2.233771800994873s
Received healthy response to inference request in 2.292695999145508s
Received healthy response to inference request in 2.244245767593384s
Received healthy response to inference request in 2.3891561031341553s
Received healthy response to inference request in 2.2988338470458984s
Received healthy response to inference request in 2.2346742153167725s
Received healthy response to inference request in 2.5090534687042236s
Received healthy response to inference request in 2.385389566421509s
Received healthy response to inference request in 2.244387626647949s
Received healthy response to inference request in 2.551868438720703s
Received healthy response to inference request in 2.328420877456665s
Received healthy response to inference request in 2.249264717102051s
Received healthy response to inference request in 2.352728843688965s
Received healthy response to inference request in 2.336475372314453s
Received healthy response to inference request in 2.2393927574157715s
Received healthy response to inference request in 2.23984694480896s
Received healthy response to inference request in 2.249741554260254s
Received healthy response to inference request in 2.241194009780884s
Received healthy response to inference request in 2.3913183212280273s
Received healthy response to inference request in 2.2798879146575928s
30 requests
0 failed requests
5th percentile: 2.236797559261322
10th percentile: 2.239583945274353
20th percentile: 2.2427151203155518
30th percentile: 2.24780158996582
40th percentile: 2.255016565322876
50th percentile: 2.288294196128845
60th percentile: 2.301429557800293
70th percentile: 2.339799737930298
80th percentile: 2.3624123096466065
90th percentile: 2.392874908447266
95th percentile: 2.4630772948265074
99th percentile: 2.539452097415924
mean time: 2.3092397451400757
Pipeline stage StressChecker completed in 79.06s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
mistralai-mistral-nem_93303_v611 status is now deployed due to DeploymentManager action
mistralai-mistral-nem_93303_v611 status is now inactive due to auto deactivation removed underperforming models