Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name mistralai-mistral-nem-93303-v610-uploader
Waiting for job on mistralai-mistral-nem-93303-v610-uploader to finish
chaiml-1007-tl-ads-run-2-gac-v6-uploader: Processed model ChaiML/1007-tl-ads-run-2-gac in 117.766s
chaiml-1007-tl-ads-run-2-gac-v6-uploader: creating bucket guanaco-vllm-models
chaiml-1007-tl-ads-run-2-gac-v6-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-1007-tl-ads-run-2-gac-v6-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-1007-tl-ads-run-2-gac-v6-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-1007-tl-ads-run-2-gac-v6-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-1007-tl-ads-run-2-gac-v6-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-1007-tl-ads-run-2-gac-v6-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-1007-tl-ads-run-2-gac-v6-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-1007-tl-ads-run-2-gac-v6-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-1007-tl-ads-run-2-gac-v6-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-1007-tl-ads-run-2-gac-v6-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-1007-tl-ads-run-2-gac-v6-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-1007-tl-ads-run-2-gac-v6-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-1007-tl-ads-run-2-gac-v6-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-1007-tl-ads-run-2-gac-v6-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-1007-tl-ads-run-2-gac-v6-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-1007-tl-ads-run-2-gac-v6-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-1007-tl-ads-run-2-gac-v6-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-1007-tl-ads-run-2-gac-v6-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-1007-tl-ads-run-2-gac-v6/default
chaiml-1007-tl-ads-run-2-gac-v6-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-1007-tl-ads-run-2-gac-v6/default/config.json
chaiml-1007-tl-ads-run-2-gac-v6-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-1007-tl-ads-run-2-gac-v6/default/special_tokens_map.json
chaiml-1007-tl-ads-run-2-gac-v6-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-1007-tl-ads-run-2-gac-v6/default/model.safetensors.index.json
chaiml-1007-tl-ads-run-2-gac-v6-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-1007-tl-ads-run-2-gac-v6/default/generation_config.json
chaiml-1007-tl-ads-run-2-gac-v6-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-1007-tl-ads-run-2-gac-v6/default/recipe.yaml
chaiml-1007-tl-ads-run-2-gac-v6-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-1007-tl-ads-run-2-gac-v6/default/chat_template.jinja
chaiml-1007-tl-ads-run-2-gac-v6-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-1007-tl-ads-run-2-gac-v6/default/tokenizer_config.json
chaiml-1007-tl-ads-run-2-gac-v6-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-1007-tl-ads-run-2-gac-v6/default/tokenizer.json
chaiml-1007-tl-ads-run-2-gac-v6-uploader: cp /dev/shm/model_output/model-00003-of-00003.safetensors s3://guanaco-vllm-models/chaiml-1007-tl-ads-run-2-gac-v6/default/model-00003-of-00003.safetensors
chaiml-1007-tl-ads-run-2-gac-v6-uploader: cp /dev/shm/model_output/model-00001-of-00003.safetensors s3://guanaco-vllm-models/chaiml-1007-tl-ads-run-2-gac-v6/default/model-00001-of-00003.safetensors
chaiml-1007-tl-ads-run-2-gac-v6-uploader: cp /dev/shm/model_output/model-00002-of-00003.safetensors s3://guanaco-vllm-models/chaiml-1007-tl-ads-run-2-gac-v6/default/model-00002-of-00003.safetensors
mistralai-mistral-nem-93303-v610-uploader: Using quantization_mode: fp8
mistralai-mistral-nem-93303-v610-uploader: Checking if ChaiML/Mistral-Nemo-Instruct-2407-FP8 already exists in ChaiML
mistralai-mistral-nem-93303-v610-uploader: Downloading snapshot of mistralai/Mistral-Nemo-Instruct-2407...
Job chaiml-1007-tl-ads-run-2-gac-v6-uploader completed after 176.69s with status: succeeded
Stopping job with name chaiml-1007-tl-ads-run-2-gac-v6-uploader
Pipeline stage VLLMUploader completed in 177.28s
run pipeline stage %s
Running pipeline stage VLLMTemplater
2026-03-20T00:03:51.798047+00:00 monitor updated for chaiml-1007-tl-ads-run-2-gac_v6
Pipeline stage VLLMTemplater completed in 7.41s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-1007-tl-ads-run-2-gac-v6
Waiting for inference service chaiml-1007-tl-ads-run-2-gac-v6 to be ready
2026-03-20T00:03:58.665361+00:00 monitor updated for mistralai-mistral-nem_93303_v610
mistralai-mistral-nem-93303-v610-uploader: Downloaded in 21.769s
mistralai-mistral-nem-93303-v610-uploader: Loading /tmp/model_input...
mistralai-mistral-nem-93303-v610-uploader: The tokenizer you are loading from '/tmp/model_input' with an incorrect regex pattern: https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503/discussions/84#69121093e8b480e709447d5e. This will lead to incorrect tokenization. You should set the `fix_mistral_regex=True` flag when loading this tokenizer to fix this issue.
mistralai-mistral-nem-93303-v610-uploader: `torch_dtype` is deprecated! Use `dtype` instead!
mistralai-mistral-nem-93303-v610-uploader: Applying quantization...
mistralai-mistral-nem-93303-v610-uploader: The tokenizer you are loading from '/tmp/model_input' with an incorrect regex pattern: https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503/discussions/84#69121093e8b480e709447d5e. This will lead to incorrect tokenization. You should set the `fix_mistral_regex=True` flag when loading this tokenizer to fix this issue.
mistralai-mistral-nem-93303-v610-uploader: 2026-03-19T17:04:01.192889-0700 | reset | INFO - Compression lifecycle reset
mistralai-mistral-nem-93303-v610-uploader: 2026-03-19T17:04:01.193669-0700 | from_modifiers | INFO - Creating recipe from modifiers
mistralai-mistral-nem-93303-v610-uploader: 2026-03-19T17:04:01.225487-0700 | initialize | INFO - Compression lifecycle initialized for 1 modifiers
mistralai-mistral-nem-93303-v610-uploader: 2026-03-19T17:04:01.225753-0700 | IndependentPipeline | INFO - Inferred `DataFreePipeline` for `QuantizationModifier`
mistralai-mistral-nem-93303-v610-uploader: Some parameters are on the meta device because they were offloaded to the cpu.
mistralai-mistral-nem-93303-v610-uploader: 2026-03-19T17:04:16.442605-0700 | finalize | INFO - Compression lifecycle finalized for 1 modifiers
mistralai-mistral-nem-93303-v610-uploader: 2026-03-19T17:04:24.735188-0700 | post_process | WARNING - Optimized model is not saved. To save, please provide`output_dir` as input arg.Ex. `oneshot(..., output_dir=...)`
mistralai-mistral-nem-93303-v610-uploader: Saving to /dev/shm/model_output...
mistralai-mistral-nem-93303-v610-uploader: 2026-03-19T17:04:24.758063-0700 | get_model_compressor | INFO - skip_sparsity_compression_stats set to True. Skipping sparsity compression statistic calculations. No sparsity compressor will be applied.
2026-03-20T00:04:51.947308+00:00 monitor updated for chaiml-1007-tl-ads-run-2-gac_v6
mistralai-mistral-nem-93303-v610-uploader: Cleaning quantization config in /dev/shm/model_output
mistralai-mistral-nem-93303-v610-uploader: Pushing to ChaiML/Mistral-Nemo-Instruct-2407-FP8
mistralai-mistral-nem-93303-v610-uploader: Checking if ChaiML/Mistral-Nemo-Instruct-2407-FP8 already exists in ChaiML
mistralai-mistral-nem-93303-v610-uploader: Creating repo ChaiML/Mistral-Nemo-Instruct-2407-FP8 and uploading /dev/shm/model_output to it
mistralai-mistral-nem-93303-v610-uploader: ---------- 2026-03-19 17:04:50 (0:00:00) ----------
mistralai-mistral-nem-93303-v610-uploader: Files: hashed 7/11 (238.0K/13.6G) | pre-uploaded: 0/0 (0.0/13.6G) (+11 unsure) | committed: 0/11 (0.0/13.6G) | ignored: 0
mistralai-mistral-nem-93303-v610-uploader: Workers: hashing: 4 | get upload mode: 5 | pre-uploading: 0 | committing: 0 | waiting: 117
mistralai-mistral-nem-93303-v610-uploader: ---------------------------------------------------
2026-03-20T00:04:58.811692+00:00 monitor updated for mistralai-mistral-nem_93303_v610
mistralai-mistral-nem-93303-v610-uploader: Processed model mistralai/Mistral-Nemo-Instruct-2407 in 120.673s
mistralai-mistral-nem-93303-v610-uploader: creating bucket guanaco-vllm-models
mistralai-mistral-nem-93303-v610-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
mistralai-mistral-nem-93303-v610-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
mistralai-mistral-nem-93303-v610-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
mistralai-mistral-nem-93303-v610-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
mistralai-mistral-nem-93303-v610-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
mistralai-mistral-nem-93303-v610-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
mistralai-mistral-nem-93303-v610-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
mistralai-mistral-nem-93303-v610-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
mistralai-mistral-nem-93303-v610-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
mistralai-mistral-nem-93303-v610-uploader: if re.search("-\.", bucket, re.UNICODE):
mistralai-mistral-nem-93303-v610-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
mistralai-mistral-nem-93303-v610-uploader: if re.search("\.\.", bucket, re.UNICODE):
mistralai-mistral-nem-93303-v610-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
mistralai-mistral-nem-93303-v610-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
mistralai-mistral-nem-93303-v610-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
mistralai-mistral-nem-93303-v610-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
mistralai-mistral-nem-93303-v610-uploader: Bucket 's3://guanaco-vllm-models/' created
mistralai-mistral-nem-93303-v610-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v610/default
mistralai-mistral-nem-93303-v610-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v610/default/special_tokens_map.json
mistralai-mistral-nem-93303-v610-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v610/default/tokenizer_config.json
mistralai-mistral-nem-93303-v610-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v610/default/model.safetensors.index.json
mistralai-mistral-nem-93303-v610-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v610/default/config.json
mistralai-mistral-nem-93303-v610-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v610/default/recipe.yaml
mistralai-mistral-nem-93303-v610-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v610/default/generation_config.json
mistralai-mistral-nem-93303-v610-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v610/default/chat_template.jinja
mistralai-mistral-nem-93303-v610-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v610/default/tokenizer.json
2026-03-20T00:05:52.107065+00:00 monitor updated for chaiml-1007-tl-ads-run-2-gac_v6
mistralai-mistral-nem-93303-v610-uploader: cp /dev/shm/model_output/model-00003-of-00003.safetensors s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v610/default/model-00003-of-00003.safetensors
mistralai-mistral-nem-93303-v610-uploader: cp /dev/shm/model_output/model-00001-of-00003.safetensors s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v610/default/model-00001-of-00003.safetensors
mistralai-mistral-nem-93303-v610-uploader: cp /dev/shm/model_output/model-00002-of-00003.safetensors s3://guanaco-vllm-models/mistralai-mistral-nem-93303-v610/default/model-00002-of-00003.safetensors
Job mistralai-mistral-nem-93303-v610-uploader completed after 178.04s with status: succeeded
Stopping job with name mistralai-mistral-nem-93303-v610-uploader
Pipeline stage VLLMUploader completed in 178.80s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.46s
run pipeline stage %s
Running pipeline stage VLLMDeployer
2026-03-20T00:05:58.964175+00:00 monitor updated for mistralai-mistral-nem_93303_v610
Creating inference service mistralai-mistral-nem-93303-v610
Waiting for inference service mistralai-mistral-nem-93303-v610 to be ready
Inference service chaiml-1007-tl-ads-run-2-gac-v6 ready after 170.6031093597412s
Pipeline stage VLLMDeployer completed in 171.31s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.6207261085510254s
2026-03-20T00:06:52.273944+00:00 monitor updated for chaiml-1007-tl-ads-run-2-gac_v6
Received healthy response to inference request in 2.2718803882598877s
Received healthy response to inference request in 2.364598512649536s
Received healthy response to inference request in 2.2808117866516113s
2026-03-20T00:06:59.154931+00:00 monitor updated for mistralai-mistral-nem_93303_v610
Received healthy response to inference request in 2.3027162551879883s
Received healthy response to inference request in 2.3145391941070557s
Received healthy response to inference request in 2.341092348098755s
Received healthy response to inference request in 2.5801074504852295s
Received healthy response to inference request in 2.2660861015319824s
Received healthy response to inference request in 2.5564610958099365s
Received healthy response to inference request in 2.3285582065582275s
Received healthy response to inference request in 2.2837772369384766s
Received healthy response to inference request in 2.237067937850952s
Received healthy response to inference request in 2.3601784706115723s
Received healthy response to inference request in 2.264846086502075s
Received healthy response to inference request in 2.333388328552246s
Received healthy response to inference request in 2.266319513320923s
Received healthy response to inference request in 2.4498720169067383s
Received healthy response to inference request in 2.3218441009521484s
Received healthy response to inference request in 2.209130048751831s
Received healthy response to inference request in 2.3609001636505127s
Received healthy response to inference request in 2.4859306812286377s
Received healthy response to inference request in 2.304891347885132s
Received healthy response to inference request in 2.228647470474243s
Received healthy response to inference request in 2.269847869873047s
2026-03-20T00:07:52.449205+00:00 monitor updated for chaiml-1007-tl-ads-run-2-gac_v6
Received healthy response to inference request in 2.3493714332580566s
Received healthy response to inference request in 2.285050392150879s
Received healthy response to inference request in 2.350088596343994s
2026-03-20T00:07:59.347314+00:00 monitor updated for mistralai-mistral-nem_93303_v610
Received healthy response to inference request in 2.277848958969116s
Received healthy response to inference request in 2.283231735229492s
30 requests
0 failed requests
5th percentile: 2.232436680793762
10th percentile: 2.262068271636963
20th percentile: 2.269142198562622
30th percentile: 2.279922938346863
40th percentile: 2.284541130065918
50th percentile: 2.3097152709960938
60th percentile: 2.330490255355835
70th percentile: 2.349586582183838
80th percentile: 2.3616398334503175
90th percentile: 2.4929837226867675
95th percentile: 2.5694665908813477
99th percentile: 2.6089466977119447
mean time: 2.338326994578044
Pipeline stage StressChecker completed in 76.83s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.30s
Shutdown handler de-registered
chaiml-1007-tl-ads-run-2-gac_v6 status is now deployed due to DeploymentManager action
Inference service mistralai-mistral-nem-93303-v610 ready after 150.48500061035156s
Pipeline stage VLLMDeployer completed in 151.33s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.3657937049865723s
Received healthy response to inference request in 2.4321789741516113s
Received healthy response to inference request in 2.3183019161224365s
Received healthy response to inference request in 2.2315027713775635s
Received healthy response to inference request in 2.3059043884277344s
Received healthy response to inference request in 2.2499306201934814s
Received healthy response to inference request in 2.3352181911468506s
Received healthy response to inference request in 2.4275107383728027s
Received healthy response to inference request in 2.2168076038360596s
Received healthy response to inference request in 2.2456297874450684s
Received healthy response to inference request in 2.36216402053833s
Received healthy response to inference request in 2.3264920711517334s
2026-03-20T00:08:59.522329+00:00 monitor updated for mistralai-mistral-nem_93303_v610
Received healthy response to inference request in 2.2672228813171387s
Received healthy response to inference request in 2.24171781539917s
Received healthy response to inference request in 2.291961669921875s
Received healthy response to inference request in 2.3384597301483154s
Received healthy response to inference request in 2.49311900138855s
Received healthy response to inference request in 2.350935220718384s
Received healthy response to inference request in 2.3099215030670166s
Received healthy response to inference request in 2.2330048084259033s
Received healthy response to inference request in 2.2653844356536865s
Received healthy response to inference request in 2.2455976009368896s
Received healthy response to inference request in 2.2570228576660156s
Received healthy response to inference request in 2.2309446334838867s
Received healthy response to inference request in 2.3292462825775146s
Received healthy response to inference request in 2.2585933208465576s
Received healthy response to inference request in 2.2535524368286133s
Received healthy response to inference request in 2.361833095550537s
Received healthy response to inference request in 2.3620054721832275s
Received healthy response to inference request in 2.2715649604797363s
30 requests
0 failed requests
5th percentile: 2.231195795536041
10th percentile: 2.2328546047210693
20th percentile: 2.2456233501434326
30th percentile: 2.2559817314147947
40th percentile: 2.2664875030517577
50th percentile: 2.2989330291748047
60th percentile: 2.3215779781341555
70th percentile: 2.33619065284729
80th percentile: 2.361867570877075
90th percentile: 2.3719654083251953
95th percentile: 2.4300782680511475
99th percentile: 2.475446393489838
mean time: 2.3059840838114423
Pipeline stage StressChecker completed in 72.52s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.76s
Shutdown handler de-registered
mistralai-mistral-nem_93303_v610 status is now deployed due to DeploymentManager action
mistralai-mistral-nem_93303_v610 status is now inactive due to system request
mistralai-mistral-nem_93303_v610 status is now torndown due to DeploymentManager action