Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-2a6f-69d4-linea-43777-v10-uploader
Waiting for job on chaiml-2a6f-69d4-linea-43777-v10-uploader to finish
chaiml-2a6f-69d4-linea-43777-v10-uploader: Using quantization_mode: fp8
chaiml-2a6f-69d4-linea-43777-v10-uploader: Repo ChaiML/2a6f-69d4-linear-w01-FP8 already ends in FP8. Skipping...
chaiml-2a6f-69d4-linea-43777-v10-uploader: Checking if ChaiML/2a6f-69d4-linear-w01-FP8 already exists in ChaiML
chaiml-2a6f-69d4-linea-43777-v10-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-2a6f-69d4-linea-43777-v10-uploader: Downloading snapshot of ChaiML/2a6f-69d4-linear-w01-FP8...
2026-03-25T22:19:16.313611+00:00 monitor updated for chaiml-2a6f-69d4-linea_43777_v10
chaiml-2a6f-69d4-linea-43777-v10-uploader: Downloaded in 14.194s
chaiml-2a6f-69d4-linea-43777-v10-uploader: Processed model ChaiML/2a6f-69d4-linear-w01-FP8 in 17.849s
chaiml-2a6f-69d4-linea-43777-v10-uploader: creating bucket guanaco-vllm-models
chaiml-2a6f-69d4-linea-43777-v10-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linea-43777-v10-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-2a6f-69d4-linea-43777-v10-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-2a6f-69d4-linea-43777-v10-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-2a6f-69d4-linea-43777-v10-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linea-43777-v10-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-2a6f-69d4-linea-43777-v10-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linea-43777-v10-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-2a6f-69d4-linea-43777-v10-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linea-43777-v10-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-2a6f-69d4-linea-43777-v10-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linea-43777-v10-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-2a6f-69d4-linea-43777-v10-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-2a6f-69d4-linea-43777-v10-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-2a6f-69d4-linea-43777-v10-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-2a6f-69d4-linea-43777-v10-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-2a6f-69d4-linea-43777-v10-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-2a6f-69d4-linea-43777-v10-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-2a6f-69d4-linea-43777-v10/default
chaiml-2a6f-69d4-linea-43777-v10-uploader: cp /dev/shm/model_output/README.md s3://guanaco-vllm-models/chaiml-2a6f-69d4-linea-43777-v10/default/README.md
chaiml-2a6f-69d4-linea-43777-v10-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linea-43777-v10/default/generation_config.json
chaiml-2a6f-69d4-linea-43777-v10-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linea-43777-v10/default/special_tokens_map.json
chaiml-2a6f-69d4-linea-43777-v10-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-2a6f-69d4-linea-43777-v10/default/recipe.yaml
chaiml-2a6f-69d4-linea-43777-v10-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linea-43777-v10/default/config.json
chaiml-2a6f-69d4-linea-43777-v10-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linea-43777-v10/default/model.safetensors.index.json
chaiml-2a6f-69d4-linea-43777-v10-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-2a6f-69d4-linea-43777-v10/default/.gitattributes
chaiml-2a6f-69d4-linea-43777-v10-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linea-43777-v10/default/tokenizer_config.json
chaiml-2a6f-69d4-linea-43777-v10-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-2a6f-69d4-linea-43777-v10/default/chat_template.jinja
chaiml-2a6f-69d4-linea-43777-v10-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linea-43777-v10/default/tokenizer.json
chaiml-2a6f-69d4-linea-43777-v10-uploader: cp /dev/shm/model_output/model-00006-of-00006.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linea-43777-v10/default/model-00006-of-00006.safetensors
chaiml-2a6f-69d4-linea-43777-v10-uploader: cp /dev/shm/model_output/model-00005-of-00006.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linea-43777-v10/default/model-00005-of-00006.safetensors
chaiml-2a6f-69d4-linea-43777-v10-uploader: cp /dev/shm/model_output/model-00002-of-00006.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linea-43777-v10/default/model-00002-of-00006.safetensors
chaiml-2a6f-69d4-linea-43777-v10-uploader: cp /dev/shm/model_output/model-00004-of-00006.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linea-43777-v10/default/model-00004-of-00006.safetensors
chaiml-2a6f-69d4-linea-43777-v10-uploader: cp /dev/shm/model_output/model-00003-of-00006.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linea-43777-v10/default/model-00003-of-00006.safetensors
chaiml-2a6f-69d4-linea-43777-v10-uploader: cp /dev/shm/model_output/model-00001-of-00006.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linea-43777-v10/default/model-00001-of-00006.safetensors
Job chaiml-2a6f-69d4-linea-43777-v10-uploader completed after 73.17s with status: succeeded
Stopping job with name chaiml-2a6f-69d4-linea-43777-v10-uploader
Pipeline stage VLLMUploader completed in 73.82s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 2.31s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-2a6f-69d4-linea-43777-v10
Waiting for inference service chaiml-2a6f-69d4-linea-43777-v10 to be ready
2026-03-25T22:20:16.713584+00:00 monitor updated for chaiml-2a6f-69d4-linea_43777_v10
2026-03-25T22:21:16.815660+00:00 monitor updated for chaiml-2a6f-69d4-linea_43777_v10
2026-03-25T22:22:16.908156+00:00 monitor updated for chaiml-2a6f-69d4-linea_43777_v10
Inference service chaiml-2a6f-69d4-linea-43777-v10 ready after 170.41811156272888s
Pipeline stage VLLMDeployer completed in 171.03s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 8.0371732711792s
Received healthy response to inference request in 8.062968254089355s
Received healthy response to inference request in 8.641259670257568s
Received healthy response to inference request in 3.0868358612060547s
Received healthy response to inference request in 7.956582307815552s
Received healthy response to inference request in 2.9853594303131104s
Received healthy response to inference request in 3.0373008251190186s
Received healthy response to inference request in 3.058403968811035s
Received healthy response to inference request in 3.0125725269317627s
Received healthy response to inference request in 3.114269733428955s
2026-03-25T22:23:17.107330+00:00 monitor updated for chaiml-2a6f-69d4-linea_43777_v10
Received healthy response to inference request in 3.113478899002075s
Received healthy response to inference request in 8.195333003997803s
Received healthy response to inference request in 2.955868721008301s
Received healthy response to inference request in 2.924875020980835s
Received healthy response to inference request in 3.0369417667388916s
Received healthy response to inference request in 3.0151453018188477s
Received healthy response to inference request in 3.0436325073242188s
Received healthy response to inference request in 2.9323606491088867s
Received healthy response to inference request in 2.9265689849853516s
Received healthy response to inference request in 3.181732177734375s
Received healthy response to inference request in 3.040956974029541s
Received healthy response to inference request in 2.8839828968048096s
Received healthy response to inference request in 3.0526225566864014s
Received healthy response to inference request in 2.936156749725342s
Received healthy response to inference request in 2.9041154384613037s
Received healthy response to inference request in 3.1348798274993896s
Received healthy response to inference request in 2.910212755203247s
Received healthy response to inference request in 2.9391367435455322s
2026-03-25T22:24:17.202388+00:00 monitor updated for chaiml-2a6f-69d4-linea_43777_v10
Received healthy response to inference request in 3.0377938747406006s
Received healthy response to inference request in 2.9929420948028564s
30 requests
0 failed requests
5th percentile: 2.9068592309951784
10th percentile: 2.923408794403076
20th percentile: 2.9353975296020507
30th percentile: 2.9765122175216674
40th percentile: 3.0141161918640136
50th percentile: 3.0375473499298096
60th percentile: 3.047228527069092
70th percentile: 3.0948287725448607
80th percentile: 3.144250297546387
90th percentile: 8.039752769470216
95th percentile: 8.135768866539001
99th percentile: 8.511940937042237
mean time: 3.8717154264450073
Pipeline stage StressChecker completed in 118.77s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.33s
Shutdown handler de-registered
chaiml-2a6f-69d4-linea_43777_v10 status is now deployed due to DeploymentManager action
chaiml-2a6f-69d4-linea_43777_v10 status is now inactive due to system request
chaiml-2a6f-69d4-linea_43777_v10 status is now torndown due to DeploymentManager action