Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-2fe5-c13f-linea-57126-v11-uploader
Waiting for job on chaiml-2fe5-c13f-linea-57126-v11-uploader to finish
chaiml-2fe5-c13f-linea-57126-v11-uploader: Using quantization_mode: fp8
chaiml-2fe5-c13f-linea-57126-v11-uploader: Repo ChaiML/2fe5-c13f-linear-w01-FP8 already ends in FP8. Skipping...
chaiml-2fe5-c13f-linea-57126-v11-uploader: Checking if ChaiML/2fe5-c13f-linear-w01-FP8 already exists in ChaiML
chaiml-2fe5-c13f-linea-57126-v11-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-2fe5-c13f-linea-57126-v11-uploader: Downloading snapshot of ChaiML/2fe5-c13f-linear-w01-FP8...
chaiml-2fe5-c13f-linea-57126-v11-uploader: Downloaded in 6.469s
chaiml-2fe5-c13f-linea-57126-v11-uploader: Processed model ChaiML/2fe5-c13f-linear-w01-FP8 in 10.086s
chaiml-2fe5-c13f-linea-57126-v11-uploader: creating bucket guanaco-vllm-models
chaiml-2fe5-c13f-linea-57126-v11-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-2fe5-c13f-linea-57126-v11-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-2fe5-c13f-linea-57126-v11-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-2fe5-c13f-linea-57126-v11-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-2fe5-c13f-linea-57126-v11-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-2fe5-c13f-linea-57126-v11-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-2fe5-c13f-linea-57126-v11-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-2fe5-c13f-linea-57126-v11-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-2fe5-c13f-linea-57126-v11-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-2fe5-c13f-linea-57126-v11-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-2fe5-c13f-linea-57126-v11-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-2fe5-c13f-linea-57126-v11-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-2fe5-c13f-linea-57126-v11-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-2fe5-c13f-linea-57126-v11-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-2fe5-c13f-linea-57126-v11-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-2fe5-c13f-linea-57126-v11-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-2fe5-c13f-linea-57126-v11-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-2fe5-c13f-linea-57126-v11-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-2fe5-c13f-linea-57126-v11/default
chaiml-2fe5-c13f-linea-57126-v11-uploader: cp /dev/shm/model_output/README.md s3://guanaco-vllm-models/chaiml-2fe5-c13f-linea-57126-v11/default/README.md
chaiml-2fe5-c13f-linea-57126-v11-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-2fe5-c13f-linea-57126-v11/default/recipe.yaml
chaiml-2fe5-c13f-linea-57126-v11-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-2fe5-c13f-linea-57126-v11/default/config.json
chaiml-2fe5-c13f-linea-57126-v11-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-2fe5-c13f-linea-57126-v11/default/generation_config.json
chaiml-2fe5-c13f-linea-57126-v11-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-2fe5-c13f-linea-57126-v11/default/special_tokens_map.json
chaiml-2fe5-c13f-linea-57126-v11-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-2fe5-c13f-linea-57126-v11/default/chat_template.jinja
chaiml-2fe5-c13f-linea-57126-v11-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-2fe5-c13f-linea-57126-v11/default/model.safetensors.index.json
chaiml-2fe5-c13f-linea-57126-v11-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-2fe5-c13f-linea-57126-v11/default/.gitattributes
chaiml-2fe5-c13f-linea-57126-v11-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-2fe5-c13f-linea-57126-v11/default/tokenizer_config.json
chaiml-2fe5-c13f-linea-57126-v11-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-2fe5-c13f-linea-57126-v11/default/tokenizer.json
2026-03-24T20:53:17.778065+00:00 monitor updated for chaiml-2fe5-c13f-linea_57126_v11
chaiml-2fe5-c13f-linea-57126-v11-uploader: cp /dev/shm/model_output/model-00003-of-00003.safetensors s3://guanaco-vllm-models/chaiml-2fe5-c13f-linea-57126-v11/default/model-00003-of-00003.safetensors
chaiml-2fe5-c13f-linea-57126-v11-uploader: cp /dev/shm/model_output/model-00001-of-00003.safetensors s3://guanaco-vllm-models/chaiml-2fe5-c13f-linea-57126-v11/default/model-00001-of-00003.safetensors
chaiml-2fe5-c13f-linea-57126-v11-uploader: cp /dev/shm/model_output/model-00002-of-00003.safetensors s3://guanaco-vllm-models/chaiml-2fe5-c13f-linea-57126-v11/default/model-00002-of-00003.safetensors
Job chaiml-2fe5-c13f-linea-57126-v11-uploader completed after 72.6s with status: succeeded
Stopping job with name chaiml-2fe5-c13f-linea-57126-v11-uploader
Pipeline stage VLLMUploader completed in 73.14s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 1.95s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-2fe5-c13f-linea-57126-v11
Waiting for inference service chaiml-2fe5-c13f-linea-57126-v11 to be ready
2026-03-24T20:54:17.863499+00:00 monitor updated for chaiml-2fe5-c13f-linea_57126_v11
2026-03-24T20:55:17.948763+00:00 monitor updated for chaiml-2fe5-c13f-linea_57126_v11
Inference service chaiml-2fe5-c13f-linea-57126-v11 ready after 160.41200613975525s
Pipeline stage VLLMDeployer completed in 160.86s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.258723258972168s
Received healthy response to inference request in 3.1697933673858643s
Received healthy response to inference request in 3.2231597900390625s
2026-03-24T20:56:25.684125+00:00 monitor updated for chaiml-2fe5-c13f-linea_57126_v11
Received healthy response to inference request in 2.1359140872955322s
Received healthy response to inference request in 1.645362138748169s
Received healthy response to inference request in 3.3179943561553955s
Received healthy response to inference request in 1.6534483432769775s
Received healthy response to inference request in 1.643704891204834s
Received healthy response to inference request in 3.1981868743896484s
Received healthy response to inference request in 1.6121559143066406s
Received healthy response to inference request in 1.6147739887237549s
Received healthy response to inference request in 1.6692700386047363s
Received healthy response to inference request in 1.6862990856170654s
Received healthy response to inference request in 1.894209861755371s
Received healthy response to inference request in 1.757728099822998s
Received healthy response to inference request in 1.6013648509979248s
Received healthy response to inference request in 1.684762716293335s
Received healthy response to inference request in 1.9196114540100098s
Received healthy response to inference request in 1.6844427585601807s
Received healthy response to inference request in 1.763904094696045s
Received healthy response to inference request in 1.6060614585876465s
Received healthy response to inference request in 1.5961039066314697s
Received healthy response to inference request in 1.8348660469055176s
Received healthy response to inference request in 1.7861993312835693s
Received healthy response to inference request in 1.6501374244689941s
Received healthy response to inference request in 1.827815055847168s
Received healthy response to inference request in 1.648136854171753s
Received healthy response to inference request in 1.9322335720062256s
Received healthy response to inference request in 1.608999490737915s
Received healthy response to inference request in 1.6058945655822754s
30 requests
0 failed requests
5th percentile: 1.6034032225608825
10th percentile: 1.6060447692871094
20th percentile: 1.614250373840332
30th percentile: 1.6473044395446776
40th percentile: 1.6629413604736327
50th percentile: 1.6855309009552002
60th percentile: 1.7728221893310547
70th percentile: 1.8526691913604734
80th percentile: 1.9729696750640875
90th percentile: 3.20068416595459
95th percentile: 3.2427196979522703
99th percentile: 3.3008057379722597
mean time: 1.9743752559026082
Pipeline stage StressChecker completed in 61.61s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
chaiml-2fe5-c13f-linea_57126_v11 status is now deployed due to DeploymentManager action
chaiml-2fe5-c13f-linea_57126_v11 status is now inactive due to auto deactivation removed underperforming models
chaiml-2fe5-c13f-linea_57126_v11 status is now torndown due to DeploymentManager action