Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-2a6f-69d4-linear-43777-v1-uploader
Waiting for job on chaiml-2a6f-69d4-linear-43777-v1-uploader to finish
chaiml-2a6f-69d4-linear-43777-v1-uploader: Using quantization_mode: none
chaiml-2a6f-69d4-linear-43777-v1-uploader: Downloading snapshot of ChaiML/2a6f-69d4-linear-w01-FP8...
chaiml-2a6f-69d4-linear-43777-v1-uploader:
Fetching 15 files: 0%| | 0/15 [00:00<?, ?it/s]
Fetching 15 files: 7%|▋ | 1/15 [00:00<00:04, 3.38it/s]
Fetching 15 files: 33%|███▎ | 5/15 [00:11<00:24, 2.43s/it]
Fetching 15 files: 40%|████ | 6/15 [00:12<00:17, 2.00s/it]
Fetching 15 files: 47%|████▋ | 7/15 [00:12<00:12, 1.59s/it]
Fetching 15 files: 100%|██████████| 15/15 [00:12<00:00, 1.21it/s]
chaiml-2a6f-69d4-linear-43777-v1-uploader: Downloaded in 12.548s
chaiml-2a6f-69d4-linear-43777-v1-uploader: Processed model ChaiML/2a6f-69d4-linear-w01-FP8 in 21.501s
chaiml-2a6f-69d4-linear-43777-v1-uploader: creating bucket guanaco-vllm-models
chaiml-2a6f-69d4-linear-43777-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-43777-v1-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-2a6f-69d4-linear-43777-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-2a6f-69d4-linear-43777-v1-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-2a6f-69d4-linear-43777-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-43777-v1-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-2a6f-69d4-linear-43777-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-43777-v1-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-2a6f-69d4-linear-43777-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-43777-v1-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-2a6f-69d4-linear-43777-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-43777-v1-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-2a6f-69d4-linear-43777-v1-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-2a6f-69d4-linear-43777-v1-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-2a6f-69d4-linear-43777-v1-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-2a6f-69d4-linear-43777-v1-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-2a6f-69d4-linear-43777-v1-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-2a6f-69d4-linear-43777-v1-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v1
chaiml-2a6f-69d4-linear-43777-v1-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v1/.gitattributes
chaiml-2a6f-69d4-linear-43777-v1-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v1/chat_template.jinja
chaiml-2a6f-69d4-linear-43777-v1-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v1/generation_config.json
chaiml-2a6f-69d4-linear-43777-v1-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v1/recipe.yaml
chaiml-2a6f-69d4-linear-43777-v1-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v1/config.json
chaiml-2a6f-69d4-linear-43777-v1-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v1/special_tokens_map.json
chaiml-2a6f-69d4-linear-43777-v1-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v1/model.safetensors.index.json
chaiml-2a6f-69d4-linear-43777-v1-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v1/tokenizer_config.json
chaiml-2a6f-69d4-linear-43777-v1-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v1/tokenizer.json
chaiml-2a6f-69d4-linear-43777-v1-uploader: cp /dev/shm/model_output/model-00006-of-00006.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v1/model-00006-of-00006.safetensors
chaiml-2a6f-69d4-linear-43777-v1-uploader: cp /dev/shm/model_output/model-00005-of-00006.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v1/model-00005-of-00006.safetensors
chaiml-2a6f-69d4-linear-43777-v1-uploader: cp /dev/shm/model_output/model-00003-of-00006.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v1/model-00003-of-00006.safetensors
chaiml-2a6f-69d4-linear-43777-v1-uploader: cp /dev/shm/model_output/model-00001-of-00006.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v1/model-00001-of-00006.safetensors
chaiml-2a6f-69d4-linear-43777-v1-uploader: cp /dev/shm/model_output/model-00004-of-00006.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v1/model-00004-of-00006.safetensors
Job chaiml-2a6f-69d4-linear-43777-v1-uploader completed after 103.26s with status: succeeded
Stopping job with name chaiml-2a6f-69d4-linear-43777-v1-uploader
Pipeline stage VLLMUploader completed in 103.96s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.18s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-2a6f-69d4-linear-43777-v1
Waiting for inference service chaiml-2a6f-69d4-linear-43777-v1 to be ready
Inference service chaiml-2a6f-69d4-linear-43777-v1 ready after 160.7305428981781s
Pipeline stage VLLMDeployer completed in 161.29s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.3845183849334717s
Received healthy response to inference request in 1.5466578006744385s
Received healthy response to inference request in 1.387627124786377s
Received healthy response to inference request in 1.3907713890075684s
Received healthy response to inference request in 1.4785878658294678s
Received healthy response to inference request in 1.422184705734253s
Received healthy response to inference request in 1.6087820529937744s
Received healthy response to inference request in 1.6967854499816895s
Received healthy response to inference request in 1.5539257526397705s
Received healthy response to inference request in 1.4195594787597656s
Received healthy response to inference request in 1.82924485206604s
Received healthy response to inference request in 1.6962757110595703s
Received healthy response to inference request in 1.4105415344238281s
Received healthy response to inference request in 1.4734909534454346s
Received healthy response to inference request in 1.655531644821167s
Received healthy response to inference request in 1.5525262355804443s
Received healthy response to inference request in 1.5602169036865234s
Received healthy response to inference request in 1.5442395210266113s
Received healthy response to inference request in 1.6243162155151367s
Received healthy response to inference request in 1.3377466201782227s
Received healthy response to inference request in 1.7908177375793457s
Received healthy response to inference request in 1.4203906059265137s
Received healthy response to inference request in 1.3558149337768555s
Received healthy response to inference request in 1.6612226963043213s
Received healthy response to inference request in 1.3796980381011963s
Received healthy response to inference request in 1.3785796165466309s
Received healthy response to inference request in 1.3469712734222412s
Received healthy response to inference request in 1.4775595664978027s
Received healthy response to inference request in 1.5998783111572266s
Received healthy response to inference request in 1.4567444324493408s
30 requests
0 failed requests
5th percentile: 1.3509509205818175
10th percentile: 1.3763031482696533
20th percentile: 1.387005376815796
30th percentile: 1.4168540954589843
40th percentile: 1.4429205417633058
50th percentile: 1.4780737161636353
60th percentile: 1.5490051746368407
70th percentile: 1.5721153259277343
80th percentile: 1.6305593013763429
90th percentile: 1.6963266849517822
95th percentile: 1.7485032081604002
99th percentile: 1.8181009888648987
mean time: 1.5147069136301676
Pipeline stage StressChecker completed in 49.53s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.63s
Shutdown handler de-registered
chaiml-2a6f-69d4-linear_43777_v1 status is now deployed due to DeploymentManager action
chaiml-2a6f-69d4-linear_43777_v1 status is now inactive due to auto deactivation removed underperforming models
chaiml-2a6f-69d4-linear_43777_v1 status is now torndown due to DeploymentManager action