Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-4d70-fd43-linear-51732-v8-uploader
Waiting for job on chaiml-4d70-fd43-linear-51732-v8-uploader to finish
chaiml-4d70-fd43-linear-51732-v8-uploader: Using quantization_mode: fp8
chaiml-4d70-fd43-linear-51732-v8-uploader: Downloaded in 7.935s
chaiml-4d70-fd43-linear-51732-v8-uploader: Processed model ChaiML/4d70-fd43-linear-w01-FP8 in 11.583s
chaiml-4d70-fd43-linear-51732-v8-uploader: creating bucket guanaco-vllm-models
chaiml-4d70-fd43-linear-51732-v8-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-4d70-fd43-linear-51732-v8-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-4d70-fd43-linear-51732-v8-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-4d70-fd43-linear-51732-v8-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-4d70-fd43-linear-51732-v8-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-4d70-fd43-linear-51732-v8-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-4d70-fd43-linear-51732-v8-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-4d70-fd43-linear-51732-v8-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-4d70-fd43-linear-51732-v8-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-4d70-fd43-linear-51732-v8-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-4d70-fd43-linear-51732-v8-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-4d70-fd43-linear-51732-v8-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-4d70-fd43-linear-51732-v8-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-4d70-fd43-linear-51732-v8-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-4d70-fd43-linear-51732-v8-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-4d70-fd43-linear-51732-v8-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-4d70-fd43-linear-51732-v8-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-4d70-fd43-linear-51732-v8-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v8/default
chaiml-4d70-fd43-linear-51732-v8-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v8/default/.gitattributes
chaiml-4d70-fd43-linear-51732-v8-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v8/default/chat_template.jinja
chaiml-4d70-fd43-linear-51732-v8-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v8/default/config.json
chaiml-4d70-fd43-linear-51732-v8-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v8/default/generation_config.json
chaiml-4d70-fd43-linear-51732-v8-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v8/default/special_tokens_map.json
chaiml-4d70-fd43-linear-51732-v8-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v8/default/model.safetensors.index.json
chaiml-4d70-fd43-linear-51732-v8-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v8/default/tokenizer_config.json
chaiml-4d70-fd43-linear-51732-v8-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v8/default/recipe.yaml
chaiml-4d70-fd43-linear-51732-v8-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v8/default/tokenizer.json
chaiml-4d70-fd43-linear-51732-v8-uploader: cp /dev/shm/model_output/model-00003-of-00003.safetensors s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v8/default/model-00003-of-00003.safetensors
chaiml-4d70-fd43-linear-51732-v8-uploader: cp /dev/shm/model_output/model-00002-of-00003.safetensors s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v8/default/model-00002-of-00003.safetensors
chaiml-4d70-fd43-linear-51732-v8-uploader: cp /dev/shm/model_output/model-00001-of-00003.safetensors s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v8/default/model-00001-of-00003.safetensors
Job chaiml-4d70-fd43-linear-51732-v8-uploader completed after 64.96s with status: succeeded
Stopping job with name chaiml-4d70-fd43-linear-51732-v8-uploader
Pipeline stage VLLMUploader completed in 65.52s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 2.15s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-4d70-fd43-linear-51732-v8
Waiting for inference service chaiml-4d70-fd43-linear-51732-v8 to be ready
Inference service chaiml-4d70-fd43-linear-51732-v8 ready after 161.22405767440796s
Pipeline stage VLLMDeployer completed in 161.76s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.660017251968384s
Received healthy response to inference request in 1.8565449714660645s
Received healthy response to inference request in 2.526899576187134s
Received healthy response to inference request in 2.5212655067443848s
Received healthy response to inference request in 1.910001277923584s
Received healthy response to inference request in 2.231109380722046s
Received healthy response to inference request in 1.9911246299743652s
Received healthy response to inference request in 1.8203372955322266s
Received healthy response to inference request in 2.0445878505706787s
Received healthy response to inference request in 2.235293388366699s
Received healthy response to inference request in 2.18597412109375s
Received healthy response to inference request in 1.991612195968628s
Received healthy response to inference request in 2.5033586025238037s
Received healthy response to inference request in 1.9494550228118896s
Received healthy response to inference request in 1.8783729076385498s
Received healthy response to inference request in 1.8259072303771973s
Received healthy response to inference request in 2.2964773178100586s
Received healthy response to inference request in 2.7939748764038086s
Received healthy response to inference request in 2.379620313644409s
Received healthy response to inference request in 2.1344830989837646s
Received healthy response to inference request in 2.0418684482574463s
Received healthy response to inference request in 1.9308340549468994s
Received healthy response to inference request in 2.596479892730713s
Received healthy response to inference request in 2.3743369579315186s
Received healthy response to inference request in 1.9437460899353027s
Received healthy response to inference request in 1.96002197265625s
Received healthy response to inference request in 1.9072420597076416s
Received healthy response to inference request in 2.2049896717071533s
Received healthy response to inference request in 1.836855173110962s
Received healthy response to inference request in 2.196645736694336s
30 requests
0 failed requests
5th percentile: 1.8308338046073913
10th percentile: 1.8545759916305542
20th percentile: 1.9094494342803956
30th percentile: 1.9477423429489136
40th percentile: 1.9914171695709229
50th percentile: 2.0895354747772217
60th percentile: 2.199983310699463
70th percentile: 2.253648567199707
80th percentile: 2.4043679714202884
90th percentile: 2.5338576078414916
95th percentile: 2.631425440311432
99th percentile: 2.7551271653175355
mean time: 2.157647895812988
Pipeline stage StressChecker completed in 68.78s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.72s
Shutdown handler de-registered
chaiml-4d70-fd43-linear_51732_v8 status is now deployed due to DeploymentManager action
chaiml-4d70-fd43-linear_51732_v8 status is now inactive due to system request