Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-2a6f-69d4-linear-43777-v4-uploader
Waiting for job on chaiml-2a6f-69d4-linear-43777-v4-uploader to finish
chaiml-2a6f-69d4-linear-43777-v4-uploader: Using quantization_mode: none
chaiml-2a6f-69d4-linear-43777-v4-uploader: Downloading snapshot of ChaiML/2a6f-69d4-linear-w01-FP8...
chaiml-2a6f-69d4-linear-43777-v4-uploader:
Fetching 15 files: 0%| | 0/15 [00:00<?, ?it/s]
Fetching 15 files: 7%|▋ | 1/15 [00:00<00:03, 3.59it/s]
Fetching 15 files: 33%|███▎ | 5/15 [00:12<00:27, 2.75s/it]
Fetching 15 files: 100%|██████████| 15/15 [00:12<00:00, 1.15it/s]
chaiml-2a6f-69d4-linear-43777-v4-uploader: Downloaded in 13.115s
chaiml-2a6f-69d4-linear-43777-v4-uploader: Processed model ChaiML/2a6f-69d4-linear-w01-FP8 in 22.370s
chaiml-2a6f-69d4-linear-43777-v4-uploader: creating bucket guanaco-vllm-models
chaiml-2a6f-69d4-linear-43777-v4-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-43777-v4-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-2a6f-69d4-linear-43777-v4-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-2a6f-69d4-linear-43777-v4-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-2a6f-69d4-linear-43777-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-43777-v4-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-2a6f-69d4-linear-43777-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-43777-v4-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-2a6f-69d4-linear-43777-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-43777-v4-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-2a6f-69d4-linear-43777-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-43777-v4-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-2a6f-69d4-linear-43777-v4-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-2a6f-69d4-linear-43777-v4-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-2a6f-69d4-linear-43777-v4-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-2a6f-69d4-linear-43777-v4-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-2a6f-69d4-linear-43777-v4-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-2a6f-69d4-linear-43777-v4-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v4
chaiml-2a6f-69d4-linear-43777-v4-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v4/.gitattributes
chaiml-2a6f-69d4-linear-43777-v4-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v4/config.json
chaiml-2a6f-69d4-linear-43777-v4-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v4/chat_template.jinja
chaiml-2a6f-69d4-linear-43777-v4-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v4/special_tokens_map.json
chaiml-2a6f-69d4-linear-43777-v4-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v4/generation_config.json
chaiml-2a6f-69d4-linear-43777-v4-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v4/recipe.yaml
chaiml-2a6f-69d4-linear-43777-v4-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v4/tokenizer_config.json
chaiml-2a6f-69d4-linear-43777-v4-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v4/model.safetensors.index.json
chaiml-2a6f-69d4-linear-43777-v4-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v4/tokenizer.json
chaiml-2a6f-69d4-linear-43777-v4-uploader: cp /dev/shm/model_output/model-00006-of-00006.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v4/model-00006-of-00006.safetensors
chaiml-2a6f-69d4-linear-43777-v4-uploader: cp /dev/shm/model_output/model-00005-of-00006.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v4/model-00005-of-00006.safetensors
chaiml-2a6f-69d4-linear-43777-v4-uploader: cp /dev/shm/model_output/model-00003-of-00006.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v4/model-00003-of-00006.safetensors
chaiml-2a6f-69d4-linear-43777-v4-uploader: cp /dev/shm/model_output/model-00002-of-00006.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v4/model-00002-of-00006.safetensors
chaiml-2a6f-69d4-linear-43777-v4-uploader: cp /dev/shm/model_output/model-00004-of-00006.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v4/model-00004-of-00006.safetensors
chaiml-2a6f-69d4-linear-43777-v4-uploader: cp /dev/shm/model_output/model-00001-of-00006.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-43777-v4/model-00001-of-00006.safetensors
Job chaiml-2a6f-69d4-linear-43777-v4-uploader completed after 267.85s with status: succeeded
Stopping job with name chaiml-2a6f-69d4-linear-43777-v4-uploader
Pipeline stage VLLMUploader completed in 268.67s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.63s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-2a6f-69d4-linear-43777-v4
Waiting for inference service chaiml-2a6f-69d4-linear-43777-v4 to be ready
Inference service chaiml-2a6f-69d4-linear-43777-v4 ready after 362.9772582054138s
Pipeline stage VLLMDeployer completed in 363.59s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.276535987854004s
Received healthy response to inference request in 2.8137173652648926s
Received healthy response to inference request in 2.965681314468384s
Received healthy response to inference request in 3.010977268218994s
Received healthy response to inference request in 2.6691813468933105s
Received healthy response to inference request in 2.9438867568969727s
Received healthy response to inference request in 2.8102986812591553s
Received healthy response to inference request in 2.8226418495178223s
Received healthy response to inference request in 2.863598346710205s
Received healthy response to inference request in 3.1580846309661865s
Received healthy response to inference request in 3.2567951679229736s
Received healthy response to inference request in 3.064354419708252s
Received healthy response to inference request in 2.6999168395996094s
Received healthy response to inference request in 3.7263636589050293s
Received healthy response to inference request in 2.6796388626098633s
Received healthy response to inference request in 2.68963360786438s
Received healthy response to inference request in 3.2562341690063477s
Received healthy response to inference request in 2.77470326423645s
Received healthy response to inference request in 2.7726263999938965s
Received healthy response to inference request in 4.081495046615601s
Received healthy response to inference request in 2.929225444793701s
Received healthy response to inference request in 3.2185299396514893s
Received healthy response to inference request in 2.89992356300354s
Received healthy response to inference request in 2.811617136001587s
Received healthy response to inference request in 3.1391823291778564s
Received healthy response to inference request in 2.7078635692596436s
Received healthy response to inference request in 3.503521680831909s
Received healthy response to inference request in 2.7258899211883545s
Received healthy response to inference request in 2.790761947631836s
Received healthy response to inference request in 2.8748257160186768s
30 requests
0 failed requests
5th percentile: 2.6841364979743956
10th percentile: 2.6988885164260865
20th percentile: 2.763279104232788
30th percentile: 2.8044376611709594
40th percentile: 2.81907205581665
50th percentile: 2.8873746395111084
60th percentile: 2.952604579925537
70th percentile: 3.086802792549133
80th percentile: 3.226070785522461
90th percentile: 3.2992345571517947
95th percentile: 3.6260847687721247
99th percentile: 3.9785069441795353
mean time: 2.9979235410690306
Pipeline stage StressChecker completed in 95.39s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.75s
Shutdown handler de-registered
chaiml-2a6f-69d4-linear_43777_v4 status is now deployed due to DeploymentManager action
chaiml-2a6f-69d4-linear_43777_v4 status is now inactive due to system request
chaiml-2a6f-69d4-linear_43777_v4 status is now inactive due to auto deactivation removed underperforming models