Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-2a6f-69d4-linear-w01-v25-uploader
Waiting for job on chaiml-2a6f-69d4-linear-w01-v25-uploader to finish
chaiml-2a6f-69d4-linear-w01-v25-uploader: Using quantization_mode: none
chaiml-2a6f-69d4-linear-w01-v25-uploader: Downloading snapshot of ChaiML/2a6f-69d4-linear-w01...
chaiml-2a6f-69d4-linear-w01-v25-uploader:
Fetching 19 files: 0%| | 0/19 [00:00<?, ?it/s]
Fetching 19 files: 5%|▌ | 1/19 [00:00<00:06, 2.67it/s]
Fetching 19 files: 32%|███▏ | 6/19 [00:12<00:27, 2.13s/it]
Fetching 19 files: 37%|███▋ | 7/19 [00:14<00:25, 2.12s/it]
Fetching 19 files: 47%|████▋ | 9/19 [00:17<00:19, 1.94s/it]
Fetching 19 files: 53%|█████▎ | 10/19 [00:18<00:14, 1.63s/it]
Fetching 19 files: 74%|███████▎ | 14/19 [00:20<00:05, 1.08s/it]
Fetching 19 files: 79%|███████▉ | 15/19 [00:20<00:03, 1.07it/s]
Fetching 19 files: 100%|██████████| 19/19 [00:20<00:00, 1.09s/it]
chaiml-2a6f-69d4-linear-w01-v25-uploader: Downloaded in 20.769s
chaiml-2a6f-69d4-linear-w01-v25-uploader: Processed model ChaiML/2a6f-69d4-linear-w01 in 37.806s
chaiml-2a6f-69d4-linear-w01-v25-uploader: creating bucket guanaco-vllm-models
chaiml-2a6f-69d4-linear-w01-v25-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-w01-v25-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-2a6f-69d4-linear-w01-v25-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-2a6f-69d4-linear-w01-v25-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-2a6f-69d4-linear-w01-v25-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-w01-v25-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-2a6f-69d4-linear-w01-v25-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-w01-v25-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-2a6f-69d4-linear-w01-v25-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-w01-v25-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-2a6f-69d4-linear-w01-v25-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-w01-v25-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-2a6f-69d4-linear-w01-v25-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-2a6f-69d4-linear-w01-v25-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-2a6f-69d4-linear-w01-v25-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-2a6f-69d4-linear-w01-v25-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-2a6f-69d4-linear-w01-v25-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-2a6f-69d4-linear-w01-v25-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v25
chaiml-2a6f-69d4-linear-w01-v25-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v25/.gitattributes
chaiml-2a6f-69d4-linear-w01-v25-uploader: cp /dev/shm/model_output/mergekit_config.yaml s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v25/mergekit_config.yaml
chaiml-2a6f-69d4-linear-w01-v25-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v25/config.json
chaiml-2a6f-69d4-linear-w01-v25-uploader: cp /dev/shm/model_output/README.md s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v25/README.md
chaiml-2a6f-69d4-linear-w01-v25-uploader: cp /dev/shm/model_output/mergekit_config.yml s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v25/mergekit_config.yml
chaiml-2a6f-69d4-linear-w01-v25-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v25/model.safetensors.index.json
chaiml-2a6f-69d4-linear-w01-v25-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v25/special_tokens_map.json
chaiml-2a6f-69d4-linear-w01-v25-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v25/tokenizer_config.json
chaiml-2a6f-69d4-linear-w01-v25-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v25/tokenizer.json
chaiml-2a6f-69d4-linear-w01-v25-uploader: cp /dev/shm/model_output/model-00010-of-00010.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v25/model-00010-of-00010.safetensors
chaiml-2a6f-69d4-linear-w01-v25-uploader: cp /dev/shm/model_output/model-00009-of-00010.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v25/model-00009-of-00010.safetensors
chaiml-2a6f-69d4-linear-w01-v25-uploader: cp /dev/shm/model_output/model-00005-of-00010.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v25/model-00005-of-00010.safetensors
chaiml-2a6f-69d4-linear-w01-v25-uploader: cp /dev/shm/model_output/model-00006-of-00010.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v25/model-00006-of-00010.safetensors
chaiml-2a6f-69d4-linear-w01-v25-uploader: cp /dev/shm/model_output/model-00004-of-00010.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v25/model-00004-of-00010.safetensors
chaiml-2a6f-69d4-linear-w01-v25-uploader: cp /dev/shm/model_output/model-00001-of-00010.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v25/model-00001-of-00010.safetensors
chaiml-2a6f-69d4-linear-w01-v25-uploader: cp /dev/shm/model_output/model-00007-of-00010.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v25/model-00007-of-00010.safetensors
chaiml-2a6f-69d4-linear-w01-v25-uploader: cp /dev/shm/model_output/model-00002-of-00010.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v25/model-00002-of-00010.safetensors
chaiml-2a6f-69d4-linear-w01-v25-uploader: cp /dev/shm/model_output/model-00003-of-00010.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v25/model-00003-of-00010.safetensors
Job chaiml-2a6f-69d4-linear-w01-v25-uploader completed after 255.82s with status: succeeded
Stopping job with name chaiml-2a6f-69d4-linear-w01-v25-uploader
Pipeline stage VLLMUploader completed in 260.93s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.64s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-2a6f-69d4-linear-w01-v25
Waiting for inference service chaiml-2a6f-69d4-linear-w01-v25 to be ready
Inference service chaiml-2a6f-69d4-linear-w01-v25 ready after 190.96467804908752s
Pipeline stage VLLMDeployer completed in 194.77s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2966997623443604s
Received healthy response to inference request in 2.1978261470794678s
Received healthy response to inference request in 2.3931398391723633s
Received healthy response to inference request in 2.4060847759246826s
Received healthy response to inference request in 2.174247980117798s
Received healthy response to inference request in 2.3338241577148438s
Received healthy response to inference request in 2.201364278793335s
Received healthy response to inference request in 2.5492630004882812s
Received healthy response to inference request in 2.3849244117736816s
Received healthy response to inference request in 2.5557219982147217s
Received healthy response to inference request in 2.6358678340911865s
Received healthy response to inference request in 2.543551445007324s
Received healthy response to inference request in 2.3736941814422607s
Received healthy response to inference request in 2.635648488998413s
Received healthy response to inference request in 2.349846839904785s
Received healthy response to inference request in 2.2135062217712402s
Received healthy response to inference request in 2.8090696334838867s
Received healthy response to inference request in 2.513960361480713s
Received healthy response to inference request in 2.5449113845825195s
Received healthy response to inference request in 2.378998279571533s
Received healthy response to inference request in 2.210212230682373s
Received healthy response to inference request in 2.246051549911499s
Received healthy response to inference request in 2.462745428085327s
Received healthy response to inference request in 2.535752773284912s
Received healthy response to inference request in 2.1929092407226562s
Received healthy response to inference request in 2.632505416870117s
Received healthy response to inference request in 2.9301886558532715s
Received healthy response to inference request in 2.874152183532715s
Received healthy response to inference request in 2.3433072566986084s
Received healthy response to inference request in 2.500419855117798s
30 requests
0 failed requests
5th percentile: 2.1951218485832213
10th percentile: 2.2010104656219482
20th percentile: 2.2395424842834473
30th percentile: 2.340462327003479
40th percentile: 2.376876640319824
50th percentile: 2.399612307548523
60th percentile: 2.5058360576629637
70th percentile: 2.5439594268798826
80th percentile: 2.571078681945801
90th percentile: 2.653188014030457
95th percentile: 2.844865036010742
99th percentile: 2.91393807888031
mean time: 2.4473465204238893
Pipeline stage StressChecker completed in 84.66s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 7.42s
Shutdown handler de-registered
chaiml-2a6f-69d4-linear-w01_v25 status is now deployed due to DeploymentManager action
chaiml-2a6f-69d4-linear-w01_v25 status is now inactive due to auto deactivation removed underperforming models
chaiml-2a6f-69d4-linear-w01_v25 status is now torndown due to DeploymentManager action