Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-2a6f-69d4-linear-w01-v28-uploader
Waiting for job on chaiml-2a6f-69d4-linear-w01-v28-uploader to finish
HTTP Request: %s %s "%s %d %s"
chaiml-2a6f-69d4-linear-w01-v28-uploader: Using quantization_mode: none
chaiml-2a6f-69d4-linear-w01-v28-uploader: Downloading snapshot of ChaiML/2a6f-69d4-linear-w01...
chaiml-2a6f-69d4-linear-w01-v28-uploader: Fetching 19 files: 100%|██████████| 19/19 [00:20<00:00, 1.06s/it]
chaiml-2a6f-69d4-linear-w01-v28-uploader: Downloaded in 20.224s
chaiml-2a6f-69d4-linear-w01-v28-uploader: Processed model ChaiML/2a6f-69d4-linear-w01 in 37.666s
chaiml-2a6f-69d4-linear-w01-v28-uploader: creating bucket guanaco-vllm-models
chaiml-2a6f-69d4-linear-w01-v28-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-w01-v28-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-2a6f-69d4-linear-w01-v28-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-2a6f-69d4-linear-w01-v28-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-2a6f-69d4-linear-w01-v28-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-w01-v28-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-2a6f-69d4-linear-w01-v28-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-w01-v28-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-2a6f-69d4-linear-w01-v28-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-w01-v28-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-2a6f-69d4-linear-w01-v28-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-2a6f-69d4-linear-w01-v28-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-2a6f-69d4-linear-w01-v28-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-2a6f-69d4-linear-w01-v28-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-2a6f-69d4-linear-w01-v28-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-2a6f-69d4-linear-w01-v28-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-2a6f-69d4-linear-w01-v28-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-2a6f-69d4-linear-w01-v28-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28
chaiml-2a6f-69d4-linear-w01-v28-uploader: cp /dev/shm/model_output/mergekit_config.yml s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28/mergekit_config.yml
chaiml-2a6f-69d4-linear-w01-v28-uploader: cp /dev/shm/model_output/mergekit_config.yaml s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28/mergekit_config.yaml
chaiml-2a6f-69d4-linear-w01-v28-uploader: cp /dev/shm/model_output/README.md s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28/README.md
chaiml-2a6f-69d4-linear-w01-v28-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28/config.json
chaiml-2a6f-69d4-linear-w01-v28-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28/special_tokens_map.json
chaiml-2a6f-69d4-linear-w01-v28-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28/.gitattributes
chaiml-2a6f-69d4-linear-w01-v28-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28/model.safetensors.index.json
chaiml-2a6f-69d4-linear-w01-v28-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28/tokenizer_config.json
chaiml-2a6f-69d4-linear-w01-v28-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28/tokenizer.json
HTTP Request: %s %s "%s %d %s"
chaiml-2a6f-69d4-linear-w01-v28-uploader: cp /dev/shm/model_output/model-00010-of-00010.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28/model-00010-of-00010.safetensors
chaiml-2a6f-69d4-linear-w01-v28-uploader: cp /dev/shm/model_output/model-00002-of-00010.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28/model-00002-of-00010.safetensors
chaiml-2a6f-69d4-linear-w01-v28-uploader: cp /dev/shm/model_output/model-00007-of-00010.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28/model-00007-of-00010.safetensors
chaiml-2a6f-69d4-linear-w01-v28-uploader: cp /dev/shm/model_output/model-00004-of-00010.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28/model-00004-of-00010.safetensors
chaiml-2a6f-69d4-linear-w01-v28-uploader: cp /dev/shm/model_output/model-00009-of-00010.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28/model-00009-of-00010.safetensors
chaiml-2a6f-69d4-linear-w01-v28-uploader: cp /dev/shm/model_output/model-00006-of-00010.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28/model-00006-of-00010.safetensors
chaiml-2a6f-69d4-linear-w01-v28-uploader: cp /dev/shm/model_output/model-00003-of-00010.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28/model-00003-of-00010.safetensors
chaiml-2a6f-69d4-linear-w01-v28-uploader: cp /dev/shm/model_output/model-00001-of-00010.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28/model-00001-of-00010.safetensors
chaiml-2a6f-69d4-linear-w01-v28-uploader: cp /dev/shm/model_output/model-00005-of-00010.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28/model-00005-of-00010.safetensors
chaiml-2a6f-69d4-linear-w01-v28-uploader: cp /dev/shm/model_output/model-00008-of-00010.safetensors s3://guanaco-vllm-models/chaiml-2a6f-69d4-linear-w01-v28/model-00008-of-00010.safetensors
Job chaiml-2a6f-69d4-linear-w01-v28-uploader completed after 287.75s with status: succeeded
Stopping job with name chaiml-2a6f-69d4-linear-w01-v28-uploader
Pipeline stage VLLMUploader completed in 288.32s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.15s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-2a6f-69d4-linear-w01-v28
Waiting for inference service chaiml-2a6f-69d4-linear-w01-v28 to be ready
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
Inference service chaiml-2a6f-69d4-linear-w01-v28 ready after 684.8319320678711s
Pipeline stage VLLMDeployer completed in 685.49s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2538692951202393s
Received healthy response to inference request in 2.207192897796631s
Received healthy response to inference request in 2.5314483642578125s
Received healthy response to inference request in 2.549628496170044s
Received healthy response to inference request in 2.4372916221618652s
Received healthy response to inference request in 2.555893898010254s
Received healthy response to inference request in 2.7899413108825684s
Received healthy response to inference request in 2.386061906814575s
Received healthy response to inference request in 2.3433735370635986s
Received healthy response to inference request in 2.3732640743255615s
Received healthy response to inference request in 2.336461305618286s
Received healthy response to inference request in 2.198580741882324s
Received healthy response to inference request in 2.231600284576416s
Received healthy response to inference request in 2.1953744888305664s
Received healthy response to inference request in 2.187279224395752s
Received healthy response to inference request in 2.37518310546875s
Received healthy response to inference request in 2.2440710067749023s
Received healthy response to inference request in 2.430142879486084s
Received healthy response to inference request in 2.6993825435638428s
Received healthy response to inference request in 2.2235329151153564s
Received healthy response to inference request in 2.378814935684204s
Received healthy response to inference request in 2.493690252304077s
Received healthy response to inference request in 2.770907402038574s
Received healthy response to inference request in 2.41083025932312s
Received healthy response to inference request in 2.5312271118164062s
Received healthy response to inference request in 2.375185966491699s
Received healthy response to inference request in 2.2378454208374023s
Received healthy response to inference request in 2.4525341987609863s
Received healthy response to inference request in 2.3585641384124756s
Received healthy response to inference request in 2.245474338531494s
30 requests
0 failed requests
5th percentile: 2.1968173027038573
10th percentile: 2.2063316822052004
20th percentile: 2.236596393585205
30th percentile: 2.2513508081436155
40th percentile: 2.352487897872925
50th percentile: 2.3751845359802246
60th percentile: 2.3959692478179933
70th percentile: 2.4418643951416015
80th percentile: 2.5312713623046874
90th percentile: 2.570242762565613
95th percentile: 2.738721215724945
99th percentile: 2.78442147731781
mean time: 2.3934882640838624
Pipeline stage StressChecker completed in 75.41s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.63s
Shutdown handler de-registered
chaiml-2a6f-69d4-linear-w01_v28 status is now deployed due to DeploymentManager action
chaiml-2a6f-69d4-linear-w01_v28 status is now inactive due to auto deactivation removed underperforming models
chaiml-2a6f-69d4-linear-w01_v28 status is now torndown due to DeploymentManager action