Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-muster-v0b-lr1e5-34397-v7-uploader
Waiting for job on chaiml-muster-v0b-lr1e5-34397-v7-uploader to finish
chaiml-muster-v0b-lr1e5-34397-v7-uploader: Using quantization_mode: w4a16
chaiml-muster-v0b-lr1e5-34397-v7-uploader: Checking if ChaiML/muster-v0b-lr1e5ep2r64g4b01-W4A16 already exists in ChaiML
chaiml-muster-v0b-lr1e5-34397-v7-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-muster-v0b-lr1e5-34397-v7-uploader: Downloading snapshot of ChaiML/muster-v0b-lr1e5ep2r64g4b01-W4A16...
chaiml-muster-v0b-lr1e5-34397-v7-uploader: creating bucket guanaco-vllm-models
chaiml-muster-v0b-lr1e5-34397-v7-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v0b-lr1e5-34397-v7-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-muster-v0b-lr1e5-34397-v7-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-muster-v0b-lr1e5-34397-v7-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-muster-v0b-lr1e5-34397-v7-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v0b-lr1e5-34397-v7-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-muster-v0b-lr1e5-34397-v7-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v0b-lr1e5-34397-v7-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-muster-v0b-lr1e5-34397-v7-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v0b-lr1e5-34397-v7-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-muster-v0b-lr1e5-34397-v7-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v0b-lr1e5-34397-v7-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-muster-v0b-lr1e5-34397-v7-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-muster-v0b-lr1e5-34397-v7-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-muster-v0b-lr1e5-34397-v7-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-muster-v0b-lr1e5-34397-v7-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-muster-v0b-lr1e5-34397-v7-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-muster-v0b-lr1e5-34397-v7-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/added_tokens.json
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/generation_config.json
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/special_tokens_map.json
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/quantization_config.json
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/config.json
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/merges.txt
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/vocab.json
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/tokenizer_config.json
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/chat_template.jinja
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/.gitattributes
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model.safetensors.index.json
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/tokenizer.json
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00027-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00024-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00001-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00013-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00023-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00004-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00012-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00005-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00025-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00018-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00020-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00009-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00003-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00017-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00011-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00019-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00002-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00008-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00021-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00007-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00015-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00026-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00006-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00014-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00022-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00016-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v7-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v7/model-00010-of-00027.safetensors
Job chaiml-muster-v0b-lr1e5-34397-v7-uploader completed after 298.97s with status: succeeded
Stopping job with name chaiml-muster-v0b-lr1e5-34397-v7-uploader
Pipeline stage VLLMUploader completed in 300.03s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.23s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-muster-v0b-lr1e5-34397-v7
Waiting for inference service chaiml-muster-v0b-lr1e5-34397-v7 to be ready
Inference service chaiml-muster-v0b-lr1e5-34397-v7 ready after 578.1560275554657s
Pipeline stage VLLMDeployer completed in 579.20s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.240243434906006s
Received healthy response to inference request in 1.981480598449707s
Received healthy response to inference request in 2.251538038253784s
Received healthy response to inference request in 2.2936742305755615s
Received healthy response to inference request in 2.132277011871338s
Received healthy response to inference request in 2.202261447906494s
Received healthy response to inference request in 2.15164852142334s
Received healthy response to inference request in 2.0449602603912354s
Received healthy response to inference request in 2.1671979427337646s
Received healthy response to inference request in 2.1285579204559326s
Received healthy response to inference request in 2.300609827041626s
Received healthy response to inference request in 2.0005712509155273s
Received healthy response to inference request in 2.035895824432373s
Received healthy response to inference request in 2.0822055339813232s
Received healthy response to inference request in 2.234984874725342s
Received healthy response to inference request in 2.1155190467834473s
Received healthy response to inference request in 2.145479202270508s
Received healthy response to inference request in 2.3340470790863037s
Received healthy response to inference request in 2.312864065170288s
Received healthy response to inference request in 2.0167534351348877s
Received healthy response to inference request in 2.2243402004241943s
Received healthy response to inference request in 2.258474826812744s
Received healthy response to inference request in 2.1667256355285645s
Received healthy response to inference request in 2.1741864681243896s
Received healthy response to inference request in 2.2841641902923584s
Received healthy response to inference request in 2.178541898727417s
Received healthy response to inference request in 2.1253631114959717s
Received healthy response to inference request in 2.0665574073791504s
Received healthy response to inference request in 2.2280681133270264s
Received healthy response to inference request in 2.2778306007385254s
30 requests
0 failed requests
5th percentile: 2.0078532338142394
10th percentile: 2.0339815855026244
20th percentile: 2.0790759086608888
30th percentile: 2.1275994777679443
40th percentile: 2.149180793762207
50th percentile: 2.170692205429077
60th percentile: 2.211092948913574
70th percentile: 2.236562442779541
80th percentile: 2.2623459815979006
90th percentile: 2.294367790222168
95th percentile: 2.3073496580123902
99th percentile: 2.327904005050659
mean time: 2.171900733311971
Pipeline stage StressChecker completed in 72.19s
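Note on the StressChecker summary above: the reported percentiles are consistent with linear interpolation between the sorted response times (for example, the 50th percentile is the midpoint of the 15th and 16th sorted samples). The tool that produced the summary is not shown in this log, so the following is only a minimal sketch under that assumption; the names `percentile` and `summarize` are hypothetical and not taken from the pipeline.

```python
import math


def percentile(values, q):
    """Return the q-th percentile (0 <= q <= 100) using linear interpolation."""
    if not values:
        raise ValueError("values must be non-empty")
    if not 0 <= q <= 100:
        raise ValueError("q must be between 0 and 100")
    ordered = sorted(values)
    # Fractional position of the requested percentile in the sorted sample.
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    # Interpolate between the two neighbouring samples.
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def summarize(latencies, failed=0):
    """Build summary lines shaped like the log's report (hypothetical helper)."""
    lines = [f"{len(latencies)} requests", f"{failed} failed requests"]
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        lines.append(f"{q}th percentile: {percentile(latencies, q)}")
    lines.append(f"mean time: {sum(latencies) / len(latencies)}")
    return lines
```

Passing the 30 response times logged above to `summarize` should give figures that agree with the reported ones up to floating-point rounding.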
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 3.23s
Shutdown handler de-registered
chaiml-muster-v0b-lr1e5_34397_v7 status is now deployed due to DeploymentManager action
chaiml-muster-v0b-lr1e5_34397_v7 status is now inactive due to auto deactivation removed underperforming models
chaiml-muster-v0b-lr1e5_34397_v7 status is now torndown due to DeploymentManager action