Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-muster-v0b-lr1e5-34397-v6-uploader
Waiting for job on chaiml-muster-v0b-lr1e5-34397-v6-uploader to finish
chaiml-muster-v0b-lr1e5-34397-v6-uploader: Using quantization_mode: w4a16
chaiml-muster-v0b-lr1e5-34397-v6-uploader: Checking if ChaiML/muster-v0b-lr1e5ep2r64g4b01-W4A16 already exists in ChaiML
chaiml-muster-v0b-lr1e5-34397-v6-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-muster-v0b-lr1e5-34397-v6-uploader: Downloading snapshot of ChaiML/muster-v0b-lr1e5ep2r64g4b01-W4A16...
chaiml-muster-v0b-lr1e5-34397-v6-uploader:
Fetching 39 files: 0%| | 0/39 [00:00<?, ?it/s]
Fetching 39 files: 3%|▎ | 1/39 [00:00<00:09, 4.09it/s]
Fetching 39 files: 18%|█▊ | 7/39 [00:12<01:01, 1.92s/it]
Fetching 39 files: 21%|██ | 8/39 [00:15<01:05, 2.11s/it]
Fetching 39 files: 23%|██▎ | 9/39 [00:16<00:52, 1.76s/it]
Fetching 39 files: 38%|███▊ | 15/39 [00:27<00:42, 1.78s/it]
Fetching 39 files: 44%|████▎ | 17/39 [00:27<00:31, 1.45s/it]
Fetching 39 files: 49%|████▊ | 19/39 [00:30<00:27, 1.40s/it]
Fetching 39 files: 51%|█████▏ | 20/39 [00:31<00:24, 1.31s/it]
Fetching 39 files: 56%|█████▋ | 22/39 [00:31<00:16, 1.04it/s]
Fetching 39 files: 59%|█████▉ | 23/39 [00:40<00:37, 2.36s/it]
Fetching 39 files: 62%|██████▏ | 24/39 [00:41<00:32, 2.16s/it]
Fetching 39 files: 64%|██████▍ | 25/39 [00:41<00:23, 1.71s/it]
Fetching 39 files: 67%|██████▋ | 26/39 [00:42<00:17, 1.37s/it]
Fetching 39 files: 69%|██████▉ | 27/39 [00:44<00:20, 1.70s/it]
Fetching 39 files: 72%|███████▏ | 28/39 [00:45<00:15, 1.42s/it]
Fetching 39 files: 74%|███████▍ | 29/39 [00:45<00:11, 1.14s/it]
Fetching 39 files: 79%|███████▉ | 31/39 [00:47<00:08, 1.02s/it]
Fetching 39 files: 82%|████████▏ | 32/39 [00:48<00:07, 1.04s/it]
Fetching 39 files: 100%|██████████| 39/39 [00:48<00:00, 1.25s/it]
chaiml-muster-v0b-lr1e5-34397-v6-uploader: Downloaded in 48.649s
chaiml-muster-v0b-lr1e5-34397-v6-uploader: Processed model ChaiML/muster-v0b-lr1e5ep2r64g4b01 in 49.181s
chaiml-muster-v0b-lr1e5-34397-v6-uploader: creating bucket guanaco-vllm-models
chaiml-muster-v0b-lr1e5-34397-v6-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v0b-lr1e5-34397-v6-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-muster-v0b-lr1e5-34397-v6-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-muster-v0b-lr1e5-34397-v6-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-muster-v0b-lr1e5-34397-v6-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v0b-lr1e5-34397-v6-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-muster-v0b-lr1e5-34397-v6-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v0b-lr1e5-34397-v6-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-muster-v0b-lr1e5-34397-v6-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v0b-lr1e5-34397-v6-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-muster-v0b-lr1e5-34397-v6-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v0b-lr1e5-34397-v6-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-muster-v0b-lr1e5-34397-v6-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-muster-v0b-lr1e5-34397-v6-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-muster-v0b-lr1e5-34397-v6-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-muster-v0b-lr1e5-34397-v6-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-muster-v0b-lr1e5-34397-v6-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-muster-v0b-lr1e5-34397-v6-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/added_tokens.json
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/chat_template.jinja
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/generation_config.json
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/.gitattributes
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/special_tokens_map.json
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/tokenizer_config.json
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/config.json
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/quantization_config.json
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/merges.txt
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/vocab.json
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model.safetensors.index.json
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/tokenizer.json
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00027-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00016-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00023-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00009-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00013-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00007-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00008-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00005-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00018-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00019-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00002-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00026-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00014-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00022-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00011-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00006-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00003-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00012-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00024-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00021-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00025-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00015-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00004-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00001-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00010-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00017-of-00027.safetensors
chaiml-muster-v0b-lr1e5-34397-v6-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0b-lr1e5-34397-v6/model-00020-of-00027.safetensors
Job chaiml-muster-v0b-lr1e5-34397-v6-uploader completed after 280.09s with status: succeeded
Stopping job with name chaiml-muster-v0b-lr1e5-34397-v6-uploader
Pipeline stage VLLMUploader completed in 281.06s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.36s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-muster-v0b-lr1e5-34397-v6
Waiting for inference service chaiml-muster-v0b-lr1e5-34397-v6 to be ready
Inference service chaiml-muster-v0b-lr1e5-34397-v6 ready after 1174.8808763027191s
Pipeline stage VLLMDeployer completed in 1176.03s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.379784345626831s
Received healthy response to inference request in 2.117305278778076s
Received healthy response to inference request in 2.0158066749572754s
Received healthy response to inference request in 2.0384788513183594s
Received healthy response to inference request in 2.109189987182617s
Received healthy response to inference request in 2.0064361095428467s
Received healthy response to inference request in 2.1642837524414062s
Received healthy response to inference request in 2.2162346839904785s
Received healthy response to inference request in 2.1305975914001465s
Received healthy response to inference request in 2.189108371734619s
Received healthy response to inference request in 2.3334763050079346s
Received healthy response to inference request in 2.2975034713745117s
Received healthy response to inference request in 2.3073649406433105s
Received healthy response to inference request in 1.9988741874694824s
Received healthy response to inference request in 2.0394651889801025s
Received healthy response to inference request in 2.0201404094696045s
Received healthy response to inference request in 2.2841813564300537s
Received healthy response to inference request in 2.0617661476135254s
Received healthy response to inference request in 2.103010892868042s
Received healthy response to inference request in 2.0291032791137695s
Received healthy response to inference request in 2.6103997230529785s
Received healthy response to inference request in 2.007807970046997s
Received healthy response to inference request in 2.3379924297332764s
Received healthy response to inference request in 2.4824814796447754s
Received healthy response to inference request in 2.4569473266601562s
Received healthy response to inference request in 2.4280407428741455s
Received healthy response to inference request in 2.008065938949585s
Received healthy response to inference request in 2.3251261711120605s
Received healthy response to inference request in 2.1715140342712402s
Received healthy response to inference request in 2.031700849533081s
30 requests
0 failed requests
5th percentile: 2.0070534467697145
10th percentile: 2.0080401420593263
20th percentile: 2.0273107051849366
30th percentile: 2.0391692876815797
40th percentile: 2.106718349456787
50th percentile: 2.1474406719207764
60th percentile: 2.199958896636963
70th percentile: 2.3004619121551513
80th percentile: 2.334379529953003
90th percentile: 2.4309314012527468
95th percentile: 2.470991110801697
99th percentile: 2.5733034324646
mean time: 2.190072949727376
Pipeline stage StressChecker completed in 72.68s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.29s
Shutdown handler de-registered
chaiml-muster-v0b-lr1e5_34397_v6 status is now deployed due to DeploymentManager action
chaiml-muster-v0b-lr1e5_34397_v6 status is now inactive due to auto deactivation removed underperforming models
chaiml-muster-v0b-lr1e5_34397_v6 status is now torndown due to DeploymentManager action