Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-muster-v0a-lr1e5-44160-v3-uploader
Waiting for job on chaiml-muster-v0a-lr1e5-44160-v3-uploader to finish
chaiml-muster-v0a-lr1e5-44160-v3-uploader: Using quantization_mode: w4a16
chaiml-muster-v0a-lr1e5-44160-v3-uploader: Checking if ChaiML/muster-v0a-lr1e5ep2r64g4b01-W4A16 already exists in ChaiML
chaiml-muster-v0a-lr1e5-44160-v3-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-muster-v0a-lr1e5-44160-v3-uploader: Downloading snapshot of ChaiML/muster-v0a-lr1e5ep2r64g4b01-W4A16...
chaiml-muster-v0a-lr1e5-44160-v3-uploader:
Fetching 39 files: 0%| | 0/39 [00:00<?, ?it/s]
Fetching 39 files: 3%|▎ | 1/39 [00:00<00:10, 3.64it/s]
Fetching 39 files: 18%|█▊ | 7/39 [00:15<01:11, 2.24s/it]
Fetching 39 files: 21%|██ | 8/39 [00:15<00:58, 1.90s/it]
Fetching 39 files: 28%|██▊ | 11/39 [00:43<02:22, 5.08s/it]
Fetching 39 files: 64%|██████▍ | 25/39 [00:44<00:18, 1.36s/it]
Fetching 39 files: 69%|██████▉ | 27/39 [00:46<00:15, 1.33s/it]
Fetching 39 files: 74%|███████▍ | 29/39 [00:50<00:14, 1.45s/it]
Fetching 39 files: 77%|███████▋ | 30/39 [00:53<00:14, 1.56s/it]
Fetching 39 files: 100%|██████████| 39/39 [00:53<00:00, 1.36s/it]
chaiml-muster-v0a-lr1e5-44160-v3-uploader: Downloaded in 53.219s
chaiml-muster-v0a-lr1e5-44160-v3-uploader: Processed model ChaiML/muster-v0a-lr1e5ep2r64g4b01 in 53.766s
chaiml-muster-v0a-lr1e5-44160-v3-uploader: creating bucket guanaco-vllm-models
chaiml-muster-v0a-lr1e5-44160-v3-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v0a-lr1e5-44160-v3-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-muster-v0a-lr1e5-44160-v3-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-muster-v0a-lr1e5-44160-v3-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-muster-v0a-lr1e5-44160-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v0a-lr1e5-44160-v3-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-muster-v0a-lr1e5-44160-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v0a-lr1e5-44160-v3-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-muster-v0a-lr1e5-44160-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v0a-lr1e5-44160-v3-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-muster-v0a-lr1e5-44160-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v0a-lr1e5-44160-v3-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-muster-v0a-lr1e5-44160-v3-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-muster-v0a-lr1e5-44160-v3-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-muster-v0a-lr1e5-44160-v3-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-muster-v0a-lr1e5-44160-v3-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-muster-v0a-lr1e5-44160-v3-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-muster-v0a-lr1e5-44160-v3-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/added_tokens.json
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/.gitattributes
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/generation_config.json
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/chat_template.jinja
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/quantization_config.json
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/tokenizer_config.json
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/special_tokens_map.json
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/tokenizer.json
HTTP Request: %s %s "%s %d %s"
admin requested tearing down of junhua024-chai-12-full-06-13_v1
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLDeleter
%s, retrying in %s seconds...
%s, retrying in %s seconds...
clean up pipeline due to error=TeardownError('401\nReason: Unauthorized\nHTTP response headers: HTTPHeaderDict({\'Audit-Id\': \'b7117ba3-981c-49c5-a7ff-f46e049727af\', \'Cache-Control\': \'no-cache, private\', \'Content-Type\': \'application/json\', \'Date\': \'Sat, 07 Feb 2026 15:11:43 GMT\', \'Content-Length\': \'129\'})\nHTTP response body: b\'{"kind":"Status","apiVersion":"v1","metadata":{},"status":"Failure","message":"Unauthorized","reason":"Unauthorized","code":401}\\n\'\nOriginal traceback: \n File "/root/miniconda3/envs/guanaco/lib/python3.11/site-packages/kubernetes/dynamic/client.py", line 55, in inner\n resp = func(self, *args, **kwargs)\n ^^^^^^^^^^^^^^^^^^^^^^^^^^^\n\n File "/root/miniconda3/envs/guanaco/lib/python3.11/site-packages/kubernetes/dynamic/client.py", line 273, in request\n api_response = self.client.call_api(\n ^^^^^^^^^^^^^^^^^^^^^\n\n File "/root/miniconda3/envs/guanaco/lib/python3.11/site-packages/kubernetes/client/api_client.py", line 348, in call_api\n return self.__call_api(resource_path, method,\n ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^\n\n File "/root/miniconda3/envs/guanaco/lib/python3.11/site-packages/kubernetes/client/api_client.py", line 180, in __call_api\n response_data = self.request(\n ^^^^^^^^^^^^^\n\n File "/root/miniconda3/envs/guanaco/lib/python3.11/site-packages/kubernetes/client/api_client.py", line 373, in request\n return self.rest_client.GET(url,\n ^^^^^^^^^^^^^^^^^^^^^^^^^\n\n File "/root/miniconda3/envs/guanaco/lib/python3.11/site-packages/kubernetes/client/rest.py", line 244, in GET\n return self.request("GET", url,\n ^^^^^^^^^^^^^^^^^^^^^^^^\n\n File "/root/miniconda3/envs/guanaco/lib/python3.11/site-packages/kubernetes/client/rest.py", line 238, in request\n raise ApiException(http_resp=r)\n')
Shutdown handler de-registered
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG retryable error: ServiceUnavailable: Service Unavailable
chaiml-muster-v0a-lr1e5-44160-v3-uploader: status code: 503, request id: , host id:
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00027-of-00027.safetensors
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00005-of-00027.safetensors
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00016-of-00027.safetensors
HTTP Request: %s %s "%s %d %s"
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00003-of-00027.safetensors
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00012-of-00027.safetensors
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00004-of-00027.safetensors
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00020-of-00027.safetensors
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00025-of-00027.safetensors
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00006-of-00027.safetensors
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00026-of-00027.safetensors
HTTP Request: %s %s "%s %d %s"
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00009-of-00027.safetensors
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00013-of-00027.safetensors
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00017-of-00027.safetensors
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00021-of-00027.safetensors
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00022-of-00027.safetensors
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00011-of-00027.safetensors
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00023-of-00027.safetensors
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00002-of-00027.safetensors
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00019-of-00027.safetensors
chaiml-muster-v0a-lr1e5-44160-v3-uploader: Retry 1/5 exited 1, retrying in 2 seconds...
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/.gitattributes": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/added_tokens.json": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/chat_template.jinja": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/config.json": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/generation_config.json": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/merges.txt": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00002-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00003-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00004-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00005-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00006-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00007-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00008-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00009-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00010-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00011-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00012-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00013-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00014-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00015-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00016-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00017-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00018-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00019-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00020-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00021-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00022-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00023-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00024-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00025-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00026-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00027-of-00027.safetensors": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model.safetensors.index.json": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/quantization_config.json": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/special_tokens_map.json": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/tokenizer.json": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/tokenizer_config.json": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: DEBUG "sync /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/vocab.json": object size matches
chaiml-muster-v0a-lr1e5-44160-v3-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v0a-lr1e5-44160-v3/model-00001-of-00027.safetensors
Job chaiml-muster-v0a-lr1e5-44160-v3-uploader completed after 707.51s with status: succeeded
Stopping job with name chaiml-muster-v0a-lr1e5-44160-v3-uploader
Pipeline stage VLLMUploader completed in 708.14s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.16s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-muster-v0a-lr1e5-44160-v3
Waiting for inference service chaiml-muster-v0a-lr1e5-44160-v3 to be ready
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
Inference service chaiml-muster-v0a-lr1e5-44160-v3 ready after 641.3601086139679s
Pipeline stage VLLMDeployer completed in 644.02s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.1399059295654297s
Received healthy response to inference request in 2.377329111099243s
Received healthy response to inference request in 1.9636781215667725s
Received healthy response to inference request in 2.237375497817993s
Received healthy response to inference request in 2.73374342918396s
Received healthy response to inference request in 2.5970757007598877s
Received healthy response to inference request in 2.394758701324463s
Received healthy response to inference request in 2.3958001136779785s
Received healthy response to inference request in 2.3454525470733643s
Received healthy response to inference request in 2.991279363632202s
Received healthy response to inference request in 2.212552547454834s
Received healthy response to inference request in 2.707141160964966s
Received healthy response to inference request in 2.090121269226074s
Received healthy response to inference request in 2.1722986698150635s
Received healthy response to inference request in 2.4190356731414795s
Received healthy response to inference request in 2.1691248416900635s
Received healthy response to inference request in 2.2144229412078857s
Received healthy response to inference request in 2.063417434692383s
Received healthy response to inference request in 2.5990185737609863s
Received healthy response to inference request in 2.6693379878997803s
Received healthy response to inference request in 2.3448305130004883s
Received healthy response to inference request in 1.935899257659912s
Received healthy response to inference request in 2.193354845046997s
Received healthy response to inference request in 2.550429344177246s
Received healthy response to inference request in 2.8517136573791504s
Received healthy response to inference request in 2.280745506286621s
Received healthy response to inference request in 2.400494337081909s
Received healthy response to inference request in 2.390650510787964s
Received healthy response to inference request in 2.2081024646759033s
Received healthy response to inference request in 2.7701783180236816s
30 requests
0 failed requests
5th percentile: 2.008560812473297
10th percentile: 2.087450885772705
20th percentile: 2.1716639041900634
30th percentile: 2.211217522621155
40th percentile: 2.26339750289917
50th percentile: 2.3613908290863037
HTTP Request: %s %s "%s %d %s"
60th percentile: 2.395175266265869
70th percentile: 2.458453774452209
80th percentile: 2.6130824565887454
90th percentile: 2.737386918067932
95th percentile: 2.8150227546691893
99th percentile: 2.950805308818817
mean time: 2.380642278989156
Pipeline stage StressChecker completed in 107.43s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.83s
Shutdown handler de-registered
chaiml-muster-v0a-lr1e5_44160_v3 status is now deployed due to DeploymentManager action
chaiml-muster-v0a-lr1e5_44160_v3 status is now inactive due to auto deactivation removed underperforming models
chaiml-muster-v0a-lr1e5_44160_v3 status is now torndown due to DeploymentManager action