Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name intel-glm-5-int4-mixed-13556-v3-uploader
Waiting for job on intel-glm-5-int4-mixed-13556-v3-uploader to finish
2026-03-25T17:50:42.928787+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
intel-glm-5-int4-mixed-13556-v3-uploader: Using quantization_mode: none
intel-glm-5-int4-mixed-13556-v3-uploader: Downloading snapshot of Intel/GLM-5-int4-mixed-AutoRound...
2026-03-25T17:51:43.156992+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
2026-03-25T17:52:43.373939+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
intel-glm-5-int4-mixed-13556-v3-uploader: Downloaded in 155.769s
2026-03-25T17:53:43.585256+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
2026-03-25T17:54:43.785149+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
2026-03-25T17:55:44.253761+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
2026-03-25T17:56:44.466205+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
2026-03-25T17:57:44.684610+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
intel-glm-5-int4-mixed-13556-v3-uploader: Processed model Intel/GLM-5-int4-mixed-AutoRound in 428.260s
intel-glm-5-int4-mixed-13556-v3-uploader: creating bucket guanaco-vllm-models
intel-glm-5-int4-mixed-13556-v3-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
intel-glm-5-int4-mixed-13556-v3-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
intel-glm-5-int4-mixed-13556-v3-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
intel-glm-5-int4-mixed-13556-v3-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
intel-glm-5-int4-mixed-13556-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
intel-glm-5-int4-mixed-13556-v3-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
intel-glm-5-int4-mixed-13556-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
intel-glm-5-int4-mixed-13556-v3-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
intel-glm-5-int4-mixed-13556-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
intel-glm-5-int4-mixed-13556-v3-uploader: if re.search("-\.", bucket, re.UNICODE):
intel-glm-5-int4-mixed-13556-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
intel-glm-5-int4-mixed-13556-v3-uploader: if re.search("\.\.", bucket, re.UNICODE):
intel-glm-5-int4-mixed-13556-v3-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
intel-glm-5-int4-mixed-13556-v3-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
intel-glm-5-int4-mixed-13556-v3-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
intel-glm-5-int4-mixed-13556-v3-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
intel-glm-5-int4-mixed-13556-v3-uploader: Bucket 's3://guanaco-vllm-models/' created
intel-glm-5-int4-mixed-13556-v3-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/README.md s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/README.md
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/config.json
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/.gitattributes
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/generation_config.json
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/chat_template.jinja
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/tokenizer_config.json
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/quantization_config.json
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00081-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00081-of-00081.safetensors
2026-03-25T17:58:44.885304+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00061-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00061-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00011-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00011-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00059-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00059-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00004-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00004-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00006-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00006-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00056-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00056-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00030-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00030-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00020-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00020-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00066-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00066-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00043-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00043-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00051-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00051-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00007-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00007-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00052-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00052-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00067-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00067-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00009-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00009-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00024-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00024-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00015-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00015-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00029-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00029-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00032-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00032-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00068-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00068-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00057-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00057-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00041-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00041-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00044-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00044-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00019-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00019-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00047-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00047-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00034-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00034-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00026-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00026-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00046-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00046-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00049-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00049-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00008-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00008-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00037-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00037-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00021-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00021-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00013-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00013-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00042-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00042-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00073-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00073-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00018-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00018-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00023-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00023-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00070-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00070-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00025-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00025-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00048-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00048-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00038-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00038-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00065-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00065-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00058-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00058-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00060-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00060-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00010-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00010-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00079-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00079-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00001-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00001-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00072-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00072-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00027-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00027-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00055-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00055-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00069-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00069-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00003-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00003-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00045-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00045-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00017-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00017-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00035-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00035-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00053-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00053-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00064-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00064-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00078-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00078-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00036-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00036-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00062-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00062-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00005-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00005-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00050-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00050-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00077-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00077-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00031-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00031-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00012-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00012-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00071-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00071-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00022-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00022-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00054-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00054-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00033-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00033-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00074-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00074-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00075-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00075-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00040-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00040-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00028-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00028-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00076-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00076-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00063-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00063-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00039-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00039-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00016-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00016-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00002-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00002-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model-00014-of-00081.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model-00014-of-00081.safetensors
intel-glm-5-int4-mixed-13556-v3-uploader: cp /dev/shm/model_output/model_extra_tensors.safetensors s3://guanaco-vllm-models/intel-glm-5-int4-mixed-13556-v3/default/model_extra_tensors.safetensors
2026-03-25T17:59:45.081017+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
Job intel-glm-5-int4-mixed-13556-v3-uploader completed after 618.12s with status: succeeded
Stopping job with name intel-glm-5-int4-mixed-13556-v3-uploader
Pipeline stage VLLMUploader completed in 619.69s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 1.38s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service intel-glm-5-int4-mixed-13556-v3
Waiting for inference service intel-glm-5-int4-mixed-13556-v3 to be ready
2026-03-25T18:00:45.311873+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
2026-03-25T18:01:45.837983+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
2026-03-25T18:02:46.058533+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
2026-03-25T18:03:46.269608+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
2026-03-25T18:04:46.499604+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
2026-03-25T18:05:46.734527+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
2026-03-25T18:06:46.975635+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
Inference service intel-glm-5-int4-mixed-13556-v3 ready after 435.39760637283325s
Pipeline stage VLLMDeployer completed in 437.23s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 11.860048532485962s
Received healthy response to inference request in 2.854172706604004s
Received healthy response to inference request in 2.8702292442321777s
Received healthy response to inference request in 2.8261055946350098s
Received healthy response to inference request in 2.578589677810669s
2026-03-25T18:07:47.202752+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
Received healthy response to inference request in 2.79012131690979s
Received healthy response to inference request in 2.7961323261260986s
Received healthy response to inference request in 2.689939260482788s
Received healthy response to inference request in 2.721235990524292s
Received healthy response to inference request in 2.7312676906585693s
Received healthy response to inference request in 2.839524507522583s
Received healthy response to inference request in 2.8243019580841064s
Received healthy response to inference request in 2.7691054344177246s
Received healthy response to inference request in 2.580687999725342s
Received healthy response to inference request in 2.642739772796631s
Received healthy response to inference request in 2.7593376636505127s
Received healthy response to inference request in 2.994675874710083s
Received healthy response to inference request in 2.845355749130249s
Received healthy response to inference request in 2.8019931316375732s
Received healthy response to inference request in 2.9781413078308105s
Received healthy response to inference request in 2.8925676345825195s
Received healthy response to inference request in 2.8570141792297363s
Received healthy response to inference request in 2.983079195022583s
Received healthy response to inference request in 3.064021348953247s
2026-03-25T18:08:47.401422+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
Received healthy response to inference request in 11.45807695388794s
{"detail":"('http://intel-glm-5-int4-mixed-13556-v3-predictor.tenant-chaiml-guanaco.k2.chaiverse.com/v1/completions', 'upstream connect error or disconnect/reset before headers. reset reason: connection termination')"}
Received unhealthy response to inference request!
Received healthy response to inference request in 11.852257013320923s
Received healthy response to inference request in 11.764743328094482s
Received healthy response to inference request in 2.8458099365234375s
Received healthy response to inference request in 2.6623497009277344s
30 requests
1 failed requests
5th percentile: 2.608611297607422
10th percentile: 2.660388708114624
20th percentile: 2.729261350631714
30th percentile: 2.7838165521621705
40th percentile: 2.815378427505493
50th percentile: 2.842440128326416
60th percentile: 2.8553092956542967
70th percentile: 2.9182397365570067
80th percentile: 3.008544969558716
90th percentile: 11.488743591308594
95th percentile: 11.812875854969024
99th percentile: 11.8577889919281
mean time: 4.266807691256205
%s, retrying in %s seconds...
Received healthy response to inference request in 2.962038993835449s
Received healthy response to inference request in 3.418459415435791s
Received healthy response to inference request in 2.7983357906341553s
2026-03-25T18:09:47.648565+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
Received healthy response to inference request in 3.30920147895813s
Received healthy response to inference request in 2.713923454284668s
Received healthy response to inference request in 2.8670871257781982s
Received healthy response to inference request in 3.43808913230896s
Received healthy response to inference request in 2.911045551300049s
Received healthy response to inference request in 2.94315767288208s
Received healthy response to inference request in 3.513453245162964s
Received healthy response to inference request in 2.666537284851074s
Received healthy response to inference request in 2.700368642807007s
Received healthy response to inference request in 2.573578357696533s
Received healthy response to inference request in 2.6700940132141113s
Received healthy response to inference request in 2.9638144969940186s
Received healthy response to inference request in 3.103987455368042s
Received healthy response to inference request in 2.7672269344329834s
Received healthy response to inference request in 2.9507222175598145s
Received healthy response to inference request in 2.903353691101074s
Received healthy response to inference request in 2.950575351715088s
Received healthy response to inference request in 3.2474873065948486s
Received healthy response to inference request in 2.8703649044036865s
2026-03-25T18:10:47.891511+00:00 monitor updated for intel-glm-5-int4-mixed-_13556_v3
Received healthy response to inference request in 2.8828554153442383s
Received healthy response to inference request in 2.734581232070923s
Received healthy response to inference request in 2.9719531536102295s
Received healthy response to inference request in 3.0353684425354004s
Received healthy response to inference request in 2.7130401134490967s
Received healthy response to inference request in 2.733057975769043s
Received healthy response to inference request in 2.872560501098633s
Received healthy response to inference request in 2.6809072494506836s
30 requests
0 failed requests
5th percentile: 2.668137812614441
10th percentile: 2.6798259258270263
20th percentile: 2.7137467861175537
30th percentile: 2.757433223724365
40th percentile: 2.8690537929534914
50th percentile: 2.8931045532226562
60th percentile: 2.9461247444152834
70th percentile: 2.96257164478302
80th percentile: 3.049092245101929
90th percentile: 3.3201272726058964
95th percentile: 3.429255759716034
99th percentile: 3.4915976524353027
mean time: 2.928907553354899
Pipeline stage StressChecker completed in 228.71s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.47s
Shutdown handler de-registered
intel-glm-5-int4-mixed-_13556_v3 status is now deployed due to DeploymentManager action
intel-glm-5-int4-mixed-_13556_v3 status is now inactive due to auto deactivation removed underperforming models