Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-pony-d3-g46-pv2-64172-v4-uploader
Waiting for job on chaiml-pony-d3-g46-pv2-64172-v4-uploader to finish
chaiml-pony-d3-g46-pv2-64172-v4-uploader: Using quantization_mode: fp8
chaiml-pony-d3-g46-pv2-64172-v4-uploader: Checking if ChaiML/pony-d3-g46-pv2-lr5e6ep2r64b8-FP8 already exists in ChaiML
chaiml-pony-d3-g46-pv2-64172-v4-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-pony-d3-g46-pv2-64172-v4-uploader: Downloading snapshot of ChaiML/pony-d3-g46-pv2-lr5e6ep2r64b8-FP8...
chaiml-pony-d3-g46-pv2-64172-v4-uploader: Downloaded in 112.673s
chaiml-pony-d3-g46-pv2-64172-v4-uploader: Processed model ChaiML/pony-d3-g46-pv2-lr5e6ep2r64b8 in 116.196s
chaiml-pony-d3-g46-pv2-64172-v4-uploader: creating bucket guanaco-vllm-models
chaiml-pony-d3-g46-pv2-64172-v4-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3-g46-pv2-64172-v4-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-pony-d3-g46-pv2-64172-v4-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-pony-d3-g46-pv2-64172-v4-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-pony-d3-g46-pv2-64172-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3-g46-pv2-64172-v4-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-pony-d3-g46-pv2-64172-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3-g46-pv2-64172-v4-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-pony-d3-g46-pv2-64172-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3-g46-pv2-64172-v4-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-pony-d3-g46-pv2-64172-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3-g46-pv2-64172-v4-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-pony-d3-g46-pv2-64172-v4-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-pony-d3-g46-pv2-64172-v4-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-pony-d3-g46-pv2-64172-v4-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-pony-d3-g46-pv2-64172-v4-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-pony-d3-g46-pv2-64172-v4-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-pony-d3-g46-pv2-64172-v4-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/config.json
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/generation_config.json
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/.gitattributes
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/chat_template.jinja
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/special_tokens_map.json
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/recipe.yaml
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/tokenizer_config.json
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model.safetensors.index.json
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/tokenizer.json
Retrying (%r) after connection broken by '%r': %s
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00072-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00072-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00071-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00071-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00035-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00035-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00008-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00008-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00059-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00059-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00045-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00045-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00046-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00046-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00019-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00019-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00034-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00034-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00014-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00014-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00001-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00001-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00061-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00061-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00032-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00032-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00069-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00069-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00044-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00044-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00030-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00030-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00005-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00005-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00048-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00048-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00036-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00036-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00025-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00025-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00050-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00050-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00007-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00007-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00031-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00031-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00027-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00027-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00015-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00015-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00017-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00017-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00018-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00018-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00055-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00055-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00041-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00041-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00051-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00051-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00011-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00011-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00064-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00064-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00065-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00065-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00009-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00009-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00057-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00057-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00028-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00028-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00002-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00002-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00070-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00070-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00020-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00020-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00024-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00024-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00042-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00042-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00067-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00067-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00010-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00010-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00052-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00052-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00063-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00063-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00016-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00016-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00037-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00037-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00066-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00066-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00058-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00058-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00021-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00021-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00039-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00039-of-00072.safetensors
chaiml-pony-d3-g46-pv2-64172-v4-uploader: cp /dev/shm/model_output/model-00023-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-64172-v4/default/model-00023-of-00072.safetensors
Job chaiml-pony-d3-g46-pv2-64172-v4-uploader completed after 261.03s with status: succeeded
Stopping job with name chaiml-pony-d3-g46-pv2-64172-v4-uploader
Pipeline stage VLLMUploader completed in 261.55s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 1.18s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-pony-d3-g46-pv2-64172-v4
Waiting for inference service chaiml-pony-d3-g46-pv2-64172-v4 to be ready
Inference service chaiml-pony-d3-g46-pv2-64172-v4 ready after 552.7320969104767s
Pipeline stage VLLMDeployer completed in 553.21s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.257096529006958s
Received healthy response to inference request in 2.192244529724121s
Received healthy response to inference request in 2.0983099937438965s
Received healthy response to inference request in 2.3123464584350586s
Received healthy response to inference request in 2.25097393989563s
Received healthy response to inference request in 2.2116754055023193s
Received healthy response to inference request in 2.718593120574951s
Received healthy response to inference request in 2.1988463401794434s
Received healthy response to inference request in 2.159359931945801s
Received healthy response to inference request in 2.0681042671203613s
Received healthy response to inference request in 2.272961139678955s
Received healthy response to inference request in 2.6148974895477295s
Received healthy response to inference request in 2.2642745971679688s
Received healthy response to inference request in 2.0756914615631104s
Received healthy response to inference request in 2.052964687347412s
Received healthy response to inference request in 2.0864298343658447s
Received healthy response to inference request in 2.1088624000549316s
Received healthy response to inference request in 2.200190544128418s
Received healthy response to inference request in 2.6653215885162354s
Received healthy response to inference request in 2.1894595623016357s
Received healthy response to inference request in 2.085101366043091s
Received healthy response to inference request in 2.152958631515503s
Received healthy response to inference request in 2.084825277328491s
Received healthy response to inference request in 2.34389328956604s
Received healthy response to inference request in 2.0723915100097656s
Received healthy response to inference request in 2.150237560272217s
Received healthy response to inference request in 2.0926175117492676s
Received healthy response to inference request in 2.1252799034118652s
Received healthy response to inference request in 3.3686625957489014s
Received healthy response to inference request in 2.4448258876800537s
30 requests
0 failed requests
5th percentile: 2.0700335264205934
10th percentile: 2.075361466407776
20th percentile: 2.086164140701294
30th percentile: 2.105696678161621
40th percentile: 2.1518702030181887
50th percentile: 2.1908520460128784
60th percentile: 2.2047844886779786
70th percentile: 2.2668805599212645
80th percentile: 2.364079809188843
90th percentile: 2.670648741722107
95th percentile: 3.0147699952125535
99th percentile: 3.336308436393738
mean time: 2.2973132451375324
Pipeline stage StressChecker completed in 72.94s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.82s
Shutdown handler de-registered
chaiml-pony-d3-g46-pv2-_64172_v4 status is now deployed due to DeploymentManager action
chaiml-pony-d3-g46-pv2-_64172_v4 status is now inactive due to auto deactivation removed underperforming models
chaiml-pony-d3-g46-pv2-_64172_v4 status is now torndown due to DeploymentManager action