Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-pony-d3-g46-pv2-41910-v6-uploader
Waiting for job on chaiml-pony-d3-g46-pv2-41910-v6-uploader to finish
chaiml-pony-d3-g46-pv2-41910-v6-uploader: Using quantization_mode: fp8
chaiml-pony-d3-g46-pv2-41910-v6-uploader: Checking if ChaiML/pony-d3-g46-pv2-lr5e6ep1r64b8-FP8 already exists in ChaiML
chaiml-pony-d3-g46-pv2-41910-v6-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-pony-d3-g46-pv2-41910-v6-uploader: Downloading snapshot of ChaiML/pony-d3-g46-pv2-lr5e6ep1r64b8-FP8...
2026-03-11T14:22:41.668081+00:00 monitor updated for chaiml-pony-d3-g46-pv2-_41910_v6
2026-03-11T14:23:41.753259+00:00 monitor updated for chaiml-pony-d3-g46-pv2-_41910_v6
2026-03-11T14:24:41.838939+00:00 monitor updated for chaiml-pony-d3-g46-pv2-_41910_v6
chaiml-pony-d3-g46-pv2-41910-v6-uploader: Downloaded in 136.784s
chaiml-pony-d3-g46-pv2-41910-v6-uploader: Processed model ChaiML/pony-d3-g46-pv2-lr5e6ep1r64b8 in 140.206s
chaiml-pony-d3-g46-pv2-41910-v6-uploader: creating bucket guanaco-vllm-models
chaiml-pony-d3-g46-pv2-41910-v6-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3-g46-pv2-41910-v6-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-pony-d3-g46-pv2-41910-v6-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-pony-d3-g46-pv2-41910-v6-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-pony-d3-g46-pv2-41910-v6-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3-g46-pv2-41910-v6-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-pony-d3-g46-pv2-41910-v6-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3-g46-pv2-41910-v6-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-pony-d3-g46-pv2-41910-v6-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3-g46-pv2-41910-v6-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-pony-d3-g46-pv2-41910-v6-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3-g46-pv2-41910-v6-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-pony-d3-g46-pv2-41910-v6-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-pony-d3-g46-pv2-41910-v6-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-pony-d3-g46-pv2-41910-v6-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-pony-d3-g46-pv2-41910-v6-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-pony-d3-g46-pv2-41910-v6-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-pony-d3-g46-pv2-41910-v6-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/.gitattributes
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/chat_template.jinja
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/config.json
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/generation_config.json
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model.safetensors.index.json
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/special_tokens_map.json
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/recipe.yaml
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/tokenizer_config.json
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/tokenizer.json
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00072-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00072-of-00072.safetensors
2026-03-11T14:25:41.928933+00:00 monitor updated for chaiml-pony-d3-g46-pv2-_41910_v6
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00071-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00071-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00032-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00032-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00063-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00063-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00015-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00015-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00009-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00009-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00007-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00007-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00057-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00057-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00029-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00029-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00026-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00026-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00027-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00027-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00049-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00049-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00047-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00047-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00004-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00004-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00044-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00044-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00041-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00041-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00059-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00059-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00042-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00042-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00043-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00043-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00058-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00058-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00025-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00025-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00010-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00010-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00002-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00002-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00056-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00056-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00005-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00005-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00051-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00051-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00052-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00052-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00019-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00019-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00021-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00021-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00055-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00055-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00036-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00036-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00011-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00011-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00023-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00023-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00045-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00045-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00048-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00048-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00061-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00061-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00030-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00030-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00050-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00050-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00033-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00033-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00017-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00017-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00008-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00008-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00054-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00054-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00060-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00060-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00038-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00038-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00037-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00037-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v6-uploader: cp /dev/shm/model_output/model-00022-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v6/default/model-00022-of-00072.safetensors
Job chaiml-pony-d3-g46-pv2-41910-v6-uploader completed after 289.05s with status: succeeded
Stopping job with name chaiml-pony-d3-g46-pv2-41910-v6-uploader
Pipeline stage VLLMUploader completed in 289.62s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.54s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-pony-d3-g46-pv2-41910-v6
Waiting for inference service chaiml-pony-d3-g46-pv2-41910-v6 to be ready
2026-03-11T14:26:42.024700+00:00 monitor updated for chaiml-pony-d3-g46-pv2-_41910_v6
2026-03-11T14:27:42.125098+00:00 monitor updated for chaiml-pony-d3-g46-pv2-_41910_v6
2026-03-11T14:28:42.241330+00:00 monitor updated for chaiml-pony-d3-g46-pv2-_41910_v6
2026-03-11T14:29:42.341569+00:00 monitor updated for chaiml-pony-d3-g46-pv2-_41910_v6
2026-03-11T14:30:42.432503+00:00 monitor updated for chaiml-pony-d3-g46-pv2-_41910_v6
2026-03-11T14:31:42.529246+00:00 monitor updated for chaiml-pony-d3-g46-pv2-_41910_v6
Inference service chaiml-pony-d3-g46-pv2-41910-v6 ready after 351.10927391052246s
Pipeline stage VLLMDeployer completed in 351.65s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 8.974537134170532s
Received healthy response to inference request in 9.000672578811646s
2026-03-11T14:32:42.626547+00:00 monitor updated for chaiml-pony-d3-g46-pv2-_41910_v6
Received healthy response to inference request in 1.985025405883789s
Received healthy response to inference request in 9.026894807815552s
Received healthy response to inference request in 2.554452896118164s
Received healthy response to inference request in 8.920087575912476s
Received healthy response to inference request in 1.9502854347229004s
Received healthy response to inference request in 9.182294607162476s
Received healthy response to inference request in 2.1413064002990723s
Received healthy response to inference request in 2.1758100986480713s
Received healthy response to inference request in 2.082031488418579s
Received healthy response to inference request in 2.042747735977173s
Received healthy response to inference request in 1.9659490585327148s
Received healthy response to inference request in 1.9753108024597168s
Received healthy response to inference request in 1.9729745388031006s
Received healthy response to inference request in 1.9459586143493652s
Received healthy response to inference request in 2.0485446453094482s
Received healthy response to inference request in 1.994666576385498s
Received healthy response to inference request in 1.9695184230804443s
Received healthy response to inference request in 2.0085573196411133s
2026-03-11T14:33:42.727131+00:00 monitor updated for chaiml-pony-d3-g46-pv2-_41910_v6
Received healthy response to inference request in 1.9375452995300293s
Received healthy response to inference request in 2.3098952770233154s
Received healthy response to inference request in 2.0202066898345947s
Received healthy response to inference request in 1.9383387565612793s
Received healthy response to inference request in 2.008971691131592s
Received healthy response to inference request in 2.048422336578369s
Received healthy response to inference request in 2.034601926803589s
Received healthy response to inference request in 2.0168051719665527s
Received healthy response to inference request in 2.0246706008911133s
Received healthy response to inference request in 1.986917495727539s
30 requests
0 failed requests
5th percentile: 1.941767692565918
10th percentile: 1.949852752685547
20th percentile: 1.9722833156585693
30th percentile: 1.986349868774414
40th percentile: 2.0088059425354006
50th percentile: 2.022438645362854
60th percentile: 2.0450175762176515
70th percentile: 2.099813961982727
80th percentile: 2.358806800842286
90th percentile: 8.977150678634644
95th percentile: 9.015094804763795
99th percentile: 9.137228665351868
mean time: 3.2081333796183267
Pipeline stage StressChecker completed in 99.25s
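
Note on the StressChecker summary: the logged percentiles are consistent with linear interpolation between sorted samples, which is also the default behaviour of `numpy.percentile`. Whether the pipeline actually uses NumPy is an assumption; the log does not say. The sketch below recomputes a few of the figures from the 30 latencies logged above using only the standard library. The names `LATENCIES` and `percentile` are hypothetical and are not taken from the pipeline code.

```python
import math

# Latencies in seconds, copied from the 30 "Received healthy response" lines.
LATENCIES = [
    8.974537134170532, 9.000672578811646, 1.985025405883789, 9.026894807815552,
    2.554452896118164, 8.920087575912476, 1.9502854347229004, 9.182294607162476,
    2.1413064002990723, 2.1758100986480713, 2.082031488418579, 2.042747735977173,
    1.9659490585327148, 1.9753108024597168, 1.9729745388031006, 1.9459586143493652,
    2.0485446453094482, 1.994666576385498, 1.9695184230804443, 2.0085573196411133,
    1.9375452995300293, 2.3098952770233154, 2.0202066898345947, 1.9383387565612793,
    2.008971691131592, 2.048422336578369, 2.034601926803589, 2.0168051719665527,
    2.0246706008911133, 1.986917495727539,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation.

    Assumed method: rank = q/100 * (n - 1) on the sorted samples, then
    interpolate between the two neighbouring samples.
    """
    ordered = sorted(values)
    rank = (q / 100) * (len(ordered) - 1)
    lower = math.floor(rank)
    upper = math.ceil(rank)
    frac = rank - lower
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])


def summarize(values, quantiles=(5, 50, 90, 99)):
    """Build a small summary dict similar in spirit to the logged report."""
    summary = {"requests": len(values), "mean": sum(values) / len(values)}
    for q in quantiles:
        summary[f"p{q}"] = percentile(values, q)
    return summary


if __name__ == "__main__":
    for key, value in summarize(LATENCIES).items():
        print(f"{key}: {value}")
```

The gap between the median (about 2.02 s) and the mean (about 3.21 s) comes from five requests that took roughly 9 s each, which is why the 90th percentile and above sit near 9 s while the 80th percentile is still under 2.4 s.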
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.68s
Shutdown handler de-registered
chaiml-pony-d3-g46-pv2-_41910_v6 status is now deployed due to DeploymentManager action
chaiml-pony-d3-g46-pv2-_41910_v6 status is now torndown due to DeploymentManager action