Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-pony-d3-g46-pv2-41910-v1-uploader
Waiting for job on chaiml-pony-d3-g46-pv2-41910-v1-uploader to finish
chaiml-pony-d3-g46-pv2-41910-v1-uploader: Using quantization_mode: fp8
chaiml-pony-d3-g46-pv2-41910-v1-uploader: Checking if ChaiML/pony-d3-g46-pv2-lr5e6ep1r64b8-FP8 already exists in ChaiML
chaiml-pony-d3-g46-pv2-41910-v1-uploader: Downloading snapshot of ChaiML/pony-d3-g46-pv2-lr5e6ep1r64b8...
chaiml-pony-d3-g46-pv2-41910-v1-uploader: Downloaded in 264.093s
chaiml-pony-d3-g46-pv2-41910-v1-uploader: Loading /tmp/model_input...
chaiml-pony-d3-g46-pv2-41910-v1-uploader: The tokenizer you are loading from '/tmp/model_input' with an incorrect regex pattern: https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503/discussions/84#69121093e8b480e709447d5e. This will lead to incorrect tokenization. You should set the `fix_mistral_regex=True` flag when loading this tokenizer to fix this issue.
chaiml-pony-d3-g46-pv2-41910-v1-uploader: `torch_dtype` is deprecated! Use `dtype` instead!
chaiml-pony-d3-g46-pv2-41910-v1-uploader: Some parameters are on the meta device because they were offloaded to the cpu.
chaiml-pony-d3-g46-pv2-41910-v1-uploader: Applying quantization...
chaiml-pony-d3-g46-pv2-41910-v1-uploader: The tokenizer you are loading from '/tmp/model_input' with an incorrect regex pattern: https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503/discussions/84#69121093e8b480e709447d5e. This will lead to incorrect tokenization. You should set the `fix_mistral_regex=True` flag when loading this tokenizer to fix this issue.
chaiml-pony-d3-g46-pv2-41910-v1-uploader: 2026-03-05T11:26:15.794003-0800 | reset | INFO - Compression lifecycle reset
chaiml-pony-d3-g46-pv2-41910-v1-uploader: 2026-03-05T11:26:15.860093-0800 | from_modifiers | INFO - Creating recipe from modifiers
chaiml-pony-d3-g46-pv2-41910-v1-uploader: 2026-03-05T11:27:13.277469-0800 | initialize | INFO - Compression lifecycle initialized for 1 modifiers
chaiml-pony-d3-g46-pv2-41910-v1-uploader: 2026-03-05T11:27:13.277935-0800 | IndependentPipeline | INFO - Inferred `DataFreePipeline` for `QuantizationModifier`
chaiml-pony-d3-g46-pv2-41910-v1-uploader: Some parameters are on the meta device because they were offloaded to the cpu.
chaiml-pony-d3-g46-pv2-41910-v1-uploader: 2026-03-05T11:49:00.136888-0800 | finalize | INFO - Compression lifecycle finalized for 1 modifiers
chaiml-pony-d3-g46-pv2-41910-v1-uploader: 2026-03-05T11:49:06.066696-0800 | post_process | WARNING - Optimized model is not saved. To save, please provide `output_dir` as input arg. Ex. `oneshot(..., output_dir=...)`
chaiml-pony-d3-g46-pv2-41910-v1-uploader: Saving to /dev/shm/model_output...
chaiml-pony-d3-g46-pv2-41910-v1-uploader: 2026-03-05T11:49:06.101302-0800 | get_model_compressor | INFO - skip_sparsity_compression_stats set to True. Skipping sparsity compression statistic calculations. No sparsity compressor will be applied.
chaiml-pony-d3-g46-pv2-41910-v1-uploader: Cleaning quantization config in /dev/shm/model_output
chaiml-pony-d3-g46-pv2-41910-v1-uploader: Pushing to ChaiML/pony-d3-g46-pv2-lr5e6ep1r64b8-FP8
chaiml-pony-d3-g46-pv2-41910-v1-uploader: Checking if ChaiML/pony-d3-g46-pv2-lr5e6ep1r64b8-FP8 already exists in ChaiML
chaiml-pony-d3-g46-pv2-41910-v1-uploader: ChaiML/pony-d3-g46-pv2-lr5e6ep1r64b8-FP8 already exists in ChaiML
chaiml-pony-d3-g46-pv2-41910-v1-uploader: Processed model ChaiML/pony-d3-g46-pv2-lr5e6ep1r64b8 in 4330.973s
chaiml-pony-d3-g46-pv2-41910-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3-g46-pv2-41910-v1-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-pony-d3-g46-pv2-41910-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3-g46-pv2-41910-v1-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-pony-d3-g46-pv2-41910-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3-g46-pv2-41910-v1-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-pony-d3-g46-pv2-41910-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3-g46-pv2-41910-v1-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-pony-d3-g46-pv2-41910-v1-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-pony-d3-g46-pv2-41910-v1-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-pony-d3-g46-pv2-41910-v1-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-pony-d3-g46-pv2-41910-v1-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-pony-d3-g46-pv2-41910-v1-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-pony-d3-g46-pv2-41910-v1-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/config.json
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/generation_config.json
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/chat_template.jinja
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/tokenizer_config.json
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/special_tokens_map.json
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/recipe.yaml
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model.safetensors.index.json
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/tokenizer.json
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00072-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00072-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00010-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00010-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00007-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00007-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00003-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00003-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00016-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00016-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00012-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00012-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00023-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00023-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00014-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00014-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00025-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00025-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00005-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00005-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00048-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00048-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00020-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00020-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00030-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00030-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00039-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00039-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00047-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00047-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00064-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00064-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00045-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00045-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00061-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00061-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00006-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00006-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00051-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00051-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00046-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00046-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00004-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00004-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00034-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00034-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00002-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00002-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00038-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00038-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00001-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00001-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00021-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00021-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00036-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00036-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00008-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00008-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00041-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00041-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00037-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00037-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00031-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00031-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00018-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00018-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00013-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00013-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00009-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00009-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00011-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00011-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00067-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00067-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00040-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00040-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00056-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00056-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00069-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00069-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00065-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00065-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00022-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00022-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00033-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00033-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00070-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00070-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00017-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00017-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00035-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00035-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00062-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00062-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00050-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00050-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00044-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00044-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00049-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00049-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00043-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00043-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00042-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00042-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00053-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00053-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00059-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00059-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00015-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00015-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00058-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00058-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00068-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00068-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00019-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00019-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00060-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00060-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00026-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00026-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00024-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00024-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00057-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00057-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00055-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00055-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00028-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00028-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00063-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00063-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00052-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00052-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00027-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00027-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00066-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00066-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00029-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00029-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00054-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00054-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v1-uploader: cp /dev/shm/model_output/model-00032-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v1/default/model-00032-of-00072.safetensors
Job chaiml-pony-d3-g46-pv2-41910-v1-uploader completed after 4487.83s with status: succeeded
Stopping job with name chaiml-pony-d3-g46-pv2-41910-v1-uploader
Pipeline stage VLLMUploader completed in 4490.02s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 1.03s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-pony-d3-g46-pv2-41910-v1
Waiting for inference service chaiml-pony-d3-g46-pv2-41910-v1 to be ready
Inference service chaiml-pony-d3-g46-pv2-41910-v1 ready after 551.7812061309814s
Pipeline stage VLLMDeployer completed in 552.23s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.4780771732330322s
Received healthy response to inference request in 2.5208168029785156s
Received healthy response to inference request in 2.3083977699279785s
Received healthy response to inference request in 2.0530431270599365s
Received healthy response to inference request in 2.123756170272827s
Received healthy response to inference request in 2.4419407844543457s
Received healthy response to inference request in 2.135363817214966s
Received healthy response to inference request in 2.0550119876861572s
Received healthy response to inference request in 2.6965177059173584s
Received healthy response to inference request in 3.006225347518921s
Received healthy response to inference request in 2.4036004543304443s
Received healthy response to inference request in 2.1451563835144043s
Received healthy response to inference request in 2.126631498336792s
Received healthy response to inference request in 2.421478271484375s
Received healthy response to inference request in 2.732311487197876s
Received healthy response to inference request in 2.0082693099975586s
Received healthy response to inference request in 2.2501463890075684s
Received healthy response to inference request in 2.021573305130005s
Received healthy response to inference request in 2.79610276222229s
Received healthy response to inference request in 2.044235944747925s
Received healthy response to inference request in 2.380404472351074s
Received healthy response to inference request in 4.820250034332275s
Received healthy response to inference request in 1.975853681564331s
Received healthy response to inference request in 2.0072617530822754s
Received healthy response to inference request in 2.5392677783966064s
Received healthy response to inference request in 2.0088493824005127s
Received healthy response to inference request in 2.0743863582611084s
Received healthy response to inference request in 2.3163037300109863s
Received healthy response to inference request in 2.271071195602417s
Received healthy response to inference request in 2.099879741668701s
30 requests
0 failed requests
5th percentile: 2.0077151536941527
10th percentile: 2.008791375160217
20th percentile: 2.051281690597534
30th percentile: 2.0922317266464234
40th percentile: 2.131870889663696
50th percentile: 2.2606087923049927
60th percentile: 2.3419440269470213
70th percentile: 2.427617025375366
80th percentile: 2.524506998062134
90th percentile: 2.7386906147003174
95th percentile: 2.911670184135436
99th percentile: 4.294182875156404
mean time: 2.3754061539967855
Pipeline stage StressChecker completed in 88.11s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.87s
Shutdown handler de-registered
chaiml-pony-d3-g46-pv2-_41910_v1 status is now deployed due to DeploymentManager action
chaiml-pony-d3-g46-pv2-_41910_v1 status is now inactive due to auto deactivation removed underperforming models
chaiml-pony-d3-g46-pv2-_41910_v1 status is now torndown due to DeploymentManager action