Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-pony-d3-g46-pv2-41910-v2-uploader
Waiting for job on chaiml-pony-d3-g46-pv2-41910-v2-uploader to finish
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Using quantization_mode: fp8
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Checking if ChaiML/pony-d3-g46-pv2-lr5e6ep1r64b8-FP8 already exists in ChaiML
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Downloading snapshot of ChaiML/pony-d3-g46-pv2-lr5e6ep1r64b8...
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Downloaded in 247.829s
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Loading /tmp/model_input...
chaiml-pony-d3-g46-pv2-41910-v2-uploader: The tokenizer you are loading from '/tmp/model_input' with an incorrect regex pattern: https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503/discussions/84#69121093e8b480e709447d5e. This will lead to incorrect tokenization. You should set the `fix_mistral_regex=True` flag when loading this tokenizer to fix this issue.
chaiml-pony-d3-g46-pv2-41910-v2-uploader: `torch_dtype` is deprecated! Use `dtype` instead!
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Some parameters are on the meta device because they were offloaded to the cpu.
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Applying quantization...
chaiml-pony-d3-g46-pv2-41910-v2-uploader: The tokenizer you are loading from '/tmp/model_input' with an incorrect regex pattern: https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503/discussions/84#69121093e8b480e709447d5e. This will lead to incorrect tokenization. You should set the `fix_mistral_regex=True` flag when loading this tokenizer to fix this issue.
chaiml-pony-d3-g46-pv2-41910-v2-uploader: 2026-03-05T11:26:17.176733-0800 | reset | INFO - Compression lifecycle reset
chaiml-pony-d3-g46-pv2-41910-v2-uploader: 2026-03-05T11:26:17.251455-0800 | from_modifiers | INFO - Creating recipe from modifiers
chaiml-pony-d3-g46-pv2-41910-v2-uploader: 2026-03-05T11:27:12.941500-0800 | initialize | INFO - Compression lifecycle initialized for 1 modifiers
chaiml-pony-d3-g46-pv2-41910-v2-uploader: 2026-03-05T11:27:12.941989-0800 | IndependentPipeline | INFO - Inferred `DataFreePipeline` for `QuantizationModifier`
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Some parameters are on the meta device because they were offloaded to the cpu.
chaiml-pony-d3-g46-pv2-41910-v2-uploader: 2026-03-05T11:47:55.193141-0800 | finalize | INFO - Compression lifecycle finalized for 1 modifiers
chaiml-pony-d3-g46-pv2-41910-v2-uploader: 2026-03-05T11:48:01.333304-0800 | post_process | WARNING - Optimized model is not saved. To save, please provide`output_dir` as input arg.Ex. `oneshot(..., output_dir=...)`
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Saving to /dev/shm/model_output...
chaiml-pony-d3-g46-pv2-41910-v2-uploader: 2026-03-05T11:48:01.360071-0800 | get_model_compressor | INFO - skip_sparsity_compression_stats set to True. Skipping sparsity compression statistic calculations. No sparsity compressor will be applied.
Failed to get response for submission chaiml-pony-v2-g46-lr1_80834_v29: HTTPConnectionPool(host='chaiml-pony-v2-g46-lr1-80834-v29-predictor.tenant-chaiml-guanaco.k2.chaiverse.com', port=80): Read timed out. (read timeout=20.0)
Failed to get response for submission chaiml-pony-v2-g46-lr1_80834_v29: ('http://chaiml-pony-v2-g46-lr1-80834-v29-predictor.tenant-chaiml-guanaco.k2.chaiverse.com/v1/completions', 'request timeout')
Connection pool is full, discarding connection: %s. Connection pool size: %s
Failed to get response for submission chaiml-csfs-v3-3-dpo-lr5e_937_v5: ('http://guanaco-model-mesh-load-balancer.model-mesh.kchai-google-us-east4.chaiverse.com/models/chaiml-csfs-v3-3-dpo-lr5e_937_v5/predict', '{"detail":"1 validation error for RuntimeResponse\\npredictions\\n Field required [type=missing, input_value={\'detail\': \\"503, message=...o-lr5e_937_v5/predict\'\\"}, input_type=dict]\\n For further information visit https://errors.pydantic.dev/2.12/v/missing"}')
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Cleaning quantization config in /dev/shm/model_output
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Pushing to ChaiML/pony-d3-g46-pv2-lr5e6ep1r64b8-FP8
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Checking if ChaiML/pony-d3-g46-pv2-lr5e6ep1r64b8-FP8 already exists in ChaiML
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Creating repo ChaiML/pony-d3-g46-pv2-lr5e6ep1r64b8-FP8 and uploading /dev/shm/model_output to it
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------- 2026-03-05 12:31:32 (0:00:00) ----------
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Files: hashed 8/80 (28.2M/354.7G) | pre-uploaded: 0/1 (0.0/354.7G) (+72 unsure) | committed: 0/80 (0.0/354.7G) | ignored: 0
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Workers: hashing: 72 | get upload mode: 0 | pre-uploading: 1 | committing: 0 | waiting: 53
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------------------------------------------------
chaiml-pony-d3-g46-pv2-41910-v2-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------- 2026-03-05 12:32:32 (0:01:00) ----------
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Files: hashed 80/80 (354.7G/354.7G) | pre-uploaded: 3/73 (5.1G/354.7G) | committed: 0/80 (0.0/354.7G) | ignored: 0
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 70 | committing: 0 | waiting: 56
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------------------------------------------------
chaiml-pony-d3-g46-pv2-41910-v2-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------- 2026-03-05 12:33:32 (0:02:00) ----------
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Files: hashed 80/80 (354.7G/354.7G) | pre-uploaded: 10/73 (40.0G/354.7G) | committed: 0/80 (0.0/354.7G) | ignored: 0
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 63 | committing: 0 | waiting: 63
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------------------------------------------------
chaiml-pony-d3-g46-pv2-41910-v2-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------- 2026-03-05 12:34:32 (0:03:00) ----------
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Files: hashed 80/80 (354.7G/354.7G) | pre-uploaded: 11/73 (45.0G/354.7G) | committed: 0/80 (0.0/354.7G) | ignored: 0
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 62 | committing: 0 | waiting: 64
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------------------------------------------------
chaiml-pony-d3-g46-pv2-41910-v2-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------- 2026-03-05 12:35:32 (0:04:00) ----------
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Files: hashed 80/80 (354.7G/354.7G) | pre-uploaded: 19/73 (85.0G/354.7G) | committed: 0/80 (0.0/354.7G) | ignored: 0
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 54 | committing: 0 | waiting: 72
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------------------------------------------------
chaiml-pony-d3-g46-pv2-41910-v2-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------- 2026-03-05 12:36:32 (0:05:00) ----------
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Files: hashed 80/80 (354.7G/354.7G) | pre-uploaded: 27/73 (125.0G/354.7G) | committed: 0/80 (0.0/354.7G) | ignored: 0
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 46 | committing: 0 | waiting: 80
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------------------------------------------------
chaiml-pony-d3-g46-pv2-41910-v2-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------- 2026-03-05 12:37:32 (0:06:00) ----------
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Files: hashed 80/80 (354.7G/354.7G) | pre-uploaded: 35/73 (164.9G/354.7G) | committed: 0/80 (0.0/354.7G) | ignored: 0
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 38 | committing: 0 | waiting: 88
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------------------------------------------------
chaiml-pony-d3-g46-pv2-41910-v2-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------- 2026-03-05 12:38:32 (0:07:00) ----------
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Files: hashed 80/80 (354.7G/354.7G) | pre-uploaded: 41/73 (194.9G/354.7G) | committed: 0/80 (0.0/354.7G) | ignored: 0
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 32 | committing: 0 | waiting: 94
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------------------------------------------------
chaiml-pony-d3-g46-pv2-41910-v2-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------- 2026-03-05 12:39:32 (0:08:00) ----------
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Files: hashed 80/80 (354.7G/354.7G) | pre-uploaded: 43/73 (204.9G/354.7G) | committed: 0/80 (0.0/354.7G) | ignored: 0
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 30 | committing: 0 | waiting: 96
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------------------------------------------------
chaiml-pony-d3-g46-pv2-41910-v2-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------- 2026-03-05 12:40:32 (0:09:00) ----------
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Files: hashed 80/80 (354.7G/354.7G) | pre-uploaded: 51/73 (244.8G/354.7G) | committed: 0/80 (0.0/354.7G) | ignored: 0
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 22 | committing: 0 | waiting: 104
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------------------------------------------------
chaiml-pony-d3-g46-pv2-41910-v2-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------- 2026-03-05 12:41:32 (0:10:00) ----------
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Files: hashed 80/80 (354.7G/354.7G) | pre-uploaded: 59/73 (284.8G/354.7G) | committed: 0/80 (0.0/354.7G) | ignored: 0
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 14 | committing: 0 | waiting: 112
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------------------------------------------------
chaiml-pony-d3-g46-pv2-41910-v2-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------- 2026-03-05 12:42:32 (0:11:00) ----------
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Files: hashed 80/80 (354.7G/354.7G) | pre-uploaded: 67/73 (324.7G/354.7G) | committed: 0/80 (0.0/354.7G) | ignored: 0
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 6 | committing: 0 | waiting: 120
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------------------------------------------------
chaiml-pony-d3-g46-pv2-41910-v2-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------- 2026-03-05 12:43:32 (0:12:00) ----------
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Files: hashed 80/80 (354.7G/354.7G) | pre-uploaded: 73/73 (354.7G/354.7G) | committed: 0/80 (0.0/354.7G) | ignored: 0
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 0 | committing: 1 | waiting: 125
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------------------------------------------------
chaiml-pony-d3-g46-pv2-41910-v2-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------- 2026-03-05 12:44:32 (0:13:00) ----------
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Files: hashed 80/80 (354.7G/354.7G) | pre-uploaded: 73/73 (354.7G/354.7G) | committed: 0/80 (0.0/354.7G) | ignored: 0
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 0 | committing: 1 | waiting: 125
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------------------------------------------------
chaiml-pony-d3-g46-pv2-41910-v2-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------- 2026-03-05 12:45:32 (0:14:00) ----------
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Files: hashed 80/80 (354.7G/354.7G) | pre-uploaded: 73/73 (354.7G/354.7G) | committed: 50/80 (204.9G/354.7G) | ignored: 0
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 0 | committing: 1 | waiting: 125
chaiml-pony-d3-g46-pv2-41910-v2-uploader: ---------------------------------------------------
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Processed model ChaiML/pony-d3-g46-pv2-lr5e6ep1r64b8 in 5072.322s
chaiml-pony-d3-g46-pv2-41910-v2-uploader: creating bucket guanaco-vllm-models
chaiml-pony-d3-g46-pv2-41910-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3-g46-pv2-41910-v2-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-pony-d3-g46-pv2-41910-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-pony-d3-g46-pv2-41910-v2-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-pony-d3-g46-pv2-41910-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3-g46-pv2-41910-v2-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-pony-d3-g46-pv2-41910-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3-g46-pv2-41910-v2-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-pony-d3-g46-pv2-41910-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3-g46-pv2-41910-v2-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-pony-d3-g46-pv2-41910-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3-g46-pv2-41910-v2-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-pony-d3-g46-pv2-41910-v2-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-pony-d3-g46-pv2-41910-v2-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-pony-d3-g46-pv2-41910-v2-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-pony-d3-g46-pv2-41910-v2-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-pony-d3-g46-pv2-41910-v2-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-pony-d3-g46-pv2-41910-v2-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/config.json
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/recipe.yaml
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model.safetensors.index.json
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/generation_config.json
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/chat_template.jinja
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/tokenizer.json
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/special_tokens_map.json
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/tokenizer_config.json
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00072-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00072-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00071-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00071-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00061-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00061-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00031-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00031-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00038-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00038-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00046-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00046-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00047-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00047-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00040-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00040-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00006-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00006-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00033-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00033-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00007-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00007-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00012-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00012-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00011-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00011-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00019-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00019-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00014-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00014-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00027-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00027-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00052-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00052-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00049-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00049-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00018-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00018-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00029-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00029-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00016-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00016-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00055-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00055-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00003-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00003-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00013-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00013-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00010-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00010-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00065-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00065-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00060-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00060-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00028-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00028-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00001-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00001-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00058-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00058-of-00072.safetensors
chaiml-pony-d3-g46-pv2-41910-v2-uploader: cp /dev/shm/model_output/model-00050-of-00072.safetensors s3://guanaco-vllm-models/chaiml-pony-d3-g46-pv2-41910-v2/default/model-00050-of-00072.safetensors
Job chaiml-pony-d3-g46-pv2-41910-v2-uploader completed after 5242.9s with status: succeeded
Stopping job with name chaiml-pony-d3-g46-pv2-41910-v2-uploader
Pipeline stage VLLMUploader completed in 5244.95s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 1.12s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-pony-d3-g46-pv2-41910-v2
Waiting for inference service chaiml-pony-d3-g46-pv2-41910-v2 to be ready
Inference service chaiml-pony-d3-g46-pv2-41910-v2 ready after 550.9722039699554s
Pipeline stage VLLMDeployer completed in 551.44s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.403165578842163s
Received healthy response to inference request in 2.8997802734375s
Received healthy response to inference request in 2.4764902591705322s
Received healthy response to inference request in 2.5433876514434814s
Received healthy response to inference request in 2.7658724784851074s
Received healthy response to inference request in 2.3588433265686035s
Received healthy response to inference request in 2.013683319091797s
Received healthy response to inference request in 2.6546454429626465s
Received healthy response to inference request in 2.67671275138855s
Received healthy response to inference request in 2.5715320110321045s
Received healthy response to inference request in 2.169534683227539s
Received healthy response to inference request in 2.21360445022583s
Received healthy response to inference request in 2.450662136077881s
Received healthy response to inference request in 2.747696876525879s
Received healthy response to inference request in 2.877742290496826s
Received healthy response to inference request in 2.2225003242492676s
Received healthy response to inference request in 2.3577780723571777s
Received healthy response to inference request in 2.499983310699463s
Received healthy response to inference request in 2.0162668228149414s
Received healthy response to inference request in 3.193371295928955s
Received healthy response to inference request in 2.07601261138916s
Received healthy response to inference request in 2.035649538040161s
Received healthy response to inference request in 2.294074535369873s
Received healthy response to inference request in 2.334834337234497s
Received healthy response to inference request in 2.292511224746704s
Received healthy response to inference request in 2.6859045028686523s
Received healthy response to inference request in 2.9417779445648193s
Received healthy response to inference request in 2.2919323444366455s
Received healthy response to inference request in 2.1649351119995117s
Received healthy response to inference request in 2.454306125640869s
30 requests
0 failed requests
5th percentile: 2.0249890446662904
10th percentile: 2.0719763040542603
20th percentile: 2.204790496826172
30th percentile: 2.2923375606536864
40th percentile: 2.3486005783081056
50th percentile: 2.426913857460022
60th percentile: 2.4858874797821047
70th percentile: 2.5964660406112667
80th percentile: 2.698262977600098
90th percentile: 2.8799460887908936
95th percentile: 2.9228789925575254
99th percentile: 3.120409224033356
mean time: 2.456173054377238
Pipeline stage StressChecker completed in 79.39s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.35s
Shutdown handler de-registered
chaiml-pony-d3-g46-pv2-_41910_v2 status is now deployed due to DeploymentManager action
chaiml-pony-d3-g46-pv2-_41910_v2 status is now inactive due to auto deactivation removed underperforming models
chaiml-pony-d3-g46-pv2-_41910_v2 status is now torndown due to DeploymentManager action