Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-pony-d2-q235b-pv-18495-v1-uploader
Waiting for job on chaiml-pony-d2-q235b-pv-18495-v1-uploader to finish
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Using quantization_mode: w4a16
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Checking if ChaiML/pony-d2-q235b-pv1-lr5e6ep2r64g4-W4A16 already exists in ChaiML
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Downloading snapshot of ChaiML/pony-d2-q235b-pv1-lr5e6ep2r64g4...
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Downloaded in 158.030s
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Applying quantization...
chaiml-pony-d2-q235b-pv-18495-v1-uploader: [33;1m2026-02-27 18:25:30 WARNING modeling_utils.py L4670: `torch_dtype` is deprecated! Use `dtype` instead![0m
chaiml-pony-d2-q235b-pv-18495-v1-uploader: [38;20m2026-02-27 18:25:47 INFO base.py L366: using torch.bfloat16 for quantization tuning[0m
chaiml-pony-d2-q235b-pv-18495-v1-uploader: [38;20m2026-02-27 18:25:51 INFO base.py L1145: start to compute imatrix[0m
chaiml-pony-d2-q235b-pv-18495-v1-uploader: /usr/local/lib/python3.12/dist-packages/torch/backends/cuda/__init__.py:131: UserWarning: Please use the new API settings to control TF32 behavior, such as torch.backends.cudnn.conv.fp32_precision = 'tf32' or torch.backends.cuda.matmul.fp32_precision = 'ieee'. Old settings, e.g, torch.backends.cuda.matmul.allow_tf32 = True, torch.backends.cudnn.allow_tf32 = True, allowTF32CuDNN() and allowTF32CuBLAS() will be deprecated after Pytorch 2.9. Please see https://pytorch.org/docs/main/notes/cuda.html#tensorfloat-32-tf32-on-ampere-and-later-devices (Triggered internally at /pytorch/aten/src/ATen/Context.cpp:80.)
chaiml-pony-d2-q235b-pv-18495-v1-uploader: return torch._C._get_cublas_allow_tf32()
chaiml-pony-d2-q235b-pv-18495-v1-uploader: W0227 18:26:56.421000 7 torch/_dynamo/convert_frame.py:1358] [6/8] torch._dynamo hit config.recompile_limit (8)
chaiml-pony-d2-q235b-pv-18495-v1-uploader: W0227 18:26:56.421000 7 torch/_dynamo/convert_frame.py:1358] [6/8] function: 'forward' (/usr/local/lib/python3.12/dist-packages/transformers/models/qwen3_moe/modeling_qwen3_moe.py:208)
chaiml-pony-d2-q235b-pv-18495-v1-uploader: W0227 18:26:56.421000 7 torch/_dynamo/convert_frame.py:1358] [6/8] last reason: 6/7: self._modules['up_proj'].imatrix_cnt == 1356 # module.imatrix_cnt += input.shape[0] # auto_round/compressors/base.py:1179 in get_imatrix_hook (HINT: torch.compile considers integer attributes of the nn.Module to be static. If you are observing recompilation, you might want to make this integer dynamic using torch._dynamo.config.allow_unspec_int_on_nn_module = True, or convert this integer into a tensor.)
chaiml-pony-d2-q235b-pv-18495-v1-uploader: W0227 18:26:56.421000 7 torch/_dynamo/convert_frame.py:1358] [6/8] To log all recompilation reasons, use TORCH_LOGS="recompiles".
chaiml-pony-d2-q235b-pv-18495-v1-uploader: W0227 18:26:56.421000 7 torch/_dynamo/convert_frame.py:1358] [6/8] To diagnose recompilation issues, see https://pytorch.org/docs/main/torch.compiler_troubleshooting.html
chaiml-pony-d2-q235b-pv-18495-v1-uploader: W0227 18:26:59.396000 7 torch/_dynamo/convert_frame.py:1358] [3/8] torch._dynamo hit config.recompile_limit (8)
chaiml-pony-d2-q235b-pv-18495-v1-uploader: W0227 18:26:59.396000 7 torch/_dynamo/convert_frame.py:1358] [3/8] function: 'forward' (/usr/local/lib/python3.12/dist-packages/transformers/models/qwen3_moe/modeling_qwen3_moe.py:305)
chaiml-pony-d2-q235b-pv-18495-v1-uploader: W0227 18:26:59.396000 7 torch/_dynamo/convert_frame.py:1358] [3/8] last reason: 3/7: self._modules['self_attn']._modules['k_proj'].imatrix_cnt == 56 # module.imatrix_cnt += input.shape[0] # auto_round/compressors/base.py:1179 in get_imatrix_hook (HINT: torch.compile considers integer attributes of the nn.Module to be static. If you are observing recompilation, you might want to make this integer dynamic using torch._dynamo.config.allow_unspec_int_on_nn_module = True, or convert this integer into a tensor.)
chaiml-pony-d2-q235b-pv-18495-v1-uploader: W0227 18:26:59.396000 7 torch/_dynamo/convert_frame.py:1358] [3/8] To log all recompilation reasons, use TORCH_LOGS="recompiles".
chaiml-pony-d2-q235b-pv-18495-v1-uploader: W0227 18:26:59.396000 7 torch/_dynamo/convert_frame.py:1358] [3/8] To diagnose recompilation issues, see https://pytorch.org/docs/main/torch.compiler_troubleshooting.html
chaiml-pony-d2-q235b-pv-18495-v1-uploader: [33;1m2026-02-27 18:27:17 WARNING gguf.py L297: please use more data via setting `nsamples` to improve accuracy as calibration activations contain 0[0m
Failed to get response for submission chaiml-pony-v1-q235b-l_99625_v11: HTTPConnectionPool(host='chaiml-pony-v1-q235b-l-99625-v11-predictor.tenant-chaiml-guanaco.k2.chaiverse.com', port=80): Read timed out. (read timeout=20.0)
Failed to get response for submission chaiml-pony-v1-q235b-l_99625_v11: HTTPConnectionPool(host='chaiml-pony-v1-q235b-l-99625-v11-predictor.tenant-chaiml-guanaco.k2.chaiverse.com', port=80): Read timed out. (read timeout=20.0)
Unable to record family friendly update due to error: Invalid JSON input: Expecting value: line 1 column 1 (char 0)
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Checking if ChaiML/pony-d2-q235b-pv1-lr5e6ep2r64g4-W4A16 already exists in ChaiML
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Creating repo ChaiML/pony-d2-q235b-pv1-lr5e6ep2r64g4-W4A16 and uploading /dev/shm/model_output to it
chaiml-pony-d2-q235b-pv-18495-v1-uploader: ---------- 2026-02-27 19:40:44 (0:00:00) ----------
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Files: hashed 11/38 (26.1M/131.9G) | pre-uploaded: 0/1 (0.0/131.9G) (+27 unsure) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Workers: hashing: 27 | get upload mode: 0 | pre-uploading: 1 | committing: 0 | waiting: 98
chaiml-pony-d2-q235b-pv-18495-v1-uploader: ---------------------------------------------------
chaiml-pony-d2-q235b-pv-18495-v1-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-pony-d2-q235b-pv-18495-v1-uploader: ---------- 2026-02-27 19:41:44 (0:01:00) ----------
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 2/28 (1.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 26 | committing: 0 | waiting: 100
chaiml-pony-d2-q235b-pv-18495-v1-uploader: ---------------------------------------------------
chaiml-pony-d2-q235b-pv-18495-v1-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-pony-d2-q235b-pv-18495-v1-uploader: ---------- 2026-02-27 19:42:44 (0:02:00) ----------
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 10/28 (41.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 18 | committing: 0 | waiting: 108
chaiml-pony-d2-q235b-pv-18495-v1-uploader: ---------------------------------------------------
chaiml-pony-d2-q235b-pv-18495-v1-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-pony-d2-q235b-pv-18495-v1-uploader: ---------- 2026-02-27 19:43:44 (0:03:00) ----------
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 18/28 (81.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 10 | committing: 0 | waiting: 116
chaiml-pony-d2-q235b-pv-18495-v1-uploader: ---------------------------------------------------
chaiml-pony-d2-q235b-pv-18495-v1-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-pony-d2-q235b-pv-18495-v1-uploader: ---------- 2026-02-27 19:44:44 (0:04:00) ----------
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 26/28 (121.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 2 | committing: 0 | waiting: 124
chaiml-pony-d2-q235b-pv-18495-v1-uploader: ---------------------------------------------------
chaiml-pony-d2-q235b-pv-18495-v1-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-pony-d2-q235b-pv-18495-v1-uploader: ---------- 2026-02-27 19:45:44 (0:05:00) ----------
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 28/28 (131.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 0 | committing: 1 | waiting: 125
chaiml-pony-d2-q235b-pv-18495-v1-uploader: ---------------------------------------------------
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Processed model ChaiML/pony-d2-q235b-pv1-lr5e6ep2r64g4 in 4996.797s
chaiml-pony-d2-q235b-pv-18495-v1-uploader: creating bucket guanaco-vllm-models
chaiml-pony-d2-q235b-pv-18495-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d2-q235b-pv-18495-v1-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-pony-d2-q235b-pv-18495-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-pony-d2-q235b-pv-18495-v1-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-pony-d2-q235b-pv-18495-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d2-q235b-pv-18495-v1-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-pony-d2-q235b-pv-18495-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d2-q235b-pv-18495-v1-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-pony-d2-q235b-pv-18495-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d2-q235b-pv-18495-v1-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-pony-d2-q235b-pv-18495-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d2-q235b-pv-18495-v1-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-pony-d2-q235b-pv-18495-v1-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-pony-d2-q235b-pv-18495-v1-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-pony-d2-q235b-pv-18495-v1-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-pony-d2-q235b-pv-18495-v1-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-pony-d2-q235b-pv-18495-v1-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-pony-d2-q235b-pv-18495-v1-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/chat_template.jinja
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/tokenizer_config.json
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/added_tokens.json
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/generation_config.json
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/config.json
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/special_tokens_map.json
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/quantization_config.json
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/merges.txt
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model.safetensors.index.json
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/tokenizer.json
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/vocab.json
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00027-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00020-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00015-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00006-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00009-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00026-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00021-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00001-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00019-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00016-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00017-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00007-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00004-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00012-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00013-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00022-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00018-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00024-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00005-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00008-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00010-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00025-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00014-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00011-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00002-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00023-of-00027.safetensors
chaiml-pony-d2-q235b-pv-18495-v1-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-18495-v1/default/model-00003-of-00027.safetensors
Job chaiml-pony-d2-q235b-pv-18495-v1-uploader completed after 5090.69s with status: succeeded
Stopping job with name chaiml-pony-d2-q235b-pv-18495-v1-uploader
Pipeline stage VLLMUploader completed in 5091.39s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.45s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-pony-d2-q235b-pv-18495-v1
Waiting for inference service chaiml-pony-d2-q235b-pv-18495-v1 to be ready
Inference service chaiml-pony-d2-q235b-pv-18495-v1 ready after 460.3215334415436s
Pipeline stage VLLMDeployer completed in 460.73s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.043283224105835s
Received healthy response to inference request in 2.24052095413208s
Received healthy response to inference request in 1.8713524341583252s
Received healthy response to inference request in 1.829070806503296s
Received healthy response to inference request in 1.9926748275756836s
Received healthy response to inference request in 2.260028123855591s
Received healthy response to inference request in 1.8704168796539307s
Received healthy response to inference request in 1.9067044258117676s
Received healthy response to inference request in 1.8612756729125977s
Received healthy response to inference request in 1.8676321506500244s
Received healthy response to inference request in 1.877878189086914s
Received healthy response to inference request in 1.862109661102295s
Received healthy response to inference request in 1.8629841804504395s
Received healthy response to inference request in 3.211517333984375s
Received healthy response to inference request in 1.8187458515167236s
Received healthy response to inference request in 2.040614604949951s
Received healthy response to inference request in 2.197322368621826s
Received healthy response to inference request in 1.891228437423706s
Received healthy response to inference request in 2.368730068206787s
Received healthy response to inference request in 1.8234477043151855s
Received healthy response to inference request in 1.865858554840088s
Received healthy response to inference request in 1.950838327407837s
Received healthy response to inference request in 2.036480188369751s
Received healthy response to inference request in 2.016326427459717s
Received healthy response to inference request in 1.868532657623291s
Received healthy response to inference request in 1.9745252132415771s
Received healthy response to inference request in 2.122912883758545s
Received healthy response to inference request in 2.177708148956299s
Received healthy response to inference request in 1.8643629550933838s
Received healthy response to inference request in 1.8959598541259766s
30 requests
0 failed requests
5th percentile: 1.8259781002998352
10th percentile: 1.8580551862716674
20th percentile: 1.864087200164795
30th percentile: 1.8682625055313111
40th percentile: 1.8752678871154784
50th percentile: 1.901332139968872
60th percentile: 1.9817850589752197
70th percentile: 2.037720513343811
80th percentile: 2.133871936798096
90th percentile: 2.2424716711044312
95th percentile: 2.3198141932487486
99th percentile: 2.9671090269088753
mean time: 2.01570143699646
Pipeline stage StressChecker completed in 64.40s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.69s
Shutdown handler de-registered
chaiml-pony-d2-q235b-pv_18495_v1 status is now deployed due to DeploymentManager action
chaiml-pony-d2-q235b-pv_18495_v1 status is now inactive due to auto deactivation removed underperforming models
chaiml-pony-d2-q235b-pv_18495_v1 status is now torndown due to DeploymentManager action