Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-kimid-v8b-kimidv-77693-v1-uploader
Waiting for job on chaiml-kimid-v8b-kimidv-77693-v1-uploader to finish
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Using quantization_mode: w4a16
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Checking if ChaiML/kimid-v8b-kimidv5a-lr5e6ep2r64g4b01-W4A16 already exists in ChaiML
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Downloading snapshot of ChaiML/kimid-v8b-kimidv5a-lr5e6ep2r64g4b01...
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Downloaded in 196.602s
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Applying quantization...
chaiml-kimid-v8b-kimidv-77693-v1-uploader: [33;1m2026-02-19 11:33:47 WARNING modeling_utils.py L4670: `torch_dtype` is deprecated! Use `dtype` instead![0m
chaiml-kimid-v8b-kimidv-77693-v1-uploader: [38;20m2026-02-19 11:34:06 INFO base.py L366: using torch.bfloat16 for quantization tuning[0m
chaiml-kimid-v8b-kimidv-77693-v1-uploader: [38;20m2026-02-19 11:34:11 INFO base.py L1145: start to compute imatrix[0m
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-kimid-v8b-kimidv-77693-v1-uploader: /usr/local/lib/python3.12/dist-packages/torch/backends/cuda/__init__.py:131: UserWarning: Please use the new API settings to control TF32 behavior, such as torch.backends.cudnn.conv.fp32_precision = 'tf32' or torch.backends.cuda.matmul.fp32_precision = 'ieee'. Old settings, e.g, torch.backends.cuda.matmul.allow_tf32 = True, torch.backends.cudnn.allow_tf32 = True, allowTF32CuDNN() and allowTF32CuBLAS() will be deprecated after Pytorch 2.9. Please see https://pytorch.org/docs/main/notes/cuda.html#tensorfloat-32-tf32-on-ampere-and-later-devices (Triggered internally at /pytorch/aten/src/ATen/Context.cpp:80.)
chaiml-kimid-v8b-kimidv-77693-v1-uploader: return torch._C._get_cublas_allow_tf32()
chaiml-kimid-v8b-kimidv-77693-v1-uploader: W0219 11:35:13.813000 7 torch/_dynamo/convert_frame.py:1358] [6/8] torch._dynamo hit config.recompile_limit (8)
chaiml-kimid-v8b-kimidv-77693-v1-uploader: W0219 11:35:13.813000 7 torch/_dynamo/convert_frame.py:1358] [6/8] function: 'forward' (/usr/local/lib/python3.12/dist-packages/transformers/models/qwen3_moe/modeling_qwen3_moe.py:208)
chaiml-kimid-v8b-kimidv-77693-v1-uploader: W0219 11:35:13.813000 7 torch/_dynamo/convert_frame.py:1358] [6/8] last reason: 6/7: self._modules['up_proj'].imatrix_cnt == 1341 # module.imatrix_cnt += input.shape[0] # auto_round/compressors/base.py:1179 in get_imatrix_hook (HINT: torch.compile considers integer attributes of the nn.Module to be static. If you are observing recompilation, you might want to make this integer dynamic using torch._dynamo.config.allow_unspec_int_on_nn_module = True, or convert this integer into a tensor.)
chaiml-kimid-v8b-kimidv-77693-v1-uploader: W0219 11:35:13.813000 7 torch/_dynamo/convert_frame.py:1358] [6/8] To log all recompilation reasons, use TORCH_LOGS="recompiles".
chaiml-kimid-v8b-kimidv-77693-v1-uploader: W0219 11:35:13.813000 7 torch/_dynamo/convert_frame.py:1358] [6/8] To diagnose recompilation issues, see https://pytorch.org/docs/main/torch.compiler_troubleshooting.html
chaiml-kimid-v8b-kimidv-77693-v1-uploader: W0219 11:35:16.666000 7 torch/_dynamo/convert_frame.py:1358] [3/8] torch._dynamo hit config.recompile_limit (8)
chaiml-kimid-v8b-kimidv-77693-v1-uploader: W0219 11:35:16.666000 7 torch/_dynamo/convert_frame.py:1358] [3/8] function: 'forward' (/usr/local/lib/python3.12/dist-packages/transformers/models/qwen3_moe/modeling_qwen3_moe.py:305)
chaiml-kimid-v8b-kimidv-77693-v1-uploader: W0219 11:35:16.666000 7 torch/_dynamo/convert_frame.py:1358] [3/8] last reason: 3/7: self._modules['self_attn']._modules['k_proj'].imatrix_cnt == 56 # module.imatrix_cnt += input.shape[0] # auto_round/compressors/base.py:1179 in get_imatrix_hook (HINT: torch.compile considers integer attributes of the nn.Module to be static. If you are observing recompilation, you might want to make this integer dynamic using torch._dynamo.config.allow_unspec_int_on_nn_module = True, or convert this integer into a tensor.)
chaiml-kimid-v8b-kimidv-77693-v1-uploader: W0219 11:35:16.666000 7 torch/_dynamo/convert_frame.py:1358] [3/8] To log all recompilation reasons, use TORCH_LOGS="recompiles".
chaiml-kimid-v8b-kimidv-77693-v1-uploader: W0219 11:35:16.666000 7 torch/_dynamo/convert_frame.py:1358] [3/8] To diagnose recompilation issues, see https://pytorch.org/docs/main/torch.compiler_troubleshooting.html
chaiml-kimid-v8b-kimidv-77693-v1-uploader: [33;1m2026-02-19 11:35:34 WARNING gguf.py L297: please use more data via setting `nsamples` to improve accuracy as calibration activations contain 0[0m
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Checking if ChaiML/kimid-v8b-kimidv5a-lr5e6ep2r64g4b01-W4A16 already exists in ChaiML
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Creating repo ChaiML/kimid-v8b-kimidv5a-lr5e6ep2r64g4b01-W4A16 and uploading /dev/shm/model_output to it
chaiml-kimid-v8b-kimidv-77693-v1-uploader: ---------- 2026-02-19 12:51:26 (0:00:00) ----------
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Files: hashed 11/38 (26.1M/131.9G) | pre-uploaded: 0/1 (0.0/131.9G) (+28 unsure) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Workers: hashing: 27 | get upload mode: 1 | pre-uploading: 1 | committing: 0 | waiting: 97
chaiml-kimid-v8b-kimidv-77693-v1-uploader: ---------------------------------------------------
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-kimid-v8b-kimidv-77693-v1-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-kimid-v8b-kimidv-77693-v1-uploader: ---------- 2026-02-19 12:52:26 (0:01:00) ----------
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 2/28 (1.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 26 | committing: 0 | waiting: 100
chaiml-kimid-v8b-kimidv-77693-v1-uploader: ---------------------------------------------------
chaiml-kimid-v8b-kimidv-77693-v1-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-kimid-v8b-kimidv-77693-v1-uploader: ---------- 2026-02-19 12:53:26 (0:02:00) ----------
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 10/28 (41.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 18 | committing: 0 | waiting: 108
chaiml-kimid-v8b-kimidv-77693-v1-uploader: ---------------------------------------------------
chaiml-kimid-v8b-kimidv-77693-v1-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-kimid-v8b-kimidv-77693-v1-uploader: ---------- 2026-02-19 12:54:26 (0:03:00) ----------
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 18/28 (81.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 10 | committing: 0 | waiting: 116
chaiml-kimid-v8b-kimidv-77693-v1-uploader: ---------------------------------------------------
chaiml-kimid-v8b-kimidv-77693-v1-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-kimid-v8b-kimidv-77693-v1-uploader: ---------- 2026-02-19 12:55:26 (0:04:00) ----------
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 23/28 (106.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 5 | committing: 0 | waiting: 121
chaiml-kimid-v8b-kimidv-77693-v1-uploader: ---------------------------------------------------
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-kimid-v8b-kimidv-77693-v1-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-kimid-v8b-kimidv-77693-v1-uploader: ---------- 2026-02-19 12:56:26 (0:05:00) ----------
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 28/28 (131.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 0 | committing: 1 | waiting: 125
chaiml-kimid-v8b-kimidv-77693-v1-uploader: ---------------------------------------------------
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Processed model ChaiML/kimid-v8b-kimidv5a-lr5e6ep2r64g4b01 in 5186.892s
chaiml-kimid-v8b-kimidv-77693-v1-uploader: creating bucket guanaco-vllm-models
chaiml-kimid-v8b-kimidv-77693-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v8b-kimidv-77693-v1-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-kimid-v8b-kimidv-77693-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-kimid-v8b-kimidv-77693-v1-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-kimid-v8b-kimidv-77693-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v8b-kimidv-77693-v1-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-kimid-v8b-kimidv-77693-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v8b-kimidv-77693-v1-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-kimid-v8b-kimidv-77693-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v8b-kimidv-77693-v1-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-kimid-v8b-kimidv-77693-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v8b-kimidv-77693-v1-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-kimid-v8b-kimidv-77693-v1-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-kimid-v8b-kimidv-77693-v1-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-kimid-v8b-kimidv-77693-v1-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-kimid-v8b-kimidv-77693-v1-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-kimid-v8b-kimidv-77693-v1-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-kimid-v8b-kimidv-77693-v1-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/added_tokens.json
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/config.json
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/chat_template.jinja
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/generation_config.json
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/merges.txt
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/quantization_config.json
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/special_tokens_map.json
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model.safetensors.index.json
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/vocab.json
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/tokenizer_config.json
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/tokenizer.json
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00027-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00009-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00007-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00024-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00002-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00022-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00004-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00014-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00015-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00008-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00017-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00016-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00021-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00018-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00023-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00025-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00013-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00026-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00001-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00020-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00019-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00011-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00006-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00003-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00010-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00012-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v1-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v1/default/model-00005-of-00027.safetensors
Job chaiml-kimid-v8b-kimidv-77693-v1-uploader completed after 5273.77s with status: succeeded
Stopping job with name chaiml-kimid-v8b-kimidv-77693-v1-uploader
Pipeline stage VLLMUploader completed in 5274.37s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.16s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-kimid-v8b-kimidv-77693-v1
Waiting for inference service chaiml-kimid-v8b-kimidv-77693-v1 to be ready
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Inference service chaiml-kimid-v8b-kimidv-77693-v1 ready after 382.53743743896484s
Pipeline stage VLLMDeployer completed in 383.23s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2843902111053467s
Received healthy response to inference request in 1.9860939979553223s
Received healthy response to inference request in 2.184816360473633s
Received healthy response to inference request in 1.961827278137207s
Received healthy response to inference request in 2.0659725666046143s
Received healthy response to inference request in 2.0624489784240723s
Received healthy response to inference request in 2.056685447692871s
Received healthy response to inference request in 1.9996800422668457s
Received healthy response to inference request in 1.9343879222869873s
Received healthy response to inference request in 2.236877202987671s
Received healthy response to inference request in 2.2449536323547363s
Received healthy response to inference request in 2.0506651401519775s
Received healthy response to inference request in 2.040591239929199s
Received healthy response to inference request in 2.1993210315704346s
Received healthy response to inference request in 1.9926278591156006s
Received healthy response to inference request in 2.026716947555542s
Received healthy response to inference request in 1.9936885833740234s
Received healthy response to inference request in 2.2478065490722656s
Received healthy response to inference request in 2.369550943374634s
Received healthy response to inference request in 2.1143198013305664s
Received healthy response to inference request in 2.0388712882995605s
Received healthy response to inference request in 2.086019277572632s
Received healthy response to inference request in 1.8993875980377197s
Received healthy response to inference request in 1.9362115859985352s
Received healthy response to inference request in 1.9760653972625732s
Received healthy response to inference request in 2.006838798522949s
Received healthy response to inference request in 2.3968417644500732s
Received healthy response to inference request in 1.9616141319274902s
Received healthy response to inference request in 2.0857059955596924s
Received healthy response to inference request in 1.928673267364502s
30 requests
0 failed requests
5th percentile: 1.9312448620796203
10th percentile: 1.9360292196273803
20th percentile: 1.9732177734375
30th percentile: 1.9933703660964965
40th percentile: 2.018765687942505
50th percentile: 2.0456281900405884
60th percentile: 2.063858413696289
70th percentile: 2.094509434700012
80th percentile: 2.206832265853882
90th percentile: 2.251464915275574
95th percentile: 2.331228613853454
99th percentile: 2.3889274263381957
mean time: 2.0789883613586424
Pipeline stage StressChecker completed in 68.16s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.72s
Shutdown handler de-registered
chaiml-kimid-v8b-kimidv_77693_v1 status is now deployed due to DeploymentManager action
chaiml-kimid-v8b-kimidv_77693_v1 status is now inactive due to auto deactivation removed underperforming models
chaiml-kimid-v8b-kimidv_77693_v1 status is now torndown due to DeploymentManager action