Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-muster-v3c-kakit-51472-v1-uploader
Waiting for job on chaiml-muster-v3c-kakit-51472-v1-uploader to finish
chaiml-muster-v3c-kakit-51472-v1-uploader: Using quantization_mode: w4a16
chaiml-muster-v3c-kakit-51472-v1-uploader: Checking if ChaiML/muster-v3c-kakit-q235b-lr1e4ep1r64g4-W4A16 already exists in ChaiML
chaiml-muster-v3c-kakit-51472-v1-uploader: Downloading snapshot of ChaiML/muster-v3c-kakit-q235b-lr1e4ep1r64g4...
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
chaiml-muster-v3c-kakit-51472-v1-uploader: Downloaded in 149.587s
chaiml-muster-v3c-kakit-51472-v1-uploader: Applying quantization...
chaiml-muster-v3c-kakit-51472-v1-uploader: [33;1m2026-02-14 21:07:15 WARNING modeling_utils.py L4670: `torch_dtype` is deprecated! Use `dtype` instead![0m
chaiml-muster-v3c-kakit-51472-v1-uploader: [38;20m2026-02-14 21:07:40 INFO base.py L366: using torch.bfloat16 for quantization tuning[0m
chaiml-muster-v3c-kakit-51472-v1-uploader: [38;20m2026-02-14 21:07:44 INFO base.py L1145: start to compute imatrix[0m
chaiml-muster-v3c-kakit-51472-v1-uploader: /usr/local/lib/python3.12/dist-packages/torch/backends/cuda/__init__.py:131: UserWarning: Please use the new API settings to control TF32 behavior, such as torch.backends.cudnn.conv.fp32_precision = 'tf32' or torch.backends.cuda.matmul.fp32_precision = 'ieee'. Old settings, e.g, torch.backends.cuda.matmul.allow_tf32 = True, torch.backends.cudnn.allow_tf32 = True, allowTF32CuDNN() and allowTF32CuBLAS() will be deprecated after Pytorch 2.9. Please see https://pytorch.org/docs/main/notes/cuda.html#tensorfloat-32-tf32-on-ampere-and-later-devices (Triggered internally at /pytorch/aten/src/ATen/Context.cpp:80.)
chaiml-muster-v3c-kakit-51472-v1-uploader: return torch._C._get_cublas_allow_tf32()
chaiml-muster-v3c-kakit-51472-v1-uploader: W0214 21:08:47.323000 7 torch/_dynamo/convert_frame.py:1358] [6/8] torch._dynamo hit config.recompile_limit (8)
chaiml-muster-v3c-kakit-51472-v1-uploader: W0214 21:08:47.323000 7 torch/_dynamo/convert_frame.py:1358] [6/8] function: 'forward' (/usr/local/lib/python3.12/dist-packages/transformers/models/qwen3_moe/modeling_qwen3_moe.py:208)
chaiml-muster-v3c-kakit-51472-v1-uploader: W0214 21:08:47.323000 7 torch/_dynamo/convert_frame.py:1358] [6/8] last reason: 6/7: self._modules['up_proj'].imatrix_cnt == 1347 # module.imatrix_cnt += input.shape[0] # auto_round/compressors/base.py:1179 in get_imatrix_hook (HINT: torch.compile considers integer attributes of the nn.Module to be static. If you are observing recompilation, you might want to make this integer dynamic using torch._dynamo.config.allow_unspec_int_on_nn_module = True, or convert this integer into a tensor.)
chaiml-muster-v3c-kakit-51472-v1-uploader: W0214 21:08:47.323000 7 torch/_dynamo/convert_frame.py:1358] [6/8] To log all recompilation reasons, use TORCH_LOGS="recompiles".
chaiml-muster-v3c-kakit-51472-v1-uploader: W0214 21:08:47.323000 7 torch/_dynamo/convert_frame.py:1358] [6/8] To diagnose recompilation issues, see https://pytorch.org/docs/main/torch.compiler_troubleshooting.html
chaiml-muster-v3c-kakit-51472-v1-uploader: W0214 21:08:50.259000 7 torch/_dynamo/convert_frame.py:1358] [3/8] torch._dynamo hit config.recompile_limit (8)
chaiml-muster-v3c-kakit-51472-v1-uploader: W0214 21:08:50.259000 7 torch/_dynamo/convert_frame.py:1358] [3/8] function: 'forward' (/usr/local/lib/python3.12/dist-packages/transformers/models/qwen3_moe/modeling_qwen3_moe.py:305)
chaiml-muster-v3c-kakit-51472-v1-uploader: W0214 21:08:50.259000 7 torch/_dynamo/convert_frame.py:1358] [3/8] last reason: 3/7: self._modules['self_attn']._modules['k_proj'].imatrix_cnt == 56 # module.imatrix_cnt += input.shape[0] # auto_round/compressors/base.py:1179 in get_imatrix_hook (HINT: torch.compile considers integer attributes of the nn.Module to be static. If you are observing recompilation, you might want to make this integer dynamic using torch._dynamo.config.allow_unspec_int_on_nn_module = True, or convert this integer into a tensor.)
chaiml-muster-v3c-kakit-51472-v1-uploader: W0214 21:08:50.259000 7 torch/_dynamo/convert_frame.py:1358] [3/8] To log all recompilation reasons, use TORCH_LOGS="recompiles".
chaiml-muster-v3c-kakit-51472-v1-uploader: W0214 21:08:50.259000 7 torch/_dynamo/convert_frame.py:1358] [3/8] To diagnose recompilation issues, see https://pytorch.org/docs/main/torch.compiler_troubleshooting.html
chaiml-muster-v3c-kakit-51472-v1-uploader: [33;1m2026-02-14 21:09:08 WARNING gguf.py L297: please use more data via setting `nsamples` to improve accuracy as calibration activations contain 0[0m
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-muster-v3c-kakit-51472-v1-uploader: Checking if ChaiML/muster-v3c-kakit-q235b-lr1e4ep1r64g4-W4A16 already exists in ChaiML
chaiml-muster-v3c-kakit-51472-v1-uploader: Creating repo ChaiML/muster-v3c-kakit-q235b-lr1e4ep1r64g4-W4A16 and uploading /dev/shm/model_output to it
chaiml-muster-v3c-kakit-51472-v1-uploader: ---------- 2026-02-14 22:23:38 (0:00:00) ----------
chaiml-muster-v3c-kakit-51472-v1-uploader: Files: hashed 11/38 (26.1M/131.9G) | pre-uploaded: 0/1 (0.0/131.9G) (+27 unsure) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-muster-v3c-kakit-51472-v1-uploader: Workers: hashing: 27 | get upload mode: 0 | pre-uploading: 1 | committing: 0 | waiting: 98
chaiml-muster-v3c-kakit-51472-v1-uploader: ---------------------------------------------------
chaiml-muster-v3c-kakit-51472-v1-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-muster-v3c-kakit-51472-v1-uploader: ---------- 2026-02-14 22:24:38 (0:01:00) ----------
chaiml-muster-v3c-kakit-51472-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 2/28 (1.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-muster-v3c-kakit-51472-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 26 | committing: 0 | waiting: 100
chaiml-muster-v3c-kakit-51472-v1-uploader: ---------------------------------------------------
HTTP Request: %s %s "%s %d %s"
chaiml-muster-v3c-kakit-51472-v1-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-muster-v3c-kakit-51472-v1-uploader: ---------- 2026-02-14 22:25:38 (0:02:00) ----------
chaiml-muster-v3c-kakit-51472-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 10/28 (41.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-muster-v3c-kakit-51472-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 18 | committing: 0 | waiting: 108
chaiml-muster-v3c-kakit-51472-v1-uploader: ---------------------------------------------------
chaiml-muster-v3c-kakit-51472-v1-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-muster-v3c-kakit-51472-v1-uploader: ---------- 2026-02-14 22:26:38 (0:03:00) ----------
chaiml-muster-v3c-kakit-51472-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 18/28 (81.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-muster-v3c-kakit-51472-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 10 | committing: 0 | waiting: 116
chaiml-muster-v3c-kakit-51472-v1-uploader: ---------------------------------------------------
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-muster-v3c-kakit-51472-v1-uploader:
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
[K[F
chaiml-muster-v3c-kakit-51472-v1-uploader: ---------- 2026-02-14 22:27:38 (0:04:00) ----------
chaiml-muster-v3c-kakit-51472-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 28/28 (131.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-muster-v3c-kakit-51472-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 0 | committing: 1 | waiting: 125
chaiml-muster-v3c-kakit-51472-v1-uploader: ---------------------------------------------------
chaiml-muster-v3c-kakit-51472-v1-uploader: Processed model ChaiML/muster-v3c-kakit-q235b-lr1e4ep1r64g4 in 5035.133s
chaiml-muster-v3c-kakit-51472-v1-uploader: creating bucket guanaco-vllm-models
chaiml-muster-v3c-kakit-51472-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v3c-kakit-51472-v1-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-muster-v3c-kakit-51472-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-muster-v3c-kakit-51472-v1-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-muster-v3c-kakit-51472-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v3c-kakit-51472-v1-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-muster-v3c-kakit-51472-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v3c-kakit-51472-v1-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-muster-v3c-kakit-51472-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v3c-kakit-51472-v1-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-muster-v3c-kakit-51472-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-muster-v3c-kakit-51472-v1-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-muster-v3c-kakit-51472-v1-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-muster-v3c-kakit-51472-v1-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-muster-v3c-kakit-51472-v1-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-muster-v3c-kakit-51472-v1-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-muster-v3c-kakit-51472-v1-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-muster-v3c-kakit-51472-v1-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/added_tokens.json
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/generation_config.json
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/chat_template.jinja
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/tokenizer_config.json
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/config.json
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/quantization_config.json
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/special_tokens_map.json
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/tokenizer.json
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/merges.txt
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/vocab.json
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00027-of-00027.safetensors
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00024-of-00027.safetensors
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00008-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00014-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00017-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00013-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00003-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00004-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00006-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00021-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00016-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00020-of-00027.safetensors
Failed to get response for submission chaiml-kimid-v9-opusdv_83165_v14: HTTPConnectionPool(host='guanaco-model-mesh-load-balancer.model-mesh.k2.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00007-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00025-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00023-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00018-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00026-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00005-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00002-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00011-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00019-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00009-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00022-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00012-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00001-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00015-of-00027.safetensors
chaiml-muster-v3c-kakit-51472-v1-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-muster-v3c-kakit-51472-v1/default/model-00010-of-00027.safetensors
Job chaiml-muster-v3c-kakit-51472-v1-uploader completed after 5456.54s with status: succeeded
Stopping job with name chaiml-muster-v3c-kakit-51472-v1-uploader
Pipeline stage VLLMUploader completed in 5456.97s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.16s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-muster-v3c-kakit-51472-v1
Waiting for inference service chaiml-muster-v3c-kakit-51472-v1 to be ready
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
Inference service chaiml-muster-v3c-kakit-51472-v1 ready after 660.3860874176025s
Pipeline stage VLLMDeployer completed in 660.84s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.1966562271118164s
Received healthy response to inference request in 1.9992010593414307s
Received healthy response to inference request in 2.184905529022217s
Received healthy response to inference request in 2.0105960369110107s
Received healthy response to inference request in 2.044757127761841s
Received healthy response to inference request in 2.3486385345458984s
Received healthy response to inference request in 2.00929856300354s
Received healthy response to inference request in 2.283162832260132s
Received healthy response to inference request in 2.0145137310028076s
Received healthy response to inference request in 2.0881311893463135s
Received healthy response to inference request in 1.9649157524108887s
Received healthy response to inference request in 2.5140974521636963s
Received healthy response to inference request in 1.970865249633789s
Received healthy response to inference request in 2.0376269817352295s
Received healthy response to inference request in 2.314237594604492s
Received healthy response to inference request in 1.8502464294433594s
Received healthy response to inference request in 2.1764097213745117s
Received healthy response to inference request in 1.957690954208374s
Received healthy response to inference request in 2.2047789096832275s
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Received healthy response to inference request in 2.268826723098755s
Received healthy response to inference request in 1.9517276287078857s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 2.3589019775390625s
Received healthy response to inference request in 3.7251453399658203s
Received healthy response to inference request in 1.9623870849609375s
Received healthy response to inference request in 2.2967305183410645s
Received healthy response to inference request in 1.9364476203918457s
Received healthy response to inference request in 1.9825592041015625s
Received healthy response to inference request in 1.918323040008545s
Received healthy response to inference request in 2.078594923019409s
Received healthy response to inference request in 1.9616742134094238s
30 requests
0 failed requests
5th percentile: 1.9264791011810303
10th percentile: 1.9501996278762816
20th percentile: 1.9622445106506348
30th percentile: 1.9790510177612304
40th percentile: 2.0100770473480223
50th percentile: 2.041192054748535
60th percentile: 2.1234426021575925
70th percentile: 2.1990930318832396
80th percentile: 2.2858763694763184
90th percentile: 2.349664878845215
95th percentile: 2.4442594885826106
99th percentile: 3.3739414525032054
mean time: 2.1537349383036295
Pipeline stage StressChecker completed in 68.49s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.88s
Shutdown handler de-registered
chaiml-muster-v3c-kakit_51472_v1 status is now deployed due to DeploymentManager action
chaiml-muster-v3c-kakit_51472_v1 status is now inactive due to auto deactivation removed underperforming models