developer_uid: zonemercy
submission_id: chaiml-pony-d2-q235b-pv_37537_v1
model_name: chaiml-pony-d2-q235b-pv_37537_v1
model_group: ChaiML/pony-d2-q235b-pv2
status: inactive
timestamp: 2026-03-01T08:17:24+00:00
num_battles: 13327
num_wins: 7339
celo_rating: 1326.64
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/pony-d2-q235b-pv2-lr5e6ep2r64g4
model_architecture: Qwen3MoeForCausalLM
model_num_parameters: 18790207488.0
best_of: 8
max_input_tokens: 2048
max_output_tokens: 80
reward_model: default
display_name: chaiml-pony-d2-q235b-pv_37537_v1
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/pony-d2-q235b-pv2-lr5e6ep2r64g4
model_size: 19B
ranking_group: single
us_pacific_date: 2026-03-01
win_ratio: 0.5506865761236588
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['</s>', '####', '<|im_end|>', '<|assistant|>', '<|user|>', '</think>'], 'max_input_tokens': 2048, 'best_of': 8, 'max_output_tokens': 80}
formatter: {'memory_template': "<|im_start|>system\n{bot_name}'s persona: {memory}<|im_end|>\n", 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-pony-d2-q235b-pv-37537-v1-uploader
Waiting for job on chaiml-pony-d2-q235b-pv-37537-v1-uploader to finish
chaiml-pony-d2-q235b-pv-37537-v1-uploader: Using quantization_mode: w4a16
chaiml-pony-d2-q235b-pv-37537-v1-uploader: Checking if ChaiML/pony-d2-q235b-pv2-lr5e6ep2r64g4-W4A16 already exists in ChaiML
chaiml-pony-d2-q235b-pv-37537-v1-uploader: Downloading snapshot of ChaiML/pony-d2-q235b-pv2-lr5e6ep2r64g4...
chaiml-pony-d2-q235b-pv-37537-v1-uploader: Downloaded in 144.677s
chaiml-pony-d2-q235b-pv-37537-v1-uploader: Applying quantization...
chaiml-pony-d2-q235b-pv-37537-v1-uploader: 2026-02-28 21:35:52 WARNING modeling_utils.py L4670: `torch_dtype` is deprecated! Use `dtype` instead!
chaiml-pony-d2-q235b-pv-37537-v1-uploader: 2026-02-28 21:36:08 INFO base.py L366: using torch.bfloat16 for quantization tuning
chaiml-pony-d2-q235b-pv-37537-v1-uploader: 2026-02-28 21:36:12 INFO base.py L1145: start to compute imatrix
chaiml-pony-d2-q235b-pv-37537-v1-uploader: /usr/local/lib/python3.12/dist-packages/torch/backends/cuda/__init__.py:131: UserWarning: Please use the new API settings to control TF32 behavior, such as torch.backends.cudnn.conv.fp32_precision = 'tf32' or torch.backends.cuda.matmul.fp32_precision = 'ieee'. Old settings, e.g, torch.backends.cuda.matmul.allow_tf32 = True, torch.backends.cudnn.allow_tf32 = True, allowTF32CuDNN() and allowTF32CuBLAS() will be deprecated after Pytorch 2.9. Please see https://pytorch.org/docs/main/notes/cuda.html#tensorfloat-32-tf32-on-ampere-and-later-devices (Triggered internally at /pytorch/aten/src/ATen/Context.cpp:80.)
chaiml-pony-d2-q235b-pv-37537-v1-uploader: return torch._C._get_cublas_allow_tf32()
chaiml-pony-d2-q235b-pv-37537-v1-uploader: W0228 21:37:15.636000 7 torch/_dynamo/convert_frame.py:1358] [6/8] torch._dynamo hit config.recompile_limit (8)
chaiml-pony-d2-q235b-pv-37537-v1-uploader: W0228 21:37:15.636000 7 torch/_dynamo/convert_frame.py:1358] [6/8] function: 'forward' (/usr/local/lib/python3.12/dist-packages/transformers/models/qwen3_moe/modeling_qwen3_moe.py:208)
chaiml-pony-d2-q235b-pv-37537-v1-uploader: W0228 21:37:15.636000 7 torch/_dynamo/convert_frame.py:1358] [6/8] last reason: 6/7: self._modules['up_proj'].imatrix_cnt == 1346 # module.imatrix_cnt += input.shape[0] # auto_round/compressors/base.py:1179 in get_imatrix_hook (HINT: torch.compile considers integer attributes of the nn.Module to be static. If you are observing recompilation, you might want to make this integer dynamic using torch._dynamo.config.allow_unspec_int_on_nn_module = True, or convert this integer into a tensor.)
chaiml-pony-d2-q235b-pv-37537-v1-uploader: W0228 21:37:15.636000 7 torch/_dynamo/convert_frame.py:1358] [6/8] To log all recompilation reasons, use TORCH_LOGS="recompiles".
chaiml-pony-d2-q235b-pv-37537-v1-uploader: W0228 21:37:15.636000 7 torch/_dynamo/convert_frame.py:1358] [6/8] To diagnose recompilation issues, see https://pytorch.org/docs/main/torch.compiler_troubleshooting.html
chaiml-pony-d2-q235b-pv-37537-v1-uploader: W0228 21:37:18.551000 7 torch/_dynamo/convert_frame.py:1358] [3/8] torch._dynamo hit config.recompile_limit (8)
chaiml-pony-d2-q235b-pv-37537-v1-uploader: W0228 21:37:18.551000 7 torch/_dynamo/convert_frame.py:1358] [3/8] function: 'forward' (/usr/local/lib/python3.12/dist-packages/transformers/models/qwen3_moe/modeling_qwen3_moe.py:305)
chaiml-pony-d2-q235b-pv-37537-v1-uploader: W0228 21:37:18.551000 7 torch/_dynamo/convert_frame.py:1358] [3/8] last reason: 3/7: self._modules['self_attn']._modules['k_proj'].imatrix_cnt == 56 # module.imatrix_cnt += input.shape[0] # auto_round/compressors/base.py:1179 in get_imatrix_hook (HINT: torch.compile considers integer attributes of the nn.Module to be static. If you are observing recompilation, you might want to make this integer dynamic using torch._dynamo.config.allow_unspec_int_on_nn_module = True, or convert this integer into a tensor.)
chaiml-pony-d2-q235b-pv-37537-v1-uploader: W0228 21:37:18.551000 7 torch/_dynamo/convert_frame.py:1358] [3/8] To log all recompilation reasons, use TORCH_LOGS="recompiles".
chaiml-pony-d2-q235b-pv-37537-v1-uploader: W0228 21:37:18.551000 7 torch/_dynamo/convert_frame.py:1358] [3/8] To diagnose recompilation issues, see https://pytorch.org/docs/main/torch.compiler_troubleshooting.html
chaiml-pony-d2-q235b-pv-37537-v1-uploader: 2026-02-28 21:37:36 WARNING gguf.py L297: please use more data via setting `nsamples` to improve accuracy as calibration activations contain 0
chaiml-pony-d2-q235b-pv-37537-v1-uploader: Checking if ChaiML/pony-d2-q235b-pv2-lr5e6ep2r64g4-W4A16 already exists in ChaiML
chaiml-pony-d2-q235b-pv-37537-v1-uploader: Creating repo ChaiML/pony-d2-q235b-pv2-lr5e6ep2r64g4-W4A16 and uploading /dev/shm/model_output to it
chaiml-pony-d2-q235b-pv-37537-v1-uploader: ---------- 2026-02-28 22:51:23 (0:00:00) ----------
chaiml-pony-d2-q235b-pv-37537-v1-uploader: Files: hashed 11/38 (26.1M/131.9G) | pre-uploaded: 0/1 (0.0/131.9G) (+27 unsure) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-pony-d2-q235b-pv-37537-v1-uploader: Workers: hashing: 27 | get upload mode: 0 | pre-uploading: 1 | committing: 0 | waiting: 98
chaiml-pony-d2-q235b-pv-37537-v1-uploader: ---------------------------------------------------
chaiml-pony-d2-q235b-pv-37537-v1-uploader:       
chaiml-pony-d2-q235b-pv-37537-v1-uploader: ---------- 2026-02-28 22:52:23 (0:01:00) ----------
chaiml-pony-d2-q235b-pv-37537-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 2/28 (1.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-pony-d2-q235b-pv-37537-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 26 | committing: 0 | waiting: 100
chaiml-pony-d2-q235b-pv-37537-v1-uploader: ---------------------------------------------------
chaiml-pony-d2-q235b-pv-37537-v1-uploader:       
chaiml-pony-d2-q235b-pv-37537-v1-uploader: ---------- 2026-02-28 22:53:23 (0:02:00) ----------
chaiml-pony-d2-q235b-pv-37537-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 10/28 (41.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-pony-d2-q235b-pv-37537-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 18 | committing: 0 | waiting: 108
chaiml-pony-d2-q235b-pv-37537-v1-uploader: ---------------------------------------------------
chaiml-pony-d2-q235b-pv-37537-v1-uploader:       
chaiml-pony-d2-q235b-pv-37537-v1-uploader: ---------- 2026-02-28 22:54:23 (0:03:00) ----------
chaiml-pony-d2-q235b-pv-37537-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 18/28 (81.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-pony-d2-q235b-pv-37537-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 10 | committing: 0 | waiting: 116
chaiml-pony-d2-q235b-pv-37537-v1-uploader: ---------------------------------------------------
chaiml-pony-d2-q235b-pv-37537-v1-uploader:       
chaiml-pony-d2-q235b-pv-37537-v1-uploader: ---------- 2026-02-28 22:55:23 (0:04:00) ----------
chaiml-pony-d2-q235b-pv-37537-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 28/28 (131.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-pony-d2-q235b-pv-37537-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 0 | committing: 1 | waiting: 125
chaiml-pony-d2-q235b-pv-37537-v1-uploader: ---------------------------------------------------
chaiml-pony-d2-q235b-pv-37537-v1-uploader: Processed model ChaiML/pony-d2-q235b-pv2-lr5e6ep2r64g4 in 4977.026s
chaiml-pony-d2-q235b-pv-37537-v1-uploader: creating bucket guanaco-vllm-models
chaiml-pony-d2-q235b-pv-37537-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d2-q235b-pv-37537-v1-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-pony-d2-q235b-pv-37537-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-pony-d2-q235b-pv-37537-v1-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-pony-d2-q235b-pv-37537-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d2-q235b-pv-37537-v1-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-pony-d2-q235b-pv-37537-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d2-q235b-pv-37537-v1-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-pony-d2-q235b-pv-37537-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d2-q235b-pv-37537-v1-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-pony-d2-q235b-pv-37537-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d2-q235b-pv-37537-v1-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-pony-d2-q235b-pv-37537-v1-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-pony-d2-q235b-pv-37537-v1-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-pony-d2-q235b-pv-37537-v1-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-pony-d2-q235b-pv-37537-v1-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-pony-d2-q235b-pv-37537-v1-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-pony-d2-q235b-pv-37537-v1-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/generation_config.json
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/chat_template.jinja
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/special_tokens_map.json
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/merges.txt
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/tokenizer_config.json
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/quantization_config.json
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/config.json
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/added_tokens.json
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/vocab.json
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model.safetensors.index.json
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/tokenizer.json
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00027-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00016-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00023-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00020-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00012-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00009-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00018-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00025-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00022-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00001-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00003-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00021-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00008-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00004-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00002-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00026-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00006-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00013-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00005-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00024-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00017-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00010-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00007-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00019-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00015-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00011-of-00027.safetensors
chaiml-pony-d2-q235b-pv-37537-v1-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-d2-q235b-pv-37537-v1/default/model-00014-of-00027.safetensors
Job chaiml-pony-d2-q235b-pv-37537-v1-uploader completed after 5092.32s with status: succeeded
Stopping job with name chaiml-pony-d2-q235b-pv-37537-v1-uploader
Pipeline stage VLLMUploader completed in 5092.76s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.52s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-pony-d2-q235b-pv-37537-v1
Waiting for inference service chaiml-pony-d2-q235b-pv-37537-v1 to be ready
Inference service chaiml-pony-d2-q235b-pv-37537-v1 ready after 393.0776860713959s
Pipeline stage VLLMDeployer completed in 393.56s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.1055803298950195s
Received healthy response to inference request in 2.051077365875244s
Received healthy response to inference request in 2.0551939010620117s
Received healthy response to inference request in 1.933129072189331s
Received healthy response to inference request in 2.0921084880828857s
Received healthy response to inference request in 2.155010461807251s
Received healthy response to inference request in 2.0598297119140625s
Received healthy response to inference request in 1.8712599277496338s
Received healthy response to inference request in 1.9429759979248047s
Received healthy response to inference request in 1.8962154388427734s
Received healthy response to inference request in 2.1575756072998047s
Received healthy response to inference request in 1.956263780593872s
Received healthy response to inference request in 2.2278265953063965s
Received healthy response to inference request in 1.8715355396270752s
Received healthy response to inference request in 1.8889200687408447s
Received healthy response to inference request in 2.0252416133880615s
Received healthy response to inference request in 1.8909540176391602s
Received healthy response to inference request in 1.9591090679168701s
Received healthy response to inference request in 2.0069074630737305s
Received healthy response to inference request in 2.096709728240967s
Received healthy response to inference request in 2.20504093170166s
Received healthy response to inference request in 1.9504308700561523s
Received healthy response to inference request in 2.0211660861968994s
Received healthy response to inference request in 1.873992919921875s
Received healthy response to inference request in 1.910625696182251s
Received healthy response to inference request in 1.9668316841125488s
Received healthy response to inference request in 2.1964821815490723s
Received healthy response to inference request in 2.2201335430145264s
Received healthy response to inference request in 2.069007396697998s
Received healthy response to inference request in 2.063532829284668s
30 requests
0 failed requests
5th percentile: 1.8726413607597352
10th percentile: 1.8874273538589477
20th percentile: 1.9077436447143554
30th percentile: 1.948194408416748
40th percentile: 1.9637426376342773
50th percentile: 2.0232038497924805
60th percentile: 2.057048225402832
70th percentile: 2.0759377241134644
80th percentile: 2.115466356277466
90th percentile: 2.1973380565643312
95th percentile: 2.2133418679237367
99th percentile: 2.225595610141754
mean time: 2.024022277196248
Pipeline stage StressChecker completed in 69.06s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.18s
Shutdown handler de-registered
chaiml-pony-d2-q235b-pv_37537_v1 status is now deployed due to DeploymentManager action
chaiml-pony-d2-q235b-pv_37537_v1 status is now inactive due to auto deactivation removed underperforming models