developer_uid: acehao-chai
submission_id: chaiml-grpo-q235b-opusd-_3319_v1
model_name: chaiml-grpo-q235b-opusd-_3319_v1
model_group: ChaiML/grpo-q235b-opusd-
status: inactive
timestamp: 2026-02-23T22:36:48+00:00
num_battles: 10636
num_wins: 5642
celo_rating: 1314.4
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/grpo-q235b-opusd-v1-merged-nemo70b-chai-rm-step-1600
model_architecture: Qwen3MoeForCausalLM
model_num_parameters: 18790207488.0
best_of: 8
max_input_tokens: 2048
max_output_tokens: 80
reward_model: default
display_name: chaiml-grpo-q235b-opusd-_3319_v1
ineligible_reason: max_output_tokens!=64
is_internal_developer: False
language_model: ChaiML/grpo-q235b-opusd-v1-merged-nemo70b-chai-rm-step-1600
model_size: 19B
ranking_group: single
us_pacific_date: 2026-02-23
win_ratio: 0.5304625799172621
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['<|user|>', '<|assistant|>', '<|im_end|>', '</s>', '####', '</think>'], 'max_input_tokens': 2048, 'best_of': 8, 'max_output_tokens': 80}
formatter: {'memory_template': "<|im_start|>system\n{bot_name}'s persona: {memory}<|im_end|>\n", 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': True}
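Note on the formatter above: a minimal sketch of how these templates could be joined into one prompt. The template strings are copied from the `formatter` entry. The function name `build_prompt`, the `(speaker, message)` turn format, and the assembly order are assumptions for illustration, not the platform's actual code. Truncation (`truncate_by_message`, `max_input_tokens`) is not modeled here.

```python
# Template strings copied from the formatter entry in the metadata.
MEMORY_TEMPLATE = "<|im_start|>system\n{bot_name}'s persona: {memory}<|im_end|>\n"
PROMPT_TEMPLATE = ""
BOT_TEMPLATE = "<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n"
USER_TEMPLATE = "<|im_start|>user\n{message}<|im_end|>\n"
RESPONSE_TEMPLATE = "<|im_start|>assistant\n{bot_name}:"


def build_prompt(bot_name, memory, turns):
    """Join persona, chat history, and the response cue into one prompt.

    `turns` is a list of (speaker, message) pairs where speaker is "bot" or
    "user". This layout is hypothetical and chosen only for this example.
    """
    parts = [
        MEMORY_TEMPLATE.format(bot_name=bot_name, memory=memory),
        PROMPT_TEMPLATE,
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(BOT_TEMPLATE.format(bot_name=bot_name, message=message))
        else:
            parts.append(USER_TEMPLATE.format(message=message))
    # The prompt ends with an open assistant turn for the model to complete.
    parts.append(RESPONSE_TEMPLATE.format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    print(build_prompt("Ava", "a friendly guide", [("user", "hi"), ("bot", "hello")]))
```

Because the prompt ends with an unclosed assistant turn, generation would be expected to stop on one of the configured `stopping_words`, such as `<|im_end|>`.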
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
Running pipeline stage VLLMUploader
Starting job with name chaiml-grpo-q235b-opusd-3319-v1-uploader
Waiting for job on chaiml-grpo-q235b-opusd-3319-v1-uploader to finish
chaiml-grpo-q235b-opusd-3319-v1-uploader: Using quantization_mode: w4a16
chaiml-grpo-q235b-opusd-3319-v1-uploader: Checking if ChaiML/grpo-q235b-opusd-v1-merged-nemo70b-chai-rm-step-1600-W4A16 already exists in ChaiML
chaiml-grpo-q235b-opusd-3319-v1-uploader: Downloading snapshot of ChaiML/grpo-q235b-opusd-v1-merged-nemo70b-chai-rm-step-1600...
chaiml-grpo-q235b-opusd-3319-v1-uploader: Downloaded in 173.202s
chaiml-grpo-q235b-opusd-3319-v1-uploader: Applying quantization...
chaiml-grpo-q235b-opusd-3319-v1-uploader: The tokenizer you are loading from '/tmp/model_input' with an incorrect regex pattern: https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503/discussions/84#69121093e8b480e709447d5e. This will lead to incorrect tokenization. You should set the `fix_mistral_regex=True` flag when loading this tokenizer to fix this issue.
chaiml-grpo-q235b-opusd-3319-v1-uploader: 2026-02-23 10:15:06 WARNING modeling_utils.py L4670: `torch_dtype` is deprecated! Use `dtype` instead!
chaiml-grpo-q235b-opusd-3319-v1-uploader: 2026-02-23 10:15:27 INFO base.py L366: using torch.bfloat16 for quantization tuning
chaiml-grpo-q235b-opusd-3319-v1-uploader: 2026-02-23 10:15:31 INFO base.py L1145: start to compute imatrix
chaiml-grpo-q235b-opusd-3319-v1-uploader: W0223 10:16:37.045000 7 torch/_dynamo/convert_frame.py:1358] [6/8] torch._dynamo hit config.recompile_limit (8)
chaiml-grpo-q235b-opusd-3319-v1-uploader: W0223 10:16:37.045000 7 torch/_dynamo/convert_frame.py:1358] [6/8] function: 'forward' (/usr/local/lib/python3.12/dist-packages/transformers/models/qwen3_moe/modeling_qwen3_moe.py:208)
chaiml-grpo-q235b-opusd-3319-v1-uploader: W0223 10:16:37.045000 7 torch/_dynamo/convert_frame.py:1358] [6/8] last reason: 6/7: self._modules['up_proj'].imatrix_cnt == 1342 # module.imatrix_cnt += input.shape[0] # auto_round/compressors/base.py:1179 in get_imatrix_hook (HINT: torch.compile considers integer attributes of the nn.Module to be static. If you are observing recompilation, you might want to make this integer dynamic using torch._dynamo.config.allow_unspec_int_on_nn_module = True, or convert this integer into a tensor.)
chaiml-grpo-q235b-opusd-3319-v1-uploader: W0223 10:16:37.045000 7 torch/_dynamo/convert_frame.py:1358] [6/8] To log all recompilation reasons, use TORCH_LOGS="recompiles".
chaiml-grpo-q235b-opusd-3319-v1-uploader: W0223 10:16:37.045000 7 torch/_dynamo/convert_frame.py:1358] [6/8] To diagnose recompilation issues, see https://pytorch.org/docs/main/torch.compiler_troubleshooting.html
chaiml-grpo-q235b-opusd-3319-v1-uploader: W0223 10:16:39.927000 7 torch/_dynamo/convert_frame.py:1358] [3/8] torch._dynamo hit config.recompile_limit (8)
chaiml-grpo-q235b-opusd-3319-v1-uploader: W0223 10:16:39.927000 7 torch/_dynamo/convert_frame.py:1358] [3/8] function: 'forward' (/usr/local/lib/python3.12/dist-packages/transformers/models/qwen3_moe/modeling_qwen3_moe.py:305)
chaiml-grpo-q235b-opusd-3319-v1-uploader: W0223 10:16:39.927000 7 torch/_dynamo/convert_frame.py:1358] [3/8] last reason: 3/7: self._modules['self_attn']._modules['k_proj'].imatrix_cnt == 56 # module.imatrix_cnt += input.shape[0] # auto_round/compressors/base.py:1179 in get_imatrix_hook (HINT: torch.compile considers integer attributes of the nn.Module to be static. If you are observing recompilation, you might want to make this integer dynamic using torch._dynamo.config.allow_unspec_int_on_nn_module = True, or convert this integer into a tensor.)
chaiml-grpo-q235b-opusd-3319-v1-uploader: W0223 10:16:39.927000 7 torch/_dynamo/convert_frame.py:1358] [3/8] To log all recompilation reasons, use TORCH_LOGS="recompiles".
chaiml-grpo-q235b-opusd-3319-v1-uploader: W0223 10:16:39.927000 7 torch/_dynamo/convert_frame.py:1358] [3/8] To diagnose recompilation issues, see https://pytorch.org/docs/main/torch.compiler_troubleshooting.html
chaiml-grpo-q235b-opusd-3319-v1-uploader: 2026-02-23 10:16:57 WARNING gguf.py L297: please use more data via setting `nsamples` to improve accuracy as calibration activations contain 0
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v3: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Max retries exceeded with url: /v1/completions (Caused by ConnectTimeoutError(<urllib3.connection.HTTPConnection object at 0x70f500b2f390>, 'Connection to chaiml-grpo-q235b-kimid-37540-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com timed out. (connect timeout=12.0)'))
Failed to get response for submission chaiml-sft-qwen-235b-ro_85751_v1: ('http://chaiml-sft-qwen-235b-ro-85751-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/completions', '')
chaiml-grpo-q235b-opusd-3319-v1-uploader: Checking if ChaiML/grpo-q235b-opusd-v1-merged-nemo70b-chai-rm-step-1600-W4A16 already exists in ChaiML
chaiml-grpo-q235b-opusd-3319-v1-uploader: Creating repo ChaiML/grpo-q235b-opusd-v1-merged-nemo70b-chai-rm-step-1600-W4A16 and uploading /dev/shm/model_output to it
chaiml-grpo-q235b-opusd-3319-v1-uploader: ---------- 2026-02-23 11:37:34 (0:00:00) ----------
chaiml-grpo-q235b-opusd-3319-v1-uploader: Files: hashed 11/38 (26.1M/131.9G) | pre-uploaded: 0/1 (0.0/131.9G) (+27 unsure) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-grpo-q235b-opusd-3319-v1-uploader: Workers: hashing: 27 | get upload mode: 0 | pre-uploading: 1 | committing: 0 | waiting: 98
chaiml-grpo-q235b-opusd-3319-v1-uploader: ---------------------------------------------------
chaiml-grpo-q235b-opusd-3319-v1-uploader:       
chaiml-grpo-q235b-opusd-3319-v1-uploader: ---------- 2026-02-23 11:38:34 (0:01:00) ----------
chaiml-grpo-q235b-opusd-3319-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 2/28 (1.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-grpo-q235b-opusd-3319-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 26 | committing: 0 | waiting: 100
chaiml-grpo-q235b-opusd-3319-v1-uploader: ---------------------------------------------------
chaiml-grpo-q235b-opusd-3319-v1-uploader:       
chaiml-grpo-q235b-opusd-3319-v1-uploader: ---------- 2026-02-23 11:39:34 (0:02:00) ----------
chaiml-grpo-q235b-opusd-3319-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 10/28 (41.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-grpo-q235b-opusd-3319-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 18 | committing: 0 | waiting: 108
chaiml-grpo-q235b-opusd-3319-v1-uploader: ---------------------------------------------------
chaiml-grpo-q235b-opusd-3319-v1-uploader:       
chaiml-grpo-q235b-opusd-3319-v1-uploader: ---------- 2026-02-23 11:40:35 (0:03:00) ----------
chaiml-grpo-q235b-opusd-3319-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 18/28 (81.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-grpo-q235b-opusd-3319-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 10 | committing: 0 | waiting: 116
chaiml-grpo-q235b-opusd-3319-v1-uploader: ---------------------------------------------------
chaiml-grpo-q235b-opusd-3319-v1-uploader:       
chaiml-grpo-q235b-opusd-3319-v1-uploader: ---------- 2026-02-23 11:41:35 (0:04:00) ----------
chaiml-grpo-q235b-opusd-3319-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 26/28 (121.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-grpo-q235b-opusd-3319-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 2 | committing: 0 | waiting: 124
chaiml-grpo-q235b-opusd-3319-v1-uploader: ---------------------------------------------------
Unable to record family friendly update due to error: Invalid JSON input: Expecting value: line 1 column 1 (char 0)
chaiml-grpo-q235b-opusd-3319-v1-uploader:       
chaiml-grpo-q235b-opusd-3319-v1-uploader: ---------- 2026-02-23 11:42:35 (0:05:00) ----------
chaiml-grpo-q235b-opusd-3319-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 28/28 (131.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-grpo-q235b-opusd-3319-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 0 | committing: 1 | waiting: 125
chaiml-grpo-q235b-opusd-3319-v1-uploader: ---------------------------------------------------
chaiml-grpo-q235b-opusd-3319-v1-uploader: Processed model ChaiML/grpo-q235b-opusd-v1-merged-nemo70b-chai-rm-step-1600 in 5452.468s
chaiml-grpo-q235b-opusd-3319-v1-uploader: creating bucket guanaco-vllm-models
chaiml-grpo-q235b-opusd-3319-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-opusd-3319-v1-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-grpo-q235b-opusd-3319-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-grpo-q235b-opusd-3319-v1-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-grpo-q235b-opusd-3319-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-opusd-3319-v1-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-grpo-q235b-opusd-3319-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-opusd-3319-v1-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-grpo-q235b-opusd-3319-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-opusd-3319-v1-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-grpo-q235b-opusd-3319-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-opusd-3319-v1-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-grpo-q235b-opusd-3319-v1-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-grpo-q235b-opusd-3319-v1-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-grpo-q235b-opusd-3319-v1-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-grpo-q235b-opusd-3319-v1-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-grpo-q235b-opusd-3319-v1-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-grpo-q235b-opusd-3319-v1-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/added_tokens.json
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/config.json
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/special_tokens_map.json
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/tokenizer_config.json
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/quantization_config.json
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/merges.txt
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/chat_template.jinja
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/generation_config.json
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/vocab.json
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/tokenizer.json
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model.safetensors.index.json
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00027-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00016-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00002-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00018-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00026-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00017-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00011-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00005-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00021-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00022-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00013-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00024-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00025-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00001-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00014-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00020-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00010-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00008-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00009-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00019-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00012-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00004-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00015-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00007-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00003-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00006-of-00027.safetensors
chaiml-grpo-q235b-opusd-3319-v1-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-3319-v1/default/model-00023-of-00027.safetensors
Job chaiml-grpo-q235b-opusd-3319-v1-uploader completed after 5543.23s with status: succeeded
Stopping job with name chaiml-grpo-q235b-opusd-3319-v1-uploader
Pipeline stage VLLMUploader completed in 5543.81s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.17s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-grpo-q235b-opusd-3319-v1
Waiting for inference service chaiml-grpo-q235b-opusd-3319-v1 to be ready
Inference service chaiml-grpo-q235b-opusd-3319-v1 ready after 393.614595413208s
Pipeline stage VLLMDeployer completed in 394.21s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.679992914199829s
Received healthy response to inference request in 1.7567827701568604s
Received healthy response to inference request in 1.8995442390441895s
Received healthy response to inference request in 2.032336473464966s
Received healthy response to inference request in 1.7986855506896973s
Received healthy response to inference request in 1.5953419208526611s
Received healthy response to inference request in 1.7889325618743896s
Received healthy response to inference request in 1.755040168762207s
Received healthy response to inference request in 1.7874608039855957s
Received healthy response to inference request in 1.8716673851013184s
Received healthy response to inference request in 1.9614284038543701s
Received healthy response to inference request in 1.909912109375s
Received healthy response to inference request in 1.7033147811889648s
Received healthy response to inference request in 1.8671283721923828s
Received healthy response to inference request in 1.7651267051696777s
Received healthy response to inference request in 1.815086841583252s
Received healthy response to inference request in 1.6110553741455078s
Received healthy response to inference request in 1.9858484268188477s
Received healthy response to inference request in 1.9635393619537354s
Received healthy response to inference request in 1.8954010009765625s
Received healthy response to inference request in 2.1306872367858887s
Received healthy response to inference request in 1.8560209274291992s
Received healthy response to inference request in 1.8947169780731201s
Received healthy response to inference request in 1.8738195896148682s
Received healthy response to inference request in 1.977689504623413s
Received healthy response to inference request in 1.869077205657959s
Received healthy response to inference request in 1.6590421199798584s
Received healthy response to inference request in 1.873671531677246s
Received healthy response to inference request in 1.7121880054473877s
Received healthy response to inference request in 1.9634289741516113s
30 requests
0 failed requests
5th percentile: 1.6326494097709656
10th percentile: 1.677897834777832
20th percentile: 1.7464697360992432
30th percentile: 1.7807605743408204
40th percentile: 1.8085263252258301
50th percentile: 1.868102788925171
60th percentile: 1.873730754852295
70th percentile: 1.8966439723968507
80th percentile: 1.9618285179138184
90th percentile: 1.9785053968429565
95th percentile: 2.0114168524742126
99th percentile: 2.102165515422821
mean time: 1.8417989412943523
Pipeline stage StressChecker completed in 60.69s
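Note on the StressChecker summary above: the reported percentiles are consistent with linear interpolation between sorted samples. For example, the 50th percentile (1.868102788925171) is the midpoint of the 15th and 16th sorted latencies, 1.8671283721923828 and 1.869077205657959. The sketch below reproduces that calculation in plain Python. The function names `percentile` and `summarize` are hypothetical, and whether the pipeline uses this exact method is an assumption.

```python
def percentile(samples, q):
    """Return the q-th percentile (q in [0, 100]) of samples.

    Uses linear interpolation between the two nearest order statistics.
    """
    if not samples:
        raise ValueError("samples must be non-empty")
    ordered = sorted(samples)
    rank = (len(ordered) - 1) * q / 100.0   # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def summarize(latencies, failed=0):
    """Build a small summary like the one printed by the stress check."""
    return {
        "requests": len(latencies),
        "failed": failed,
        "p50": percentile(latencies, 50),
        "p95": percentile(latencies, 95),
        "mean": sum(latencies) / len(latencies),
    }


if __name__ == "__main__":
    # Two of the latencies from the log, used only as sample input.
    print(summarize([1.8671283721923828, 1.869077205657959]))
```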
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.74s
Shutdown handler de-registered
chaiml-grpo-q235b-opusd-_3319_v1 status is now deployed due to DeploymentManager action
chaiml-grpo-q235b-opusd-_3319_v1 status is now inactive due to auto deactivation removed underperforming models