developer_uid: acehao-chai
submission_id: chaiml-grpo-q235b-opusd_89664_v1
model_name: chaiml-grpo-q235b-opusd_89664_v1
model_group: ChaiML/grpo-q235b-opusd-
status: torndown
timestamp: 2026-02-24T00:46:46+00:00
num_battles: 10572
num_wins: 5639
celo_rating: 1318.06
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/grpo-q235b-opusd-v1-merged-chai-rm-step-500
model_architecture: Qwen3MoeForCausalLM
model_num_parameters: 18790207488.0
best_of: 8
max_input_tokens: 2048
max_output_tokens: 80
reward_model: default
display_name: chaiml-grpo-q235b-opusd_89664_v1
ineligible_reason: max_output_tokens!=64
is_internal_developer: False
language_model: ChaiML/grpo-q235b-opusd-v1-merged-chai-rm-step-500
model_size: 19B
ranking_group: single
us_pacific_date: 2026-02-17
win_ratio: 0.5333900870223232
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['<|assistant|>', '<|im_end|>', '####', '<|user|>', '</think>', '</s>'], 'max_input_tokens': 2048, 'best_of': 8, 'max_output_tokens': 80}
formatter: {'memory_template': "<|im_start|>system\n{bot_name}'s persona: {memory}<|im_end|>\n", 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-grpo-q235b-opusd-89664-v1-uploader
Waiting for job on chaiml-grpo-q235b-opusd-89664-v1-uploader to finish
chaiml-grpo-q235b-opusd-89664-v1-uploader: Using quantization_mode: w4a16
chaiml-grpo-q235b-opusd-89664-v1-uploader: Checking if ChaiML/grpo-q235b-opusd-v1-merged-chai-rm-step-500-W4A16 already exists in ChaiML
chaiml-grpo-q235b-opusd-89664-v1-uploader: Downloading snapshot of ChaiML/grpo-q235b-opusd-v1-merged-chai-rm-step-500...
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-grpo-q235b-opusd-89664-v1-uploader: Downloaded in 165.876s
chaiml-grpo-q235b-opusd-89664-v1-uploader: Applying quantization...
chaiml-grpo-q235b-opusd-89664-v1-uploader: The tokenizer you are loading from '/tmp/model_input' with an incorrect regex pattern: https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503/discussions/84#69121093e8b480e709447d5e. This will lead to incorrect tokenization. You should set the `fix_mistral_regex=True` flag when loading this tokenizer to fix this issue.
chaiml-grpo-q235b-opusd-89664-v1-uploader: 2026-02-17 12:01:01 WARNING modeling_utils.py L4670: `torch_dtype` is deprecated! Use `dtype` instead!
chaiml-grpo-q235b-opusd-89664-v1-uploader: 2026-02-17 12:01:16 INFO base.py L366: using torch.bfloat16 for quantization tuning
chaiml-grpo-q235b-opusd-89664-v1-uploader: 2026-02-17 12:01:20 INFO base.py L1145: start to compute imatrix
chaiml-grpo-q235b-opusd-89664-v1-uploader: /usr/local/lib/python3.12/dist-packages/torch/backends/cuda/__init__.py:131: UserWarning: Please use the new API settings to control TF32 behavior, such as torch.backends.cudnn.conv.fp32_precision = 'tf32' or torch.backends.cuda.matmul.fp32_precision = 'ieee'. Old settings, e.g, torch.backends.cuda.matmul.allow_tf32 = True, torch.backends.cudnn.allow_tf32 = True, allowTF32CuDNN() and allowTF32CuBLAS() will be deprecated after Pytorch 2.9. Please see https://pytorch.org/docs/main/notes/cuda.html#tensorfloat-32-tf32-on-ampere-and-later-devices (Triggered internally at /pytorch/aten/src/ATen/Context.cpp:80.)
chaiml-grpo-q235b-opusd-89664-v1-uploader: return torch._C._get_cublas_allow_tf32()
chaiml-grpo-q235b-opusd-89664-v1-uploader: W0217 12:02:21.825000 7 torch/_dynamo/convert_frame.py:1358] [6/8] torch._dynamo hit config.recompile_limit (8)
chaiml-grpo-q235b-opusd-89664-v1-uploader: W0217 12:02:21.825000 7 torch/_dynamo/convert_frame.py:1358] [6/8] function: 'forward' (/usr/local/lib/python3.12/dist-packages/transformers/models/qwen3_moe/modeling_qwen3_moe.py:208)
chaiml-grpo-q235b-opusd-89664-v1-uploader: W0217 12:02:21.825000 7 torch/_dynamo/convert_frame.py:1358] [6/8] last reason: 6/7: self._modules['up_proj'].imatrix_cnt == 1342 # module.imatrix_cnt += input.shape[0] # auto_round/compressors/base.py:1179 in get_imatrix_hook (HINT: torch.compile considers integer attributes of the nn.Module to be static. If you are observing recompilation, you might want to make this integer dynamic using torch._dynamo.config.allow_unspec_int_on_nn_module = True, or convert this integer into a tensor.)
chaiml-grpo-q235b-opusd-89664-v1-uploader: W0217 12:02:21.825000 7 torch/_dynamo/convert_frame.py:1358] [6/8] To log all recompilation reasons, use TORCH_LOGS="recompiles".
chaiml-grpo-q235b-opusd-89664-v1-uploader: W0217 12:02:21.825000 7 torch/_dynamo/convert_frame.py:1358] [6/8] To diagnose recompilation issues, see https://pytorch.org/docs/main/torch.compiler_troubleshooting.html
chaiml-grpo-q235b-opusd-89664-v1-uploader: 2026-02-17 12:02:42 WARNING gguf.py L297: please use more data via setting `nsamples` to improve accuracy as calibration activations contain 0
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Unable to record family friendly update due to error: Invalid JSON input: Expecting value: line 1 column 1 (char 0)
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-grpo-q235b-opusd-89664-v1-uploader: Checking if ChaiML/grpo-q235b-opusd-v1-merged-chai-rm-step-500-W4A16 already exists in ChaiML
chaiml-grpo-q235b-opusd-89664-v1-uploader: ---------- 2026-02-17 13:15:29 (0:00:00) ----------
chaiml-grpo-q235b-opusd-89664-v1-uploader: Files: hashed 11/38 (26.1M/131.9G) | pre-uploaded: 0/1 (0.0/131.9G) (+27 unsure) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-grpo-q235b-opusd-89664-v1-uploader: Workers: hashing: 27 | get upload mode: 0 | pre-uploading: 1 | committing: 0 | waiting: 98
chaiml-grpo-q235b-opusd-89664-v1-uploader: ---------------------------------------------------
chaiml-grpo-q235b-opusd-89664-v1-uploader:       
chaiml-grpo-q235b-opusd-89664-v1-uploader: ---------- 2026-02-17 13:16:29 (0:01:00) ----------
chaiml-grpo-q235b-opusd-89664-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 2/28 (1.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-grpo-q235b-opusd-89664-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 26 | committing: 0 | waiting: 100
chaiml-grpo-q235b-opusd-89664-v1-uploader: ---------------------------------------------------
chaiml-grpo-q235b-opusd-89664-v1-uploader:       
chaiml-grpo-q235b-opusd-89664-v1-uploader: ---------- 2026-02-17 13:17:29 (0:02:00) ----------
chaiml-grpo-q235b-opusd-89664-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 10/28 (41.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-grpo-q235b-opusd-89664-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 18 | committing: 0 | waiting: 108
chaiml-grpo-q235b-opusd-89664-v1-uploader: ---------------------------------------------------
chaiml-grpo-q235b-opusd-89664-v1-uploader:       
chaiml-grpo-q235b-opusd-89664-v1-uploader: ---------- 2026-02-17 13:18:29 (0:03:00) ----------
chaiml-grpo-q235b-opusd-89664-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 18/28 (81.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-grpo-q235b-opusd-89664-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 10 | committing: 0 | waiting: 116
chaiml-grpo-q235b-opusd-89664-v1-uploader: ---------------------------------------------------
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-grpo-q235b-opusd-89664-v1-uploader:       
chaiml-grpo-q235b-opusd-89664-v1-uploader: ---------- 2026-02-17 13:19:29 (0:04:00) ----------
chaiml-grpo-q235b-opusd-89664-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 19/28 (86.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-grpo-q235b-opusd-89664-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 9 | committing: 0 | waiting: 117
chaiml-grpo-q235b-opusd-89664-v1-uploader: ---------------------------------------------------
chaiml-grpo-q235b-opusd-89664-v1-uploader:       
chaiml-grpo-q235b-opusd-89664-v1-uploader: ---------- 2026-02-17 13:20:29 (0:05:00) ----------
chaiml-grpo-q235b-opusd-89664-v1-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 28/28 (131.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-grpo-q235b-opusd-89664-v1-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 0 | committing: 1 | waiting: 125
chaiml-grpo-q235b-opusd-89664-v1-uploader: ---------------------------------------------------
chaiml-grpo-q235b-opusd-89664-v1-uploader: Processed model ChaiML/grpo-q235b-opusd-v1-merged-chai-rm-step-500 in 4968.122s
chaiml-grpo-q235b-opusd-89664-v1-uploader: creating bucket guanaco-vllm-models
chaiml-grpo-q235b-opusd-89664-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-opusd-89664-v1-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-grpo-q235b-opusd-89664-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-grpo-q235b-opusd-89664-v1-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-grpo-q235b-opusd-89664-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-opusd-89664-v1-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-grpo-q235b-opusd-89664-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-opusd-89664-v1-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-grpo-q235b-opusd-89664-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-opusd-89664-v1-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-grpo-q235b-opusd-89664-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-opusd-89664-v1-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-grpo-q235b-opusd-89664-v1-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-grpo-q235b-opusd-89664-v1-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-grpo-q235b-opusd-89664-v1-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-grpo-q235b-opusd-89664-v1-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-grpo-q235b-opusd-89664-v1-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-grpo-q235b-opusd-89664-v1-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/added_tokens.json
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/generation_config.json
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/chat_template.jinja
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/tokenizer_config.json
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/quantization_config.json
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/config.json
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/special_tokens_map.json
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/vocab.json
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/merges.txt
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/tokenizer.json
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model.safetensors.index.json
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00027-of-00027.safetensors
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00013-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00022-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00015-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00008-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00002-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00009-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00010-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00019-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00007-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00024-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00026-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00023-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00003-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00020-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00006-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00014-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00001-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00011-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00012-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00004-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00018-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00025-of-00027.safetensors
chaiml-grpo-q235b-opusd-89664-v1-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-89664-v1/default/model-00005-of-00027.safetensors
Job chaiml-grpo-q235b-opusd-89664-v1-uploader completed after 5228.63s with status: succeeded
Stopping job with name chaiml-grpo-q235b-opusd-89664-v1-uploader
Pipeline stage VLLMUploader completed in 5230.07s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.16s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-grpo-q235b-opusd-89664-v1
Waiting for inference service chaiml-grpo-q235b-opusd-89664-v1 to be ready
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Inference service chaiml-grpo-q235b-opusd-89664-v1 ready after 714.90420794487s
Pipeline stage VLLMDeployer completed in 715.49s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.7279415130615234s
Received healthy response to inference request in 2.0069079399108887s
Received healthy response to inference request in 2.1358156204223633s
Received healthy response to inference request in 1.9965343475341797s
Received healthy response to inference request in 2.080411434173584s
Received healthy response to inference request in 2.024876117706299s
Received healthy response to inference request in 1.929248571395874s
Received healthy response to inference request in 2.910923719406128s
Received healthy response to inference request in 2.410451889038086s
Received healthy response to inference request in 2.482287645339966s
Received healthy response to inference request in 2.0172598361968994s
Received healthy response to inference request in 2.2255423069000244s
Received healthy response to inference request in 2.431772470474243s
Received healthy response to inference request in 1.8413059711456299s
Received healthy response to inference request in 1.8981761932373047s
Received healthy response to inference request in 2.3163697719573975s
Received healthy response to inference request in 2.127133846282959s
Received healthy response to inference request in 2.1964781284332275s
Received healthy response to inference request in 1.8747498989105225s
Received healthy response to inference request in 2.2118372917175293s
Received healthy response to inference request in 2.103452444076538s
Received healthy response to inference request in 2.322824239730835s
Received healthy response to inference request in 2.421346664428711s
Received healthy response to inference request in 1.9015727043151855s
Received healthy response to inference request in 2.268420457839966s
Received healthy response to inference request in 2.2723114490509033s
Received healthy response to inference request in 2.741820812225342s
Received healthy response to inference request in 2.097780466079712s
Received healthy response to inference request in 1.8514437675476074s
Received healthy response to inference request in 2.0618202686309814s
30 requests
0 failed requests
5th percentile: 1.861931526660919
10th percentile: 1.8958335638046264
20th percentile: 1.9830771923065187
30th percentile: 2.022591233253479
40th percentile: 2.0908328533172607
50th percentile: 2.131474733352661
60th percentile: 2.217319297790527
70th percentile: 2.2855289459228514
80th percentile: 2.4126308441162108
90th percentile: 2.506853032112122
95th percentile: 2.7355751276016234
99th percentile: 2.8618838763237
mean time: 2.1962939262390138
Pipeline stage StressChecker completed in 70.61s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.70s
Shutdown handler de-registered
chaiml-grpo-q235b-opusd_89664_v1 status is now deployed due to DeploymentManager action
chaiml-grpo-q235b-opusd_89664_v1 status is now inactive due to auto deactivation removed underperforming models
admin requested tearing down of chaiml-grpo-q235b-opusd_89664_v1
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMDeleter
Checking if service chaiml-grpo-q235b-opusd-89664-v1 is running
Tearing down inference service chaiml-grpo-q235b-opusd-89664-v1
Service chaiml-grpo-q235b-opusd-89664-v1 has been torndown
Pipeline stage VLLMDeleter completed in 1.13s
run pipeline stage %s
Running pipeline stage VLLMModelDeleter
Cleaning model data from S3
Cleaning model data from model cache
%s, retrying in %s seconds...
Cleaning model data from S3
Cleaning model data from model cache
%s, retrying in %s seconds...
Cleaning model data from S3
Cleaning model data from model cache
clean up pipeline due to error=TeardownError("Got unexpected keyword argument 'request_checksum_calculation'")
Shutdown handler de-registered
chaiml-grpo-q235b-opusd_89664_v1 status is now torndown due to DeploymentManager action