developer_uid: rirv938
submission_id: chaiml-reward-dpo-6cee-_18001_v1
model_name: chaiml-reward-dpo-6cee-_18001_v1
model_group: ChaiML/reward-dpo-6cee-c
status: torndown
timestamp: 2026-02-22T10:31:44+00:00
num_battles: 10852
num_wins: 6439
celo_rating: 1366.51
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/reward-dpo-6cee-chaiml-235b-sft-new-rm-_42002_v1-W4A16
model_architecture: Qwen3MoeForCausalLM
model_num_parameters: 18790207488.0
best_of: 8
max_input_tokens: 2048
max_output_tokens: 80
reward_model: default
display_name: chaiml-reward-dpo-6cee-_18001_v1
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/reward-dpo-6cee-chaiml-235b-sft-new-rm-_42002_v1-W4A16
model_size: 19B
ranking_group: single
us_pacific_date: 2026-02-19
win_ratio: 0.5933468485071877
generation_params: {'temperature': 1.0, 'top_p': 0.95, 'min_p': 0.05, 'top_k': 60, 'presence_penalty': 0.1, 'frequency_penalty': 0.0, 'stopping_words': ['<|im_end|>', 'You:', '</s>', '###', '<|im_start|>'], 'max_input_tokens': 2048, 'best_of': 8, 'max_output_tokens': 80}
formatter: {'memory_template': '<|im_start|>system\nRespond as a high quality storyteller.<|im_end|>\n<|im_start|>user\n', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '<|im_end|>\n<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-reward-dpo-6cee-18001-v1-uploader
Waiting for job on chaiml-reward-dpo-6cee-18001-v1-uploader to finish
chaiml-reward-dpo-6cee-18001-v1-uploader: Using quantization_mode: w4a16
chaiml-reward-dpo-6cee-18001-v1-uploader: Repo ChaiML/reward-dpo-6cee-chaiml-235b-sft-new-rm-_42002_v1-W4A16 already ends in W4A16. Skipping...
chaiml-reward-dpo-6cee-18001-v1-uploader: Checking if ChaiML/reward-dpo-6cee-chaiml-235b-sft-new-rm-_42002_v1-W4A16 already exists in ChaiML
chaiml-reward-dpo-6cee-18001-v1-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-reward-dpo-6cee-18001-v1-uploader: Downloading snapshot of ChaiML/reward-dpo-6cee-chaiml-235b-sft-new-rm-_42002_v1-W4A16...
chaiml-reward-dpo-6cee-18001-v1-uploader: Downloaded in 67.785s
chaiml-reward-dpo-6cee-18001-v1-uploader: Processed model ChaiML/reward-dpo-6cee-chaiml-235b-sft-new-rm-_42002_v1-W4A16 in 68.441s
chaiml-reward-dpo-6cee-18001-v1-uploader: creating bucket guanaco-vllm-models
chaiml-reward-dpo-6cee-18001-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-reward-dpo-6cee-18001-v1-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-reward-dpo-6cee-18001-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-reward-dpo-6cee-18001-v1-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-reward-dpo-6cee-18001-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-reward-dpo-6cee-18001-v1-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-reward-dpo-6cee-18001-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-reward-dpo-6cee-18001-v1-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-reward-dpo-6cee-18001-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-reward-dpo-6cee-18001-v1-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-reward-dpo-6cee-18001-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-reward-dpo-6cee-18001-v1-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-reward-dpo-6cee-18001-v1-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-reward-dpo-6cee-18001-v1-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-reward-dpo-6cee-18001-v1-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-reward-dpo-6cee-18001-v1-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-reward-dpo-6cee-18001-v1-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-reward-dpo-6cee-18001-v1-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/.gitattributes
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/added_tokens.json
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/tokenizer_config.json
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/chat_template.jinja
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/generation_config.json
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/special_tokens_map.json
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/quantization_config.json
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/config.json
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/merges.txt
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model.safetensors.index.json
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/tokenizer.json
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00027-of-00027.safetensors
Unable to record family friendly update due to error: Invalid JSON input: Expecting value: line 1 column 1 (char 0)
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00005-of-00027.safetensors
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00011-of-00027.safetensors
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00018-of-00027.safetensors
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00016-of-00027.safetensors
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00004-of-00027.safetensors
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00024-of-00027.safetensors
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00025-of-00027.safetensors
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00006-of-00027.safetensors
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00021-of-00027.safetensors
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00026-of-00027.safetensors
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00010-of-00027.safetensors
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00017-of-00027.safetensors
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00012-of-00027.safetensors
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00009-of-00027.safetensors
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00008-of-00027.safetensors
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00015-of-00027.safetensors
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00013-of-00027.safetensors
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00022-of-00027.safetensors
chaiml-reward-dpo-6cee-18001-v1-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-6cee-18001-v1/default/model-00014-of-00027.safetensors
Job chaiml-reward-dpo-6cee-18001-v1-uploader completed after 994.45s with status: succeeded
Stopping job with name chaiml-reward-dpo-6cee-18001-v1-uploader
Pipeline stage VLLMUploader completed in 994.95s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-reward-dpo-6cee-18001-v1
Waiting for inference service chaiml-reward-dpo-6cee-18001-v1 to be ready
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Inference service chaiml-reward-dpo-6cee-18001-v1 ready after 1034.3718209266663s
Pipeline stage VLLMDeployer completed in 1034.92s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.0945754051208496s
Received healthy response to inference request in 1.9428637027740479s
Received healthy response to inference request in 2.012471914291382s
Received healthy response to inference request in 2.056864023208618s
Received healthy response to inference request in 1.9108991622924805s
Received healthy response to inference request in 1.9482882022857666s
Received healthy response to inference request in 1.9296150207519531s
Received healthy response to inference request in 2.0125744342803955s
Received healthy response to inference request in 1.9714126586914062s
Received healthy response to inference request in 1.9622676372528076s
Received healthy response to inference request in 2.0388548374176025s
Received healthy response to inference request in 2.0063440799713135s
Received healthy response to inference request in 2.140554904937744s
Received healthy response to inference request in 1.9590308666229248s
Received healthy response to inference request in 2.026489496231079s
Received healthy response to inference request in 1.9684937000274658s
Received healthy response to inference request in 1.9185688495635986s
Received healthy response to inference request in 1.9566552639007568s
Received healthy response to inference request in 2.0447041988372803s
Received healthy response to inference request in 2.2023496627807617s
Received healthy response to inference request in 1.9364893436431885s
Received healthy response to inference request in 1.982762336730957s
Received healthy response to inference request in 2.166452646255493s
Received healthy response to inference request in 2.0297513008117676s
Received healthy response to inference request in 1.9382023811340332s
Received healthy response to inference request in 1.886108636856079s
Received healthy response to inference request in 1.9919390678405762s
Received healthy response to inference request in 1.9690132141113281s
Received healthy response to inference request in 2.075806140899658s
Received healthy response to inference request in 2.0153660774230957s
30 requests
0 failed requests
5th percentile: 1.9143505215644836
10th percentile: 1.9285104036331178
20th percentile: 1.9419314384460449
30th percentile: 1.9583181858062744
40th percentile: 1.9688054084777833
50th percentile: 1.9873507022857666
60th percentile: 2.0125129222869873
70th percentile: 2.027468037605286
80th percentile: 2.047136163711548
90th percentile: 2.0991733551025393
95th percentile: 2.154798662662506
99th percentile: 2.191939527988434
mean time: 2.0031923055648804
Pipeline stage StressChecker completed in 63.77s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.69s
Shutdown handler de-registered
chaiml-reward-dpo-6cee-_18001_v1 status is now deployed due to DeploymentManager action
chaiml-reward-dpo-6cee-_18001_v1 status is now inactive due to auto deactivation removed underperforming models
chaiml-reward-dpo-6cee-_18001_v1 status is now torndown due to DeploymentManager action