developer_uid: acehao-chai
submission_id: chaiml-grpo-q235b-kimid_96451_v2
model_name: chaiml-grpo-q235b-kimid_96451_v2
model_group: ChaiML/grpo-q235b-kimid-
status: torndown
timestamp: 2026-02-23T02:21:45+00:00
num_battles: 11311
num_wins: 6071
celo_rating: 1321.7
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/grpo-q235b-kimid-v5d-merged-chai-rm-mid-kl-averaged-loras
model_architecture: Qwen3MoeForCausalLM
model_num_parameters: 18790207488.0
best_of: 8
max_input_tokens: 2048
max_output_tokens: 80
reward_model: default
display_name: chaiml-grpo-q235b-kimid_96451_v2
ineligible_reason: max_output_tokens!=64
is_internal_developer: False
language_model: ChaiML/grpo-q235b-kimid-v5d-merged-chai-rm-mid-kl-averaged-loras
model_size: 19B
ranking_group: single
us_pacific_date: 2026-02-19
win_ratio: 0.5367341525948192
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['</s>', '</think>', '<|assistant|>', '####', '<|im_end|>', '<|user|>'], 'max_input_tokens': 2048, 'best_of': 8, 'max_output_tokens': 80}
formatter: {'memory_template': "<|im_start|>system\n{bot_name}'s persona: {memory}<|im_end|>\n", 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-grpo-q235b-kimid-96451-v2-uploader
Waiting for job on chaiml-grpo-q235b-kimid-96451-v2-uploader to finish
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-grpo-q235b-kimid-96451-v2-uploader: Using quantization_mode: w4a16
chaiml-grpo-q235b-kimid-96451-v2-uploader: Checking if ChaiML/grpo-q235b-kimid-v5d-merged-chai-rm-mid-kl-averaged-loras-W4A16 already exists in ChaiML
chaiml-grpo-q235b-kimid-96451-v2-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-grpo-q235b-kimid-96451-v2-uploader: Downloading snapshot of ChaiML/grpo-q235b-kimid-v5d-merged-chai-rm-mid-kl-averaged-loras-W4A16...
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
chaiml-grpo-q235b-kimid-96451-v2-uploader: Downloaded in 58.334s
chaiml-grpo-q235b-kimid-96451-v2-uploader: Processed model ChaiML/grpo-q235b-kimid-v5d-merged-chai-rm-mid-kl-averaged-loras in 58.941s
chaiml-grpo-q235b-kimid-96451-v2-uploader: creating bucket guanaco-vllm-models
chaiml-grpo-q235b-kimid-96451-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-96451-v2-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-grpo-q235b-kimid-96451-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-grpo-q235b-kimid-96451-v2-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-grpo-q235b-kimid-96451-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-96451-v2-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-grpo-q235b-kimid-96451-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-96451-v2-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-grpo-q235b-kimid-96451-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-96451-v2-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-grpo-q235b-kimid-96451-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-96451-v2-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-grpo-q235b-kimid-96451-v2-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-grpo-q235b-kimid-96451-v2-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-grpo-q235b-kimid-96451-v2-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-grpo-q235b-kimid-96451-v2-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-grpo-q235b-kimid-96451-v2-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-grpo-q235b-kimid-96451-v2-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/chat_template.jinja
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/config.json
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/merges.txt
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/quantization_config.json
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/tokenizer_config.json
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/generation_config.json
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/added_tokens.json
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/special_tokens_map.json
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/vocab.json
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model.safetensors.index.json
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/tokenizer.json
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/.gitattributes
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00027-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00014-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00025-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00023-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00021-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00002-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00008-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00010-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00006-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00004-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00013-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00019-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00003-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00024-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00020-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00005-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00016-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00022-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00017-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00026-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00018-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00007-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00011-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00012-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00009-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00001-of-00027.safetensors
chaiml-grpo-q235b-kimid-96451-v2-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-96451-v2/default/model-00015-of-00027.safetensors
Job chaiml-grpo-q235b-kimid-96451-v2-uploader completed after 147.73s with status: succeeded
Stopping job with name chaiml-grpo-q235b-kimid-96451-v2-uploader
Pipeline stage VLLMUploader completed in 148.28s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.37s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-grpo-q235b-kimid-96451-v2
Waiting for inference service chaiml-grpo-q235b-kimid-96451-v2 to be ready
Unable to record family friendly update due to error: Invalid JSON input: Expecting value: line 1 column 1 (char 0)
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Inference service chaiml-grpo-q235b-kimid-96451-v2 ready after 362.15509247779846s
Pipeline stage VLLMDeployer completed in 362.80s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.0974812507629395s
Received healthy response to inference request in 1.9312505722045898s
Received healthy response to inference request in 1.9838511943817139s
Received healthy response to inference request in 1.9677238464355469s
Received healthy response to inference request in 1.86576509475708s
Received healthy response to inference request in 2.139848470687866s
Received healthy response to inference request in 1.9697299003601074s
Received healthy response to inference request in 1.922415018081665s
Received healthy response to inference request in 1.8951139450073242s
Received healthy response to inference request in 1.955570936203003s
Received healthy response to inference request in 1.9733963012695312s
Received healthy response to inference request in 1.8894643783569336s
Received healthy response to inference request in 2.0163047313690186s
Received healthy response to inference request in 1.904843807220459s
Received healthy response to inference request in 2.2058935165405273s
Received healthy response to inference request in 2.0340416431427s
Received healthy response to inference request in 1.9431793689727783s
Received healthy response to inference request in 1.9214541912078857s
Received healthy response to inference request in 2.1878645420074463s
Received healthy response to inference request in 2.105023145675659s
Received healthy response to inference request in 1.9735338687896729s
Received healthy response to inference request in 1.9474701881408691s
Received healthy response to inference request in 2.1068239212036133s
Received healthy response to inference request in 1.9319350719451904s
Received healthy response to inference request in 2.036121129989624s
Received healthy response to inference request in 2.058210611343384s
Received healthy response to inference request in 2.158583879470825s
Received healthy response to inference request in 2.0246200561523438s
Received healthy response to inference request in 1.8915009498596191s
Received healthy response to inference request in 1.944920301437378s
30 requests
0 failed requests
5th percentile: 1.8903808355331422
10th percentile: 1.8947526454925536
20th percentile: 1.9222228527069092
30th percentile: 1.939806079864502
40th percentile: 1.9523306369781495
50th percentile: 1.9715631008148193
60th percentile: 1.9968326091766357
70th percentile: 2.0346654891967773
80th percentile: 2.0989896297454833
90th percentile: 2.141722011566162
95th percentile: 2.1746882438659667
99th percentile: 2.200665113925934
mean time: 1.9994645277659098
Pipeline stage StressChecker completed in 64.04s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.81s
Shutdown handler de-registered
chaiml-grpo-q235b-kimid_96451_v2 status is now deployed due to DeploymentManager action
chaiml-grpo-q235b-kimid_96451_v2 status is now inactive due to auto deactivation removed underperforming models
chaiml-grpo-q235b-kimid_96451_v2 status is now torndown due to DeploymentManager action