developer_uid: acehao-chai
submission_id: chaiml-grpo-q235b-kimid_10073_v2
model_name: chaiml-grpo-q235b-kimid_10073_v2
model_group: ChaiML/grpo-q235b-kimid-
status: torndown
timestamp: 2026-03-11T17:33:21+00:00
num_battles: 10823
num_wins: 5830
celo_rating: 1324.91
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/grpo-q235b-kimid-v5a-merged-chai-rm-step-200
model_architecture: Qwen3MoeForCausalLM
model_num_parameters: 18790207488.0
best_of: 8
max_input_tokens: 2048
max_output_tokens: 80
reward_model: default
display_name: chaiml-grpo-q235b-kimid_10073_v2
ineligible_reason: max_output_tokens!=64
is_internal_developer: False
language_model: ChaiML/grpo-q235b-kimid-v5a-merged-chai-rm-step-200
model_size: 19B
ranking_group: single
us_pacific_date: 2026-02-20
win_ratio: 0.5386676522221195
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['<|assistant|>', '<|im_end|>', '####', '<|user|>', '</think>', '</s>'], 'max_input_tokens': 2048, 'best_of': 8, 'max_output_tokens': 80}
formatter: {'memory_template': "<|im_start|>system\n{bot_name}'s persona: {memory}<|im_end|>\n", 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-grpo-q235b-kimid-10073-v2-uploader
Waiting for job on chaiml-grpo-q235b-kimid-10073-v2-uploader to finish
chaiml-grpo-q235b-kimid-10073-v2-uploader: Using quantization_mode: w4a16
chaiml-grpo-q235b-kimid-10073-v2-uploader: Checking if ChaiML/grpo-q235b-kimid-v5a-merged-chai-rm-step-200-W4A16 already exists in ChaiML
chaiml-grpo-q235b-kimid-10073-v2-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-grpo-q235b-kimid-10073-v2-uploader: Downloading snapshot of ChaiML/grpo-q235b-kimid-v5a-merged-chai-rm-step-200-W4A16...
chaiml-grpo-q235b-kimid-10073-v2-uploader: creating bucket guanaco-vllm-models
chaiml-grpo-q235b-kimid-10073-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-10073-v2-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-grpo-q235b-kimid-10073-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-grpo-q235b-kimid-10073-v2-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-grpo-q235b-kimid-10073-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-10073-v2-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-grpo-q235b-kimid-10073-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-10073-v2-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-grpo-q235b-kimid-10073-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-10073-v2-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-grpo-q235b-kimid-10073-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-10073-v2-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-grpo-q235b-kimid-10073-v2-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-grpo-q235b-kimid-10073-v2-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-grpo-q235b-kimid-10073-v2-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-grpo-q235b-kimid-10073-v2-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-grpo-q235b-kimid-10073-v2-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-grpo-q235b-kimid-10073-v2-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/generation_config.json
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/chat_template.jinja
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/.gitattributes
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/merges.txt
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/added_tokens.json
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/tokenizer_config.json
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/quantization_config.json
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/special_tokens_map.json
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/config.json
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/vocab.json
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/tokenizer.json
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model.safetensors.index.json
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00004-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00018-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00012-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00022-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00003-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00013-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00014-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00015-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00010-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00006-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00025-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00024-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00011-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00009-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00007-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00023-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00020-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00001-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00026-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00008-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00019-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00017-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00016-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00002-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00021-of-00027.safetensors
chaiml-grpo-q235b-kimid-10073-v2-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-10073-v2/default/model-00005-of-00027.safetensors
Job chaiml-grpo-q235b-kimid-10073-v2-uploader completed after 137.01s with status: succeeded
Stopping job with name chaiml-grpo-q235b-kimid-10073-v2-uploader
Pipeline stage VLLMUploader completed in 142.70s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-grpo-q235b-kimid-10073-v2
Waiting for inference service chaiml-grpo-q235b-kimid-10073-v2 to be ready
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Inference service chaiml-grpo-q235b-kimid-10073-v2 ready after 432.3075370788574s
Pipeline stage VLLMDeployer completed in 432.83s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2425403594970703s
Received healthy response to inference request in 2.053154468536377s
Received healthy response to inference request in 3.3754732608795166s
Received healthy response to inference request in 2.253009080886841s
Received healthy response to inference request in 2.5478363037109375s
Received healthy response to inference request in 3.0385830402374268s
Received healthy response to inference request in 2.745382070541382s
Received healthy response to inference request in 2.9604156017303467s
Received healthy response to inference request in 3.592407703399658s
Received healthy response to inference request in 2.2583813667297363s
Received healthy response to inference request in 2.925147771835327s
Received healthy response to inference request in 2.8117995262145996s
Received healthy response to inference request in 2.6277339458465576s
Received healthy response to inference request in 2.1571402549743652s
Received healthy response to inference request in 1.9535655975341797s
Received healthy response to inference request in 2.223545789718628s
Received healthy response to inference request in 2.255735158920288s
Received healthy response to inference request in 2.117823600769043s
Received healthy response to inference request in 2.011085271835327s
Received healthy response to inference request in 2.1410348415374756s
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Received healthy response to inference request in 2.18725323677063s
Received healthy response to inference request in 2.0496208667755127s
Received healthy response to inference request in 1.9373748302459717s
Received healthy response to inference request in 2.450518846511841s
Received healthy response to inference request in 1.9173436164855957s
Received healthy response to inference request in 2.181544065475464s
Received healthy response to inference request in 2.1909267902374268s
Received healthy response to inference request in 2.0698697566986084s
Received healthy response to inference request in 2.0144200325012207s
Received healthy response to inference request in 2.0534353256225586s
30 requests
0 failed requests
5th percentile: 1.9446606755256652
10th percentile: 2.0053333044052124
20th percentile: 2.0524477481842043
30th percentile: 2.1034374475479125
40th percentile: 2.1717825412750242
50th percentile: 2.2072362899780273
60th percentile: 2.2540995121002196
70th percentile: 2.4797140836715696
80th percentile: 2.7586655616760254
90th percentile: 2.968232345581055
95th percentile: 3.2238726615905753
99th percentile: 3.5294967150688175
mean time: 2.3781367460886638
Pipeline stage StressChecker completed in 86.21s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.65s
Shutdown handler de-registered
chaiml-grpo-q235b-kimid_10073_v2 status is now deployed due to DeploymentManager action
chaiml-grpo-q235b-kimid_10073_v2 status is now inactive due to auto deactivation removed underperforming models
chaiml-grpo-q235b-kimid_10073_v2 status is now inactive due to Froze recruitment for AB test 0220_feynman
Deleting key chaiml-grpo-q235b-opusd-11130-v3/default/tokenizer.json from bucket guanaco-vllm-models
Deleting key chaiml-grpo-q235b-kimid-83709-v2/default/tokenizer_config.json from bucket guanaco-vllm-models
Deleting key chaiml-grpo-q235b-opusd-34252-v2/default/quantization_config.json from bucket guanaco-vllm-models
Deleting key chaiml-grpo-q235b-kimid-35892-v2/default/vocab.json from bucket guanaco-vllm-models
chaiml-grpo-q235b-kimid_10073_v2 status is now torndown due to DeploymentManager action