developer_uid: zonemercy
submission_id: chaiml-pony-v2-q235b-lr_18913_v2
model_name: chaiml-pony-v2-q235b-lr_18913_v2
model_group: ChaiML/pony-v2-q235b-lr1
status: torndown
timestamp: 2026-03-02T09:51:02+00:00
num_battles: 11163
num_wins: 6017
celo_rating: 1320.41
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/pony-v2-q235b-lr1e4ep1r64g4
model_architecture: Qwen3MoeForCausalLM
model_num_parameters: 18790207488.0
best_of: 8
max_input_tokens: 2048
max_output_tokens: 80
reward_model: default
display_name: chaiml-pony-v2-q235b-lr_18913_v2
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/pony-v2-q235b-lr1e4ep1r64g4
model_size: 19B
ranking_group: single
us_pacific_date: 2026-02-27
win_ratio: 0.5390128101764758
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['<|assistant|>', '<|user|>', '<|im_end|>', '</s>', '####', '</think>'], 'max_input_tokens': 2048, 'best_of': 8, 'max_output_tokens': 80}
formatter: {'memory_template': "<|im_start|>system\n{bot_name}'s persona: {memory}<|im_end|>\n", 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-pony-v2-q235b-lr-18913-v2-uploader
Waiting for job on chaiml-pony-v2-q235b-lr-18913-v2-uploader to finish
chaiml-pony-v2-q235b-lr-18913-v2-uploader: Using quantization_mode: w4a16
chaiml-pony-v2-q235b-lr-18913-v2-uploader: Checking if ChaiML/pony-v2-q235b-lr1e4ep1r64g4-W4A16 already exists in ChaiML
chaiml-pony-v2-q235b-lr-18913-v2-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-pony-v2-q235b-lr-18913-v2-uploader: Downloading snapshot of ChaiML/pony-v2-q235b-lr1e4ep1r64g4-W4A16...
chaiml-pony-v2-q235b-lr-18913-v2-uploader: Downloaded in 48.150s
chaiml-pony-v2-q235b-lr-18913-v2-uploader: Processed model ChaiML/pony-v2-q235b-lr1e4ep1r64g4 in 48.773s
chaiml-pony-v2-q235b-lr-18913-v2-uploader: creating bucket guanaco-vllm-models
chaiml-pony-v2-q235b-lr-18913-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v2-q235b-lr-18913-v2-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-pony-v2-q235b-lr-18913-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-pony-v2-q235b-lr-18913-v2-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-pony-v2-q235b-lr-18913-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v2-q235b-lr-18913-v2-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-pony-v2-q235b-lr-18913-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v2-q235b-lr-18913-v2-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-pony-v2-q235b-lr-18913-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v2-q235b-lr-18913-v2-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-pony-v2-q235b-lr-18913-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v2-q235b-lr-18913-v2-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-pony-v2-q235b-lr-18913-v2-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-pony-v2-q235b-lr-18913-v2-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-pony-v2-q235b-lr-18913-v2-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-pony-v2-q235b-lr-18913-v2-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-pony-v2-q235b-lr-18913-v2-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-pony-v2-q235b-lr-18913-v2-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/special_tokens_map.json
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/.gitattributes
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/tokenizer_config.json
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/added_tokens.json
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/chat_template.jinja
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/config.json
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/vocab.json
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/quantization_config.json
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/merges.txt
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model.safetensors.index.json
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/tokenizer.json
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/generation_config.json
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00027-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00016-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00024-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00006-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00017-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00004-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00009-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00019-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00007-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00020-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00014-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00010-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00011-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00022-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00003-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00008-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00002-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00005-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00025-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00026-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00015-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00023-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00001-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00018-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00013-of-00027.safetensors
chaiml-pony-v2-q235b-lr-18913-v2-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v2-q235b-lr-18913-v2/default/model-00012-of-00027.safetensors
Job chaiml-pony-v2-q235b-lr-18913-v2-uploader completed after 155.98s with status: succeeded
Stopping job with name chaiml-pony-v2-q235b-lr-18913-v2-uploader
Pipeline stage VLLMUploader completed in 156.35s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 1.01s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-pony-v2-q235b-lr-18913-v2
Waiting for inference service chaiml-pony-v2-q235b-lr-18913-v2 to be ready
Inference service chaiml-pony-v2-q235b-lr-18913-v2 ready after 600.4126808643341s
Pipeline stage VLLMDeployer completed in 600.77s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.0815317630767822s
Received healthy response to inference request in 1.9600026607513428s
Received healthy response to inference request in 1.925433874130249s
Received healthy response to inference request in 1.9106788635253906s
Received healthy response to inference request in 2.280302047729492s
Received healthy response to inference request in 1.8768470287322998s
Received healthy response to inference request in 2.069239854812622s
Received healthy response to inference request in 1.8872201442718506s
Received healthy response to inference request in 2.2413718700408936s
Received healthy response to inference request in 2.074808120727539s
Received healthy response to inference request in 1.9106330871582031s
Received healthy response to inference request in 1.98964262008667s
Received healthy response to inference request in 2.0144882202148438s
Received healthy response to inference request in 2.0433905124664307s
Received healthy response to inference request in 1.8461594581604004s
Received healthy response to inference request in 2.0814125537872314s
Received healthy response to inference request in 1.9166231155395508s
Received healthy response to inference request in 1.9085521697998047s
Received healthy response to inference request in 2.072558641433716s
Received healthy response to inference request in 2.02189040184021s
Received healthy response to inference request in 2.026954174041748s
Received healthy response to inference request in 2.045740842819214s
Received healthy response to inference request in 2.2308220863342285s
Received healthy response to inference request in 1.9646124839782715s
Received healthy response to inference request in 2.0327582359313965s
Received healthy response to inference request in 2.0332629680633545s
Received healthy response to inference request in 2.1862611770629883s
Received healthy response to inference request in 2.0536861419677734s
Received healthy response to inference request in 2.0092811584472656s
Received healthy response to inference request in 1.9501311779022217s
30 requests
0 failed requests
5th percentile: 1.8815149307250976
10th percentile: 1.9064189672470093
20th percentile: 1.9154342651367187
30th percentile: 1.9570412158966064
40th percentile: 2.0014257431030273
50th percentile: 2.024422287940979
60th percentile: 2.037313985824585
70th percentile: 2.058352255821228
80th percentile: 2.0761290073394774
90th percentile: 2.1907172679901126
95th percentile: 2.236624467372894
99th percentile: 2.2690122961997985
mean time: 2.021543248494466
Pipeline stage StressChecker completed in 65.15s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.75s
Shutdown handler de-registered
chaiml-pony-v2-q235b-lr_18913_v2 status is now deployed due to DeploymentManager action
chaiml-pony-v2-q235b-lr_18913_v2 status is now inactive due to auto deactivation removed underperforming models
chaiml-pony-v2-q235b-lr_18913_v2 status is now torndown due to DeploymentManager action