developer_uid: zonemercy
submission_id: chaiml-pony-v1-q235b-lr_32150_v5
model_name: chaiml-pony-v1-q235b-lr_32150_v5
model_group: ChaiML/pony-v1-q235b-lr1
status: torndown
timestamp: 2026-03-02T16:21:13+00:00
num_battles: 11421
num_wins: 6094
celo_rating: 1327.62
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/pony-v1-q235b-lr1e4ep1r64g8
model_architecture: Qwen3MoeForCausalLM
model_num_parameters: 18790207488.0
best_of: 8
max_input_tokens: 2048
max_output_tokens: 80
reward_model: default
display_name: chaiml-pony-v1-q235b-lr_32150_v5
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/pony-v1-q235b-lr1e4ep1r64g8
model_size: 19B
ranking_group: single
us_pacific_date: 2026-02-27
win_ratio: 0.5335784957534366
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['<|assistant|>', '<|user|>', '<|im_end|>', '</s>', '####', '</think>'], 'max_input_tokens': 2048, 'best_of': 8, 'max_output_tokens': 80}
formatter: {'memory_template': "<|im_start|>system\n{bot_name}'s persona: {memory}<|im_end|>\n", 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-pony-v1-q235b-lr-32150-v5-uploader
Waiting for job on chaiml-pony-v1-q235b-lr-32150-v5-uploader to finish
chaiml-pony-v1-q235b-lr-32150-v5-uploader: Using quantization_mode: w4a16
chaiml-pony-v1-q235b-lr-32150-v5-uploader: Checking if ChaiML/pony-v1-q235b-lr1e4ep1r64g8-W4A16 already exists in ChaiML
chaiml-pony-v1-q235b-lr-32150-v5-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-pony-v1-q235b-lr-32150-v5-uploader: Downloading snapshot of ChaiML/pony-v1-q235b-lr1e4ep1r64g8-W4A16...
chaiml-pony-v1-q235b-lr-32150-v5-uploader: Downloaded in 54.323s
chaiml-pony-v1-q235b-lr-32150-v5-uploader: Processed model ChaiML/pony-v1-q235b-lr1e4ep1r64g8 in 54.858s
chaiml-pony-v1-q235b-lr-32150-v5-uploader: creating bucket guanaco-vllm-models
chaiml-pony-v1-q235b-lr-32150-v5-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q235b-lr-32150-v5-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-pony-v1-q235b-lr-32150-v5-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-pony-v1-q235b-lr-32150-v5-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-pony-v1-q235b-lr-32150-v5-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q235b-lr-32150-v5-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-pony-v1-q235b-lr-32150-v5-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q235b-lr-32150-v5-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-pony-v1-q235b-lr-32150-v5-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q235b-lr-32150-v5-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-pony-v1-q235b-lr-32150-v5-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q235b-lr-32150-v5-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-pony-v1-q235b-lr-32150-v5-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-pony-v1-q235b-lr-32150-v5-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-pony-v1-q235b-lr-32150-v5-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-pony-v1-q235b-lr-32150-v5-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-pony-v1-q235b-lr-32150-v5-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-pony-v1-q235b-lr-32150-v5-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/added_tokens.json
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/.gitattributes
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/generation_config.json
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/tokenizer_config.json
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/quantization_config.json
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/special_tokens_map.json
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/chat_template.jinja
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/config.json
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/merges.txt
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model.safetensors.index.json
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/vocab.json
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/tokenizer.json
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00027-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00025-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00024-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00019-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00004-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00016-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00009-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00003-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00011-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00010-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00015-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00017-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00007-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00008-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00014-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00005-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00018-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00006-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00020-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00026-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00013-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00021-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00012-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00002-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00001-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00023-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v5-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v5/default/model-00022-of-00027.safetensors
Job chaiml-pony-v1-q235b-lr-32150-v5-uploader completed after 186.68s with status: succeeded
Stopping job with name chaiml-pony-v1-q235b-lr-32150-v5-uploader
Pipeline stage VLLMUploader completed in 187.45s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.33s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-pony-v1-q235b-lr-32150-v5
Waiting for inference service chaiml-pony-v1-q235b-lr-32150-v5 to be ready
Inference service chaiml-pony-v1-q235b-lr-32150-v5 ready after 470.4591484069824s
Pipeline stage VLLMDeployer completed in 471.05s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.017568826675415s
Received healthy response to inference request in 1.927232265472412s
Received healthy response to inference request in 2.1878247261047363s
Received healthy response to inference request in 1.9874820709228516s
Received healthy response to inference request in 2.0782339572906494s
Received healthy response to inference request in 2.207604169845581s
Received healthy response to inference request in 1.9494411945343018s
Received healthy response to inference request in 2.074815273284912s
Received healthy response to inference request in 2.0874297618865967s
Received healthy response to inference request in 1.9864575862884521s
Received healthy response to inference request in 2.253750801086426s
Received healthy response to inference request in 2.21156644821167s
Received healthy response to inference request in 1.9682819843292236s
Received healthy response to inference request in 1.9912889003753662s
Received healthy response to inference request in 1.9886348247528076s
Received healthy response to inference request in 2.1903560161590576s
Received healthy response to inference request in 2.337080478668213s
Received healthy response to inference request in 2.2689929008483887s
Received healthy response to inference request in 2.0141215324401855s
Received healthy response to inference request in 1.9608821868896484s
Received healthy response to inference request in 2.332463502883911s
Received healthy response to inference request in 2.1742634773254395s
Received healthy response to inference request in 2.0844528675079346s
Received healthy response to inference request in 1.9227654933929443s
Received healthy response to inference request in 1.9947426319122314s
Received healthy response to inference request in 1.8979606628417969s
Received healthy response to inference request in 1.9275665283203125s
Received healthy response to inference request in 2.0194830894470215s
Received healthy response to inference request in 2.1113948822021484s
Received healthy response to inference request in 1.9297313690185547s
30 requests
0 failed requests
5th percentile: 1.9247755408287048
10th percentile: 1.9275331020355224
20th percentile: 1.958593988418579
30th percentile: 1.9871747255325318
40th percentile: 1.9933611392974853
50th percentile: 2.0185259580612183
60th percentile: 2.0807215213775634
70th percentile: 2.1302554607391357
80th percentile: 2.1938056468963625
90th percentile: 2.255275011062622
95th percentile: 2.303901731967926
99th percentile: 2.3357415556907655
mean time: 2.0694623470306395
Pipeline stage StressChecker completed in 66.76s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.85s
Shutdown handler de-registered
chaiml-pony-v1-q235b-lr_32150_v5 status is now deployed due to DeploymentManager action
chaiml-pony-v1-q235b-lr_32150_v5 status is now inactive due to auto deactivation removed underperforming models
chaiml-pony-v1-q235b-lr_32150_v5 status is now torndown due to DeploymentManager action