developer_uid: zonemercy
submission_id: chaiml-pony-v1-q235b-lr_32150_v3
model_name: chaiml-pony-v1-q235b-lr_32150_v3
model_group: ChaiML/pony-v1-q235b-lr1
status: torndown
timestamp: 2026-02-25T14:40:58+00:00
num_battles: 10584
num_wins: 5661
celo_rating: 1321.55
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/pony-v1-q235b-lr1e4ep1r64g8
model_architecture: Qwen3MoeForCausalLM
model_num_parameters: 18790207488.0
best_of: 8
max_input_tokens: 2048
max_output_tokens: 80
reward_model: default
display_name: chaiml-pony-v1-q235b-lr_32150_v3
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/pony-v1-q235b-lr1e4ep1r64g8
model_size: 19B
ranking_group: single
us_pacific_date: 2026-02-22
win_ratio: 0.5348639455782312
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['<|assistant|>', '<|user|>', '<|im_end|>', '</s>', '####', '</think>'], 'max_input_tokens': 2048, 'best_of': 8, 'max_output_tokens': 80}
formatter: {'memory_template': "<|im_start|>system\n{bot_name}'s persona: {memory}<|im_end|>\n", 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-pony-v1-q235b-lr-32150-v3-uploader
Waiting for job on chaiml-pony-v1-q235b-lr-32150-v3-uploader to finish
chaiml-pony-v1-q235b-lr-32150-v3-uploader: Using quantization_mode: w4a16
chaiml-pony-v1-q235b-lr-32150-v3-uploader: Checking if ChaiML/pony-v1-q235b-lr1e4ep1r64g8-W4A16 already exists in ChaiML
chaiml-pony-v1-q235b-lr-32150-v3-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-pony-v1-q235b-lr-32150-v3-uploader: Downloading snapshot of ChaiML/pony-v1-q235b-lr1e4ep1r64g8-W4A16...
chaiml-pony-v1-q235b-lr-32150-v3-uploader: Downloaded in 38.775s
chaiml-pony-v1-q235b-lr-32150-v3-uploader: Processed model ChaiML/pony-v1-q235b-lr1e4ep1r64g8 in 39.310s
chaiml-pony-v1-q235b-lr-32150-v3-uploader: creating bucket guanaco-vllm-models
chaiml-pony-v1-q235b-lr-32150-v3-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q235b-lr-32150-v3-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-pony-v1-q235b-lr-32150-v3-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-pony-v1-q235b-lr-32150-v3-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-pony-v1-q235b-lr-32150-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q235b-lr-32150-v3-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-pony-v1-q235b-lr-32150-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q235b-lr-32150-v3-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-pony-v1-q235b-lr-32150-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q235b-lr-32150-v3-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-pony-v1-q235b-lr-32150-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q235b-lr-32150-v3-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-pony-v1-q235b-lr-32150-v3-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-pony-v1-q235b-lr-32150-v3-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-pony-v1-q235b-lr-32150-v3-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-pony-v1-q235b-lr-32150-v3-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-pony-v1-q235b-lr-32150-v3-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-pony-v1-q235b-lr-32150-v3-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/.gitattributes
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/chat_template.jinja
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/vocab.json
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/config.json
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/special_tokens_map.json
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/tokenizer_config.json
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/tokenizer.json
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model.safetensors.index.json
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/added_tokens.json
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/merges.txt
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/generation_config.json
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/quantization_config.json
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00027-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00025-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00015-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00022-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00023-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00011-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00004-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00026-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00017-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00009-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00001-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00012-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00008-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00006-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00016-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00024-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00013-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00003-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00019-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00014-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00018-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00002-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00021-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00020-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00007-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00005-of-00027.safetensors
chaiml-pony-v1-q235b-lr-32150-v3-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-lr-32150-v3/default/model-00010-of-00027.safetensors
Job chaiml-pony-v1-q235b-lr-32150-v3-uploader completed after 156.23s with status: succeeded
Stopping job with name chaiml-pony-v1-q235b-lr-32150-v3-uploader
Pipeline stage VLLMUploader completed in 156.62s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-pony-v1-q235b-lr-32150-v3
Waiting for inference service chaiml-pony-v1-q235b-lr-32150-v3 to be ready
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
Retrying (%r) after connection broken by '%r': %s
HTTP Request: %s %s "%s %d %s"
Inference service chaiml-pony-v1-q235b-lr-32150-v3 ready after 390.2223777770996s
Pipeline stage VLLMDeployer completed in 390.59s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.0614354610443115s
Received healthy response to inference request in 1.856680154800415s
Received healthy response to inference request in 2.1221253871917725s
Received healthy response to inference request in 1.9491288661956787s
Received healthy response to inference request in 2.1675851345062256s
Received healthy response to inference request in 1.9596538543701172s
Received healthy response to inference request in 2.0454816818237305s
Received healthy response to inference request in 2.158241033554077s
Received healthy response to inference request in 1.9170336723327637s
Received healthy response to inference request in 2.018519401550293s
Received healthy response to inference request in 1.8994646072387695s
Received healthy response to inference request in 1.926126480102539s
Received healthy response to inference request in 2.1882712841033936s
Received healthy response to inference request in 1.878178596496582s
Received healthy response to inference request in 2.0002024173736572s
Received healthy response to inference request in 2.119615316390991s
Received healthy response to inference request in 1.942925214767456s
Received healthy response to inference request in 1.9047255516052246s
Received healthy response to inference request in 2.0462398529052734s
Received healthy response to inference request in 2.0185787677764893s
Received healthy response to inference request in 2.2635555267333984s
Received healthy response to inference request in 2.331855058670044s
Received healthy response to inference request in 2.2419986724853516s
Received healthy response to inference request in 2.056619167327881s
Received healthy response to inference request in 2.0568480491638184s
Received healthy response to inference request in 2.1550347805023193s
Received healthy response to inference request in 1.926368236541748s
Received healthy response to inference request in 2.07141375541687s
Received healthy response to inference request in 1.9041545391082764s
Received healthy response to inference request in 1.907083511352539s
30 requests
0 failed requests
5th percentile: 1.8877573013305664
10th percentile: 1.9036855459213258
20th percentile: 1.9150436401367188
30th percentile: 1.9379581212997437
40th percentile: 1.9839829921722412
50th percentile: 2.03203022480011
60th percentile: 2.056710720062256
70th percentile: 2.0858742237091064
80th percentile: 2.155676031112671
90th percentile: 2.1936440229415894
95th percentile: 2.253854942321777
99th percentile: 2.312048194408417
mean time: 2.0365048011144
Pipeline stage StressChecker completed in 64.79s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.77s
Shutdown handler de-registered
chaiml-pony-v1-q235b-lr_32150_v3 status is now deployed due to DeploymentManager action
chaiml-pony-v1-q235b-lr_32150_v3 status is now inactive due to auto deactivation removed underperforming models
chaiml-pony-v1-q235b-lr_32150_v3 status is now torndown due to DeploymentManager action