developer_uid: zonemercy
submission_id: chaiml-pony-v1-q32b-2k_v2
model_name: chaiml-pony-v1-q32b-2k_v2
model_group: ChaiML/pony-v1-q32b-2k
status: torndown
timestamp: 2026-02-26T18:00:01+00:00
num_battles: 11369
num_wins: 5621
celo_rating: 1293.97
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/pony-v1-q32b-2k
model_architecture: Qwen3ForCausalLM
model_num_parameters: 30497182720.0
best_of: 8
max_input_tokens: 2048
max_output_tokens: 80
reward_model: default
display_name: chaiml-pony-v1-q32b-2k_v2
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/pony-v1-q32b-2k
model_size: 30B
ranking_group: single
us_pacific_date: 2026-02-22
win_ratio: 0.4944146362916703
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['<|assistant|>', '<|user|>', '<|im_end|>', '</s>', '####'], 'max_input_tokens': 2048, 'best_of': 8, 'max_output_tokens': 80}
formatter: {'memory_template': "<|im_start|>system\n{bot_name}'s persona: {memory}<|im_end|>\n", 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-pony-v1-q32b-2k-v2-uploader
Waiting for job on chaiml-pony-v1-q32b-2k-v2-uploader to finish
chaiml-pony-v1-q32b-2k-v2-uploader: Using quantization_mode: none
chaiml-pony-v1-q32b-2k-v2-uploader: Downloading snapshot of ChaiML/pony-v1-q32b-2k...
chaiml-pony-v1-q32b-2k-v2-uploader: Downloaded in 22.512s
chaiml-pony-v1-q32b-2k-v2-uploader: Processed model ChaiML/pony-v1-q32b-2k in 46.818s
chaiml-pony-v1-q32b-2k-v2-uploader: creating bucket guanaco-vllm-models
chaiml-pony-v1-q32b-2k-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q32b-2k-v2-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-pony-v1-q32b-2k-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-pony-v1-q32b-2k-v2-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-pony-v1-q32b-2k-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q32b-2k-v2-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-pony-v1-q32b-2k-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q32b-2k-v2-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-pony-v1-q32b-2k-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q32b-2k-v2-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-pony-v1-q32b-2k-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q32b-2k-v2-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-pony-v1-q32b-2k-v2-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-pony-v1-q32b-2k-v2-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-pony-v1-q32b-2k-v2-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-pony-v1-q32b-2k-v2-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-pony-v1-q32b-2k-v2-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-pony-v1-q32b-2k-v2-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/.gitattributes
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/model.safetensors.index.json
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/trainer_state.json s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/trainer_state.json
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/generation_config.json
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/training_args.bin s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/training_args.bin
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/tokenizer_config.json
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/special_tokens_map.json
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/config.json
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/chat_template.jinja
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/added_tokens.json
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/args.json s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/args.json
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/merges.txt
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/vocab.json
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/tokenizer.json
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/model-00014-of-00014.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/model-00014-of-00014.safetensors
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/model-00011-of-00014.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/model-00011-of-00014.safetensors
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/model-00005-of-00014.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/model-00005-of-00014.safetensors
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/model-00013-of-00014.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/model-00013-of-00014.safetensors
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/model-00010-of-00014.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/model-00010-of-00014.safetensors
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/model-00009-of-00014.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/model-00009-of-00014.safetensors
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/model-00007-of-00014.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/model-00007-of-00014.safetensors
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/model-00012-of-00014.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/model-00012-of-00014.safetensors
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/model-00008-of-00014.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/model-00008-of-00014.safetensors
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/model-00006-of-00014.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/model-00006-of-00014.safetensors
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/model-00001-of-00014.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/model-00001-of-00014.safetensors
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/model-00004-of-00014.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/model-00004-of-00014.safetensors
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/model-00002-of-00014.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/model-00002-of-00014.safetensors
chaiml-pony-v1-q32b-2k-v2-uploader: cp /dev/shm/model_output/model-00003-of-00014.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q32b-2k-v2/default/model-00003-of-00014.safetensors
Job chaiml-pony-v1-q32b-2k-v2-uploader completed after 91.84s with status: succeeded
Stopping job with name chaiml-pony-v1-q32b-2k-v2-uploader
Pipeline stage VLLMUploader completed in 92.31s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.14s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-pony-v1-q32b-2k-v2
Waiting for inference service chaiml-pony-v1-q32b-2k-v2 to be ready
Inference service chaiml-pony-v1-q32b-2k-v2 ready after 180.7354338169098s
Pipeline stage VLLMDeployer completed in 181.23s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.915189743041992s
Received healthy response to inference request in 3.952803373336792s
Received healthy response to inference request in 3.951737403869629s
Received healthy response to inference request in 3.7881357669830322s
Received healthy response to inference request in 3.7684154510498047s
Received healthy response to inference request in 3.8504278659820557s
Received healthy response to inference request in 4.027027368545532s
Received healthy response to inference request in 3.858093500137329s
Received healthy response to inference request in 4.211582899093628s
Received healthy response to inference request in 4.321272611618042s
Received healthy response to inference request in 3.8173584938049316s
Received healthy response to inference request in 4.016543865203857s
Received healthy response to inference request in 3.890810966491699s
Received healthy response to inference request in 4.265859365463257s
Received healthy response to inference request in 3.8239498138427734s
Received healthy response to inference request in 3.765604257583618s
Received healthy response to inference request in 3.85274338722229s
Received healthy response to inference request in 3.752302885055542s
Received healthy response to inference request in 3.7856998443603516s
Received healthy response to inference request in 3.755204200744629s
Received healthy response to inference request in 3.7474498748779297s
Received healthy response to inference request in 3.850133180618286s
Received healthy response to inference request in 3.9179954528808594s
Received healthy response to inference request in 3.7764852046966553s
Received healthy response to inference request in 4.0954039096832275s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 3.870512008666992s
Received healthy response to inference request in 3.758835554122925s
Received healthy response to inference request in 4.154461860656738s
Received healthy response to inference request in 3.7606616020202637s
Received healthy response to inference request in 3.9014830589294434s
30 requests
0 failed requests
5th percentile: 3.7536084771156313
10th percentile: 3.7584724187850953
20th percentile: 3.7678532123565676
30th percentile: 3.787404990196228
40th percentile: 3.839659833908081
50th percentile: 3.8554184436798096
60th percentile: 3.895079803466797
70th percentile: 3.92811803817749
80th percentile: 4.018640565872192
90th percentile: 4.160173964500427
95th percentile: 4.241434955596923
99th percentile: 4.305202770233154
mean time: 3.90680615901947
Pipeline stage StressChecker completed in 121.08s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.68s
Shutdown handler de-registered
chaiml-pony-v1-q32b-2k_v2 status is now deployed due to DeploymentManager action
chaiml-pony-v1-q32b-2k_v2 status is now inactive due to auto deactivation removed underperforming models
chaiml-pony-v1-q32b-2k_v2 status is now torndown due to DeploymentManager action