developer_uid: zonemercy
submission_id: chaiml-pony-v1-q235b-l_99625_v13
model_name: chaiml-pony-v1-q235b-l_99625_v13
model_group: ChaiML/pony-v1-q235b-lr1
status: inactive
timestamp: 2026-02-28T15:17:30+00:00
num_battles: 13556
num_wins: 7319
celo_rating: 1324.46
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/pony-v1-q235b-lr1e4ep1r64g4
model_architecture: Qwen3MoeForCausalLM
model_num_parameters: 18790207488.0
best_of: 8
max_input_tokens: 2048
max_output_tokens: 80
reward_model: default
display_name: chaiml-pony-v1-q235b-l_99625_v13
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/pony-v1-q235b-lr1e4ep1r64g4
model_size: 19B
ranking_group: single
us_pacific_date: 2026-02-28
win_ratio: 0.5399085275892593
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['</s>', '<|im_end|>', '####', '<|assistant|>', '</think>', '<|user|>'], 'max_input_tokens': 2048, 'best_of': 8, 'max_output_tokens': 80}
formatter: {'memory_template': "<|im_start|>system\n{bot_name}'s persona: {memory}<|im_end|>\n", 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Connection pool is full, discarding connection: %s. Connection pool size: %s
Starting job with name chaiml-pony-v1-q235b-l-99625-v13-uploader
Waiting for job on chaiml-pony-v1-q235b-l-99625-v13-uploader to finish
chaiml-pony-v1-q235b-l-99625-v13-uploader: Using quantization_mode: w4a16
chaiml-pony-v1-q235b-l-99625-v13-uploader: Checking if ChaiML/pony-v1-q235b-lr1e4ep1r64g4-W4A16 already exists in ChaiML
chaiml-pony-v1-q235b-l-99625-v13-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-pony-v1-q235b-l-99625-v13-uploader: Downloading snapshot of ChaiML/pony-v1-q235b-lr1e4ep1r64g4-W4A16...
chaiml-pony-v1-q235b-l-99625-v13-uploader: Downloaded in 47.937s
chaiml-pony-v1-q235b-l-99625-v13-uploader: Processed model ChaiML/pony-v1-q235b-lr1e4ep1r64g4 in 48.469s
chaiml-pony-v1-q235b-l-99625-v13-uploader: creating bucket guanaco-vllm-models
chaiml-pony-v1-q235b-l-99625-v13-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q235b-l-99625-v13-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-pony-v1-q235b-l-99625-v13-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-pony-v1-q235b-l-99625-v13-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-pony-v1-q235b-l-99625-v13-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q235b-l-99625-v13-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-pony-v1-q235b-l-99625-v13-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q235b-l-99625-v13-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-pony-v1-q235b-l-99625-v13-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q235b-l-99625-v13-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-pony-v1-q235b-l-99625-v13-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-v1-q235b-l-99625-v13-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-pony-v1-q235b-l-99625-v13-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-pony-v1-q235b-l-99625-v13-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-pony-v1-q235b-l-99625-v13-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-pony-v1-q235b-l-99625-v13-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-pony-v1-q235b-l-99625-v13-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-pony-v1-q235b-l-99625-v13-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/.gitattributes
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/added_tokens.json
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/generation_config.json
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/chat_template.jinja
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/special_tokens_map.json
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/tokenizer_config.json
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/config.json
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model.safetensors.index.json
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/quantization_config.json
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/merges.txt
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/vocab.json
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/tokenizer.json
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00027-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00003-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00021-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00008-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00019-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00017-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00004-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00020-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00024-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00016-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00015-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00018-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00025-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00005-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00012-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00002-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00023-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00007-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00011-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00006-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00009-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00022-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00014-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00010-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00013-of-00027.safetensors
chaiml-pony-v1-q235b-l-99625-v13-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-pony-v1-q235b-l-99625-v13/default/model-00026-of-00027.safetensors
Job chaiml-pony-v1-q235b-l-99625-v13-uploader completed after 156.13s with status: succeeded
Stopping job with name chaiml-pony-v1-q235b-l-99625-v13-uploader
Pipeline stage VLLMUploader completed in 157.19s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.52s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-pony-v1-q235b-l-99625-v13
Waiting for inference service chaiml-pony-v1-q235b-l-99625-v13 to be ready
Inference service chaiml-pony-v1-q235b-l-99625-v13 ready after 431.28288197517395s
Pipeline stage VLLMDeployer completed in 431.66s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.9907646179199219s
Received healthy response to inference request in 1.9318063259124756s
Received healthy response to inference request in 1.8718233108520508s
Received healthy response to inference request in 1.9542944431304932s
Received healthy response to inference request in 2.285533905029297s
Received healthy response to inference request in 1.9177227020263672s
Received healthy response to inference request in 1.892956018447876s
Received healthy response to inference request in 1.8855605125427246s
Received healthy response to inference request in 1.947232723236084s
Received healthy response to inference request in 1.8813104629516602s
Received healthy response to inference request in 2.045708656311035s
Received healthy response to inference request in 2.0665833950042725s
Received healthy response to inference request in 1.9200878143310547s
Received healthy response to inference request in 1.8947737216949463s
Received healthy response to inference request in 2.1215686798095703s
Received healthy response to inference request in 2.037604331970215s
Received healthy response to inference request in 2.3266453742980957s
Received healthy response to inference request in 2.03872013092041s
Received healthy response to inference request in 2.1263959407806396s
Received healthy response to inference request in 2.0582878589630127s
Received healthy response to inference request in 2.1981194019317627s
Received healthy response to inference request in 2.680175542831421s
Received healthy response to inference request in 1.952239751815796s
Received healthy response to inference request in 1.8833694458007812s
Received healthy response to inference request in 1.925072431564331s
Received healthy response to inference request in 2.006227970123291s
Received healthy response to inference request in 1.9199066162109375s
Received healthy response to inference request in 2.0175528526306152s
Received healthy response to inference request in 2.078991413116455s
Received healthy response to inference request in 1.9359569549560547s
30 requests
0 failed requests
5th percentile: 1.8822370052337647
10th percentile: 1.8853414058685303
20th percentile: 1.913132905960083
30th percentile: 1.9235770463943482
40th percentile: 1.9427224159240724
50th percentile: 1.9725295305252075
60th percentile: 2.0255734443664553
70th percentile: 2.0494824171066286
80th percentile: 2.087506866455078
90th percentile: 2.2068608522415163
95th percentile: 2.308145213127136
99th percentile: 2.5776517939567567
mean time: 2.0264331102371216
Pipeline stage StressChecker completed in 79.79s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.31s
Shutdown handler de-registered
chaiml-pony-v1-q235b-l_99625_v13 status is now deployed due to DeploymentManager action
chaiml-pony-v1-q235b-l_99625_v13 status is now inactive due to auto deactivation removed underperforming models