developer_uid: zonemercy
submission_id: chaiml-pony-d3a-mv1-son_96936_v1
model_name: chaiml-pony-d3a-mv1-son_96936_v1
model_group: ChaiML/pony-d3a-mv1-sonn
status: torndown
timestamp: 2026-03-28T17:00:58+00:00
num_battles: 10610
num_wins: 5518
celo_rating: 8468.98
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/pony-d3a-mv1-sonnetwintop2-q35b-lr5e6ep1g8
model_architecture: Qwen3_5MoeForConditionalGeneration
model_num_parameters: 33753909248.0
best_of: 16
max_input_tokens: 2048
max_output_tokens: 80
reward_model: default
display_name: chaiml-pony-d3a-mv1-son_96936_v1
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/pony-d3a-mv1-sonnetwintop2-q35b-lr5e6ep1g8
model_size: 34B
ranking_group: single
us_pacific_date: 2026-03-25
win_ratio: 0.5200754005655043
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['####', '<|assistant|>', '</s>', '<|user|>', '<|im_end|>'], 'max_input_tokens': 2048, 'best_of': 16, 'max_output_tokens': 80}
formatter: {'memory_template': "<|im_start|>system\n{bot_name}'s persona: {memory}<|im_end|>\n", 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-pony-d3a-mv1-son-96936-v1-uploader
Waiting for job on chaiml-pony-d3a-mv1-son-96936-v1-uploader to finish
chaiml-pony-d3a-mv1-son-96936-v1-uploader: Using quantization_mode: none
chaiml-pony-d3a-mv1-son-96936-v1-uploader: Downloading snapshot of ChaiML/pony-d3a-mv1-sonnetwintop2-q35b-lr5e6ep1g8...
chaiml-pony-d3a-mv1-son-96936-v1-uploader: Downloaded in 24.290s
2026-03-25T14:42:10.719668+00:00 monitor updated for chaiml-pony-d3a-mv1-son_96936_v1
chaiml-pony-d3a-mv1-son-96936-v1-uploader: Processed model ChaiML/pony-d3a-mv1-sonnetwintop2-q35b-lr5e6ep1g8 in 50.379s
chaiml-pony-d3a-mv1-son-96936-v1-uploader: creating bucket guanaco-vllm-models
chaiml-pony-d3a-mv1-son-96936-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3a-mv1-son-96936-v1-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-pony-d3a-mv1-son-96936-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-pony-d3a-mv1-son-96936-v1-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-pony-d3a-mv1-son-96936-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3a-mv1-son-96936-v1-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-pony-d3a-mv1-son-96936-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3a-mv1-son-96936-v1-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-pony-d3a-mv1-son-96936-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3a-mv1-son-96936-v1-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-pony-d3a-mv1-son-96936-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3a-mv1-son-96936-v1-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-pony-d3a-mv1-son-96936-v1-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-pony-d3a-mv1-son-96936-v1-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-pony-d3a-mv1-son-96936-v1-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-pony-d3a-mv1-son-96936-v1-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-pony-d3a-mv1-son-96936-v1-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-pony-d3a-mv1-son-96936-v1-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/added_tokens.json
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/config.json
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/processor_config.json s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/processor_config.json
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/preprocessor_config.json s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/preprocessor_config.json
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/.gitattributes
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/chat_template.jinja
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/generation_config.json
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/args.json s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/args.json
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/README.md s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/README.md
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/tokenizer_config.json
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/special_tokens_map.json
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/merges.txt
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/vocab.json
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/model.safetensors.index.json
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/tokenizer.json
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/model-00016-of-00016.safetensors s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/model-00016-of-00016.safetensors
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/model-00013-of-00016.safetensors s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/model-00013-of-00016.safetensors
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/model-00010-of-00016.safetensors s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/model-00010-of-00016.safetensors
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/model-00007-of-00016.safetensors s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/model-00007-of-00016.safetensors
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/model-00004-of-00016.safetensors s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/model-00004-of-00016.safetensors
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/model-00002-of-00016.safetensors s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/model-00002-of-00016.safetensors
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/model-00001-of-00016.safetensors s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/model-00001-of-00016.safetensors
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/model-00008-of-00016.safetensors s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/model-00008-of-00016.safetensors
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/model-00014-of-00016.safetensors s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/model-00014-of-00016.safetensors
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/model-00005-of-00016.safetensors s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/model-00005-of-00016.safetensors
chaiml-pony-d3a-mv1-son-96936-v1-uploader: cp /dev/shm/model_output/model-00011-of-00016.safetensors s3://guanaco-vllm-models/chaiml-pony-d3a-mv1-son-96936-v1/default/model-00011-of-00016.safetensors
Job chaiml-pony-d3a-mv1-son-96936-v1-uploader completed after 83.28s with status: succeeded
Stopping job with name chaiml-pony-d3a-mv1-son-96936-v1-uploader
Pipeline stage VLLMUploader completed in 83.83s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 3.22s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-pony-d3a-mv1-son-96936-v1
Waiting for inference service chaiml-pony-d3a-mv1-son-96936-v1 to be ready
2026-03-25T14:43:10.807981+00:00 monitor updated for chaiml-pony-d3a-mv1-son_96936_v1
2026-03-25T14:44:10.897508+00:00 monitor updated for chaiml-pony-d3a-mv1-son_96936_v1
2026-03-25T14:45:10.981706+00:00 monitor updated for chaiml-pony-d3a-mv1-son_96936_v1
Inference service chaiml-pony-d3a-mv1-son-96936-v1 ready after 190.21551084518433s
Pipeline stage VLLMDeployer completed in 190.70s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
2026-03-25T14:46:11.070162+00:00 monitor updated for chaiml-pony-d3a-mv1-son_96936_v1
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
2026-03-25T14:47:11.171387+00:00 monitor updated for chaiml-pony-d3a-mv1-son_96936_v1
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 6.058803558349609s
2026-03-25T14:48:11.267159+00:00 monitor updated for chaiml-pony-d3a-mv1-son_96936_v1
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 1.4073524475097656s
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 4.407008409500122s
Received healthy response to inference request in 2.5574707984924316s
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
2026-03-25T14:49:11.375573+00:00 monitor updated for chaiml-pony-d3a-mv1-son_96936_v1
Received healthy response to inference request in 7.592759132385254s
Received healthy response to inference request in 1.7664155960083008s
Received healthy response to inference request in 2.635559320449829s
Received healthy response to inference request in 1.433464527130127s
Received healthy response to inference request in 1.642606258392334s
Received healthy response to inference request in 2.051687002182007s
Received healthy response to inference request in 1.4107677936553955s
Received healthy response to inference request in 1.6065683364868164s
Received healthy response to inference request in 1.4992220401763916s
Received healthy response to inference request in 7.424103260040283s
Received healthy response to inference request in 1.811197280883789s
Received healthy response to inference request in 1.9926352500915527s
Received healthy response to inference request in 2.2804877758026123s
Received healthy response to inference request in 2.552415609359741s
Received healthy response to inference request in 1.5024356842041016s
Received healthy response to inference request in 1.482839822769165s
Received healthy response to inference request in 1.7428655624389648s
30 requests
9 failed requests
5th percentile: 1.4209813237190247
10th percentile: 1.4779022932052612
20th percentile: 1.5857418060302735
30th percentile: 1.7593505859375
40th percentile: 2.028066301345825
50th percentile: 2.5549432039260864
60th percentile: 5.067726469039915
70th percentile: 11.346320319175685
80th percentile: 20.128784942626954
90th percentile: 20.150415444374083
95th percentile: 20.214002752304076
99th percentile: 20.50014727830887
mean time: 7.955550106366475
%s, retrying in %s seconds...
Received healthy response to inference request in 1.4795808792114258s
Received healthy response to inference request in 1.3695762157440186s
Received healthy response to inference request in 1.3429954051971436s
Received healthy response to inference request in 1.2831604480743408s
Received healthy response to inference request in 1.4113025665283203s
Received healthy response to inference request in 1.4833550453186035s
Received healthy response to inference request in 1.424712896347046s
Received healthy response to inference request in 1.6417336463928223s
Received healthy response to inference request in 1.5064475536346436s
Received healthy response to inference request in 1.3377714157104492s
Received healthy response to inference request in 1.3541529178619385s
Received healthy response to inference request in 1.3448765277862549s
Received healthy response to inference request in 1.5330235958099365s
Received healthy response to inference request in 1.4190945625305176s
2026-03-25T14:50:11.477634+00:00 monitor updated for chaiml-pony-d3a-mv1-son_96936_v1
Received healthy response to inference request in 1.48813796043396s
Received healthy response to inference request in 1.3675956726074219s
Received healthy response to inference request in 1.677492618560791s
Received healthy response to inference request in 1.5461528301239014s
Received healthy response to inference request in 1.4859862327575684s
Received healthy response to inference request in 1.7145164012908936s
Received healthy response to inference request in 1.7942097187042236s
Received healthy response to inference request in 1.684966802597046s
Received healthy response to inference request in 1.3911035060882568s
Received healthy response to inference request in 1.3786115646362305s
Received healthy response to inference request in 1.3418059349060059s
Received healthy response to inference request in 1.4037230014801025s
Received healthy response to inference request in 1.4177196025848389s
Received healthy response to inference request in 1.6510636806488037s
Received healthy response to inference request in 2.1044108867645264s
Received healthy response to inference request in 1.452427864074707s
30 requests
0 failed requests
5th percentile: 1.3395869493484498
10th percentile: 1.3428764581680297
20th percentile: 1.3649071216583253
30th percentile: 1.387355923652649
40th percentile: 1.4151527881622314
50th percentile: 1.4385703802108765
60th percentile: 1.4844075202941895
70th percentile: 1.5144203662872313
80th percentile: 1.6435996532440187
90th percentile: 1.6879217624664307
95th percentile: 1.758347725868225
99th percentile: 2.0144525480270388
mean time: 1.4943902651468912
Pipeline stage StressChecker completed in 288.89s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.92s
Shutdown handler de-registered
chaiml-pony-d3a-mv1-son_96936_v1 status is now deployed due to DeploymentManager action
chaiml-pony-d3a-mv1-son_96936_v1 status is now inactive due to auto deactivation removed underperforming models
chaiml-pony-d3a-mv1-son_96936_v1 status is now torndown due to DeploymentManager action