developer_uid: zonemercy
submission_id: chaiml-pony-d3b-mv1-top2_9386_v5
model_name: chaiml-pony-d3b-mv1-top2_9386_v5
model_group: ChaiML/pony-d3b-mv1-top2
status: torndown
timestamp: 2026-03-31T06:51:50+00:00
num_battles: 10381
num_wins: 5437
celo_rating: 1311.39
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/pony-d3b-mv1-top2-q35b-lr5e6ep2g8
model_architecture: Qwen3_5MoeForConditionalGeneration
model_num_parameters: 33753909248.0
best_of: 8
max_input_tokens: 2048
max_output_tokens: 80
reward_model: default
display_name: chaiml-pony-d3b-mv1-top2_9386_v5
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/pony-d3b-mv1-top2-q35b-lr5e6ep2g8
model_size: 34B
ranking_group: single
us_pacific_date: 2026-03-27
win_ratio: 0.5237453039206242
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.8, 'frequency_penalty': 0.0, 'stopping_words': ['</s>', '####', '<|assistant|>', '<|user|>', '<|im_end|>'], 'max_input_tokens': 2048, 'best_of': 8, 'max_output_tokens': 80}
formatter: {'memory_template': "<|im_start|>system\n{bot_name}'s persona: {memory}<|im_end|>\n", 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': True}
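Note on the formatter above: the sketch below illustrates how these templates could be assembled into a single ChatML-style prompt. It is a minimal illustration, not the serving code. The function name `render_prompt`, the `(speaker, message)` turn representation, and the sample persona are hypothetical. The empty `prompt_template` is skipped, and `truncate_by_message` (dropping whole messages to fit `max_input_tokens`) is not modeled because the tokenizer-side logic does not appear in this record.

```python
# Templates copied from the `formatter` field of this submission.
FORMATTER = {
    "memory_template": "<|im_start|>system\n{bot_name}'s persona: {memory}<|im_end|>\n",
    "prompt_template": "",
    "bot_template": "<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n",
    "user_template": "<|im_start|>user\n{message}<|im_end|>\n",
    "response_template": "<|im_start|>assistant\n{bot_name}:",
}


def render_prompt(bot_name, memory, turns):
    """Assemble a prompt from (speaker, message) turns; speaker is 'user' or 'bot'.

    Hypothetical helper: the real pipeline may order or truncate differently.
    """
    parts = [FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory)]
    for speaker, message in turns:
        key = "bot_template" if speaker == "bot" else "user_template"
        # Extra keyword arguments are ignored by str.format, so the same call
        # works for the user template, which has no {bot_name} field.
        parts.append(FORMATTER[key].format(bot_name=bot_name, message=message))
    # The response template is left open so the model continues after "{bot_name}:".
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Sample values below are made up for illustration.
example_prompt = render_prompt(
    bot_name="Pony",
    memory="cheerful",
    turns=[("user", "Hi"), ("bot", "Hello!")],
)
print(example_prompt)
```

The prompt ends with an unterminated assistant turn, which is consistent with `<|im_end|>` being listed among the `stopping_words` in `generation_params`.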
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-pony-d3b-mv1-top2-9386-v5-uploader
Waiting for job on chaiml-pony-d3b-mv1-top2-9386-v5-uploader to finish
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: Using quantization_mode: fp8
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: Checking if ChaiML/pony-d3b-mv1-top2-q35b-lr5e6ep2g8-FP8 already exists in ChaiML
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: Downloading snapshot of ChaiML/pony-d3b-mv1-top2-q35b-lr5e6ep2g8-FP8...
2026-03-28T03:49:44.184778+00:00 monitor updated for chaiml-pony-d3b-mv1-top2_9386_v5
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: Downloaded in 37.768s
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: Processed model ChaiML/pony-d3b-mv1-top2-q35b-lr5e6ep2g8 in 40.250s
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: creating bucket guanaco-vllm-models
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-pony-d3b-mv1-top2-9386-v5/default
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-pony-d3b-mv1-top2-9386-v5/default/.gitattributes
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-pony-d3b-mv1-top2-9386-v5/default/config.json
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-pony-d3b-mv1-top2-9386-v5/default/generation_config.json
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-pony-d3b-mv1-top2-9386-v5/default/recipe.yaml
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-pony-d3b-mv1-top2-9386-v5/default/tokenizer_config.json
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-pony-d3b-mv1-top2-9386-v5/default/chat_template.jinja
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-pony-d3b-mv1-top2-9386-v5/default/tokenizer.json
2026-03-28T03:50:44.272223+00:00 monitor updated for chaiml-pony-d3b-mv1-top2_9386_v5
chaiml-pony-d3b-mv1-top2-9386-v5-uploader: cp /dev/shm/model_output/model.safetensors s3://guanaco-vllm-models/chaiml-pony-d3b-mv1-top2-9386-v5/default/model.safetensors
Job chaiml-pony-d3b-mv1-top2-9386-v5-uploader completed after 154.48s with status: succeeded
Stopping job with name chaiml-pony-d3b-mv1-top2-9386-v5-uploader
Pipeline stage VLLMUploader completed in 154.91s
run pipeline stage %s
Running pipeline stage VLLMUploaderAMD
Pipeline stage vllm_upload_amd skipped, reason=not amd cluster
Pipeline stage VLLMUploaderAMD completed in 0.09s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 1.76s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-pony-d3b-mv1-top2-9386-v5
Waiting for inference service chaiml-pony-d3b-mv1-top2-9386-v5 to be ready
2026-03-28T03:51:44.629026+00:00 monitor updated for chaiml-pony-d3b-mv1-top2_9386_v5
2026-03-28T03:52:44.723895+00:00 monitor updated for chaiml-pony-d3b-mv1-top2_9386_v5
2026-03-28T03:53:44.809337+00:00 monitor updated for chaiml-pony-d3b-mv1-top2_9386_v5
Inference service chaiml-pony-d3b-mv1-top2-9386-v5 ready after 190.59703755378723s
Pipeline stage VLLMDeployer completed in 191.31s
run pipeline stage %s
Running pipeline stage StressChecker
2026-03-28T03:54:44.900935+00:00 monitor updated for chaiml-pony-d3b-mv1-top2_9386_v5
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
2026-03-28T03:55:45.346685+00:00 monitor updated for chaiml-pony-d3b-mv1-top2_9386_v5
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 3.901750326156616s
Received healthy response to inference request in 1.4265053272247314s
Received healthy response to inference request in 6.141824007034302s
Received healthy response to inference request in 1.108015775680542s
Received healthy response to inference request in 17.661383867263794s
2026-03-28T03:56:45.437057+00:00 monitor updated for chaiml-pony-d3b-mv1-top2_9386_v5
Received healthy response to inference request in 3.737529754638672s
Received healthy response to inference request in 17.15423011779785s
Received healthy response to inference request in 1.0838439464569092s
Received healthy response to inference request in 1.203962802886963s
Received healthy response to inference request in 1.54453444480896s
Received healthy response to inference request in 1.4628474712371826s
Received healthy response to inference request in 1.0548508167266846s
Received healthy response to inference request in 1.0205092430114746s
Received healthy response to inference request in 1.1423850059509277s
Received healthy response to inference request in 1.1658093929290771s
Received healthy response to inference request in 1.068856954574585s
Received healthy response to inference request in 1.0717272758483887s
Received healthy response to inference request in 1.4486033916473389s
Received healthy response to inference request in 1.0655851364135742s
Received healthy response to inference request in 1.2610282897949219s
Received healthy response to inference request in 1.1907436847686768s
Received healthy response to inference request in 1.0841951370239258s
Received healthy response to inference request in 1.1302006244659424s
Received healthy response to inference request in 1.235050916671753s
Received healthy response to inference request in 1.615710973739624s
30 requests
5 failed requests
5th percentile: 1.0596812605857848
10th percentile: 1.068529772758484
20th percentile: 1.0841248989105225
30th percentile: 1.1387296915054321
40th percentile: 1.1986751556396484
50th percentile: 1.3437668085098267
60th percentile: 1.4955222606658933
70th percentile: 3.786795926094055
80th percentile: 17.25566086769104
90th percentile: 20.157117581367494
95th percentile: 20.220591068267822
99th percentile: 20.284905757904053
mean time: 5.799939711888631
%s, retrying in %s seconds...
Received healthy response to inference request in 1.0025427341461182s
Received healthy response to inference request in 1.0158686637878418s
Received healthy response to inference request in 1.028491497039795s
Received healthy response to inference request in 1.8974199295043945s
Received healthy response to inference request in 1.0135462284088135s
Received healthy response to inference request in 1.0834877490997314s
Received healthy response to inference request in 1.626128911972046s
Received healthy response to inference request in 1.0652306079864502s
Received healthy response to inference request in 1.355658769607544s
Received healthy response to inference request in 1.0617437362670898s
Received healthy response to inference request in 1.212543249130249s
2026-03-28T03:57:45.528188+00:00 monitor updated for chaiml-pony-d3b-mv1-top2_9386_v5
Received healthy response to inference request in 1.1231715679168701s
Received healthy response to inference request in 1.178494930267334s
Received healthy response to inference request in 1.1450836658477783s
Received healthy response to inference request in 1.205169439315796s
Received healthy response to inference request in 1.074953317642212s
Received healthy response to inference request in 1.1556732654571533s
Received healthy response to inference request in 1.082723617553711s
Received healthy response to inference request in 1.3540382385253906s
Received healthy response to inference request in 1.078765869140625s
Received healthy response to inference request in 1.1022913455963135s
Received healthy response to inference request in 1.0930585861206055s
Received healthy response to inference request in 1.0692615509033203s
Received healthy response to inference request in 1.1189215183258057s
Received healthy response to inference request in 1.0566017627716064s
Received healthy response to inference request in 1.1719489097595215s
Received healthy response to inference request in 1.1471455097198486s
Received healthy response to inference request in 1.0870931148529053s
Received healthy response to inference request in 1.1379001140594482s
Received healthy response to inference request in 1.175410270690918s
30 requests
0 failed requests
5th percentile: 1.0145913243293763
10th percentile: 1.0272292137145995
20th percentile: 1.0645332336425781
30th percentile: 1.0776221036911011
40th percentile: 1.0856509685516358
50th percentile: 1.1106064319610596
60th percentile: 1.1407735347747803
70th percentile: 1.1605559587478638
80th percentile: 1.1838298320770264
90th percentile: 1.354200291633606
95th percentile: 1.5044173479080192
99th percentile: 1.8187455344200136
mean time: 1.1640122890472413
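Note on the summary above: the percentile and mean figures for this second run can be reproduced from the 30 logged latencies using linear interpolation between order statistics. The sketch below is an illustration under that assumption. The helper name `percentile` is hypothetical and the StressChecker's actual implementation is not shown in this log.

```python
import math

# The 30 healthy response times (seconds) logged for the second stress run.
latencies = [
    1.0025427341461182, 1.0158686637878418, 1.028491497039795,
    1.8974199295043945, 1.0135462284088135, 1.0834877490997314,
    1.626128911972046, 1.0652306079864502, 1.355658769607544,
    1.0617437362670898, 1.212543249130249, 1.1231715679168701,
    1.178494930267334, 1.1450836658477783, 1.205169439315796,
    1.074953317642212, 1.1556732654571533, 1.082723617553711,
    1.3540382385253906, 1.078765869140625, 1.1022913455963135,
    1.0930585861206055, 1.0692615509033203, 1.1189215183258057,
    1.0566017627716064, 1.1719489097595215, 1.1471455097198486,
    1.0870931148529053, 1.1379001140594482, 1.175410270690918,
]


def percentile(values, q):
    """Percentile q (0..100) with linear interpolation between sorted samples."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lo, hi = math.floor(rank), math.ceil(rank)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (rank - lo)


summary = {q: percentile(latencies, q) for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99)}
mean_time = sum(latencies) / len(latencies)

print(f"{len(latencies)} requests")
for q, value in summary.items():
    print(f"{q}th percentile: {value}")
print(f"mean time: {mean_time}")
```

The same interpolation applied to the first run only matches its logged figures if the 5 failed requests are counted at roughly their 20-second timeout, which would explain the 90th percentile and above sitting near 20.2s there.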
Pipeline stage StressChecker completed in 215.31s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.04s
Shutdown handler de-registered
chaiml-pony-d3b-mv1-top2_9386_v5 status is now deployed due to DeploymentManager action
chaiml-pony-d3b-mv1-top2_9386_v5 status is now inactive due to auto deactivation removed underperforming models
chaiml-pony-d3b-mv1-top2_9386_v5 status is now torndown due to DeploymentManager action