developer_uid: richhx
submission_id: chaiml-llama31-mer-v2-_44570_v93
model_name: chaiml-llama31-mer-v2-_44570_v93
model_group: ChaiML/llama31-mer-v2-tr
status: deployed
timestamp: 2026-04-01T14:59:15+00:00
num_battles: 42901
num_wins: 21876
celo_rating: 1307.26
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/llama31-mer-v2-try1-new8m-filterv3-full-512seq-bestep-572
model_architecture: LlamaForSequenceClassification
model_num_parameters: 8030261248.0
best_of: 1
max_input_tokens: 512
max_output_tokens: 1
reward_model: default
display_name: chaiml-llama31-mer-v2-_44570_v93
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/llama31-mer-v2-try1-new8m-filterv3-full-512seq-bestep-572
model_size: 8B
ranking_group: single
us_pacific_date: 2026-04-01
win_ratio: 0.509918183725321
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 512, 'best_of': 1, 'max_output_tokens': 1}
formatter: {'memory_template': '', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '', 'truncate_by_message': True}
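
The win_ratio field above is consistent with num_wins divided by num_battles. A minimal check, assuming that is how the field is derived (the code that actually computes it is not shown in this log):

```python
# Values copied from the metadata fields above.
num_battles = 42901
num_wins = 21876

# Assumption: win_ratio is the plain fraction of battles won.
win_ratio = num_wins / num_battles

print(f"win_ratio: {win_ratio:.6f}")
```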
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-llama31-mer-v2-44570-v93-uploader
Waiting for job on chaiml-llama31-mer-v2-44570-v93-uploader to finish
chaiml-llama31-mer-v2-44570-v93-uploader: Using quantization_mode: none
chaiml-llama31-mer-v2-44570-v93-uploader: Downloading snapshot of ChaiML/llama31-mer-v2-try1-new8m-filterv3-full-512seq-bestep-572...
chaiml-llama31-mer-v2-44570-v93-uploader: Downloaded in 12.006s
chaiml-llama31-mer-v2-44570-v93-uploader: Processed model ChaiML/llama31-mer-v2-try1-new8m-filterv3-full-512seq-bestep-572 in 17.780s
chaiml-llama31-mer-v2-44570-v93-uploader: creating bucket guanaco-vllm-models
chaiml-llama31-mer-v2-44570-v93-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-llama31-mer-v2-44570-v93-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-llama31-mer-v2-44570-v93-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-llama31-mer-v2-44570-v93-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-llama31-mer-v2-44570-v93-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-llama31-mer-v2-44570-v93-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-llama31-mer-v2-44570-v93-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-llama31-mer-v2-44570-v93-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-llama31-mer-v2-44570-v93-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-llama31-mer-v2-44570-v93-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-llama31-mer-v2-44570-v93-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-llama31-mer-v2-44570-v93-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-llama31-mer-v2-44570-v93-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-llama31-mer-v2-44570-v93-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-llama31-mer-v2-44570-v93-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-llama31-mer-v2-44570-v93-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
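
The SyntaxWarning lines above come from regex patterns written as ordinary string literals, where sequences such as `\.` and `\w` are not valid Python string escapes. They are warnings only and the upload continues. A sketch of the usual remedy, raw string literals, applied to two of the logged patterns; the bucket name is a hypothetical example, and this is not a patch to the installed S3 package:

```python
import re

# Hypothetical bucket name, used only for illustration.
bucket = "my-.bucket"

# The r prefix keeps the backslash literal, so the regex engine receives
# the two characters `\.` and matches a literal dot, with no warning.
dash_dot = re.compile(r"-\.")
double_dot = re.compile(r"\.\.")

print(bool(dash_dot.search(bucket)))
print(bool(double_dot.search(bucket)))
```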
chaiml-llama31-mer-v2-44570-v93-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-llama31-mer-v2-44570-v93-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-llama31-mer-v2-44570-v93/default
chaiml-llama31-mer-v2-44570-v93-uploader: cp /dev/shm/model_output/README.md s3://guanaco-vllm-models/chaiml-llama31-mer-v2-44570-v93/default/README.md
chaiml-llama31-mer-v2-44570-v93-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-llama31-mer-v2-44570-v93/default/config.json
chaiml-llama31-mer-v2-44570-v93-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-llama31-mer-v2-44570-v93/default/tokenizer_config.json
chaiml-llama31-mer-v2-44570-v93-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-llama31-mer-v2-44570-v93/default/special_tokens_map.json
chaiml-llama31-mer-v2-44570-v93-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-llama31-mer-v2-44570-v93/default/.gitattributes
chaiml-llama31-mer-v2-44570-v93-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-llama31-mer-v2-44570-v93/default/model.safetensors.index.json
chaiml-llama31-mer-v2-44570-v93-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-llama31-mer-v2-44570-v93/default/tokenizer.json
chaiml-llama31-mer-v2-44570-v93-uploader: cp /dev/shm/model_output/model-00004-of-00004.safetensors s3://guanaco-vllm-models/chaiml-llama31-mer-v2-44570-v93/default/model-00004-of-00004.safetensors
chaiml-llama31-mer-v2-44570-v93-uploader: cp /dev/shm/model_output/model-00002-of-00004.safetensors s3://guanaco-vllm-models/chaiml-llama31-mer-v2-44570-v93/default/model-00002-of-00004.safetensors
chaiml-llama31-mer-v2-44570-v93-uploader: cp /dev/shm/model_output/model-00003-of-00004.safetensors s3://guanaco-vllm-models/chaiml-llama31-mer-v2-44570-v93/default/model-00003-of-00004.safetensors
chaiml-llama31-mer-v2-44570-v93-uploader: cp /dev/shm/model_output/model-00001-of-00004.safetensors s3://guanaco-vllm-models/chaiml-llama31-mer-v2-44570-v93/default/model-00001-of-00004.safetensors
Job chaiml-llama31-mer-v2-44570-v93-uploader completed after 42.76s with status: succeeded
Stopping job with name chaiml-llama31-mer-v2-44570-v93-uploader
Pipeline stage VLLMUploader completed in 43.22s
run pipeline stage %s
Running pipeline stage VLLMUploaderAMD
Pipeline stage vllm_upload_amd skipped, reason=not amd cluster
Pipeline stage VLLMUploaderAMD completed in 0.10s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 6.67s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-llama31-mer-v2-44570-v93
Waiting for inference service chaiml-llama31-mer-v2-44570-v93 to be ready
2026-04-01T14:56:26.627499+00:00 monitor updated for chaiml-llama31-mer-v2-_44570_v93
2026-04-01T14:57:26.720492+00:00 monitor updated for chaiml-llama31-mer-v2-_44570_v93
Inference service chaiml-llama31-mer-v2-44570-v93 ready after 120.3543713092804s
Pipeline stage VLLMDeployer completed in 120.81s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.5057742595672607s
2026-04-01T14:58:26.825877+00:00 monitor updated for chaiml-llama31-mer-v2-_44570_v93
Received healthy response to inference request in 6.379832983016968s
Received healthy response to inference request in 7.561291933059692s
Received healthy response to inference request in 4.900315999984741s
Received healthy response to inference request in 4.238720655441284s
5 requests
0 failed requests
5th percentile: 2.8523635387420656
10th percentile: 3.19895281791687
20th percentile: 3.8921313762664798
30th percentile: 4.371039724349975
40th percentile: 4.635677862167358
50th percentile: 4.900315999984741
60th percentile: 5.492122793197632
70th percentile: 6.083929586410522
80th percentile: 6.616124773025513
90th percentile: 7.088708353042603
95th percentile: 7.325000143051147
99th percentile: 7.514033575057983
mean time: 5.11718716621399
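
The percentile figures above can be reproduced from the five logged response times by linear interpolation between sorted samples, the same convention NumPy uses by default. A minimal sketch, assuming that convention; the StressChecker implementation is not shown in this log, and the `percentile` helper is hypothetical:

```python
def percentile(samples, q):
    """Return the q-th percentile using linear interpolation between order statistics."""
    ordered = sorted(samples)
    if not ordered:
        raise ValueError("need at least one sample")
    # Fractional position of the requested percentile in the sorted list.
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


# Response times in seconds, copied from the first StressChecker batch above.
latencies = [
    2.5057742595672607,
    6.379832983016968,
    7.561291933059692,
    4.900315999984741,
    4.238720655441284,
]

for q in (5, 50, 95):
    print(f"{q}th percentile: {percentile(latencies, q):.6f}")
print(f"mean time: {sum(latencies) / len(latencies):.6f}")
```

With only five samples, the upper percentiles are interpolated almost entirely from the two slowest requests, so they are best read as rough indicators.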
%s, retrying in %s seconds...
Received healthy response to inference request in 5.316777229309082s
Received healthy response to inference request in 2.8954319953918457s
Received healthy response to inference request in 9.121878623962402s
Received healthy response to inference request in 6.074979543685913s
Received healthy response to inference request in 4.335594177246094s
5 requests
0 failed requests
5th percentile: 3.1834644317626952
10th percentile: 3.4714968681335447
20th percentile: 4.047561740875244
30th percentile: 4.531830787658691
40th percentile: 4.924304008483887
50th percentile: 5.316777229309082
60th percentile: 5.620058155059814
70th percentile: 5.923339080810547
80th percentile: 6.684359359741212
90th percentile: 7.903118991851807
95th percentile: 8.512498807907104
99th percentile: 9.000002660751342
mean time: 5.548932313919067
Pipeline stage StressChecker completed in 55.62s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.79s
Shutdown handler de-registered
chaiml-llama31-mer-v2-_44570_v93 status is now deployed due to DeploymentManager action