developer_uid: richhx
submission_id: chaiml-llama31-mer-v2-_44570_v61
model_name: chaiml-llama31-mer-v2-_44570_v61
model_group: ChaiML/llama31-mer-v2-tr
status: inactive
timestamp: 2026-01-08T22:27:53+00:00
num_battles: 3806134
num_wins: 1911732
celo_rating: 1294.54
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/llama31-mer-v2-try1-new8m-filterv3-full-512seq-bestep-572
model_architecture: LlamaForSequenceClassification
model_num_parameters: 8030261248.0
best_of: 1
max_input_tokens: 512
max_output_tokens: 1
reward_model: default
display_name: chaiml-llama31-mer-v2-_44570_v61
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/llama31-mer-v2-try1-new8m-filterv3-full-512seq-bestep-572
model_size: 8B
ranking_group: single
us_pacific_date: 2026-01-08
win_ratio: 0.5022765882651531
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 512, 'best_of': 1, 'max_output_tokens': 1}
formatter: {'memory_template': '', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '', 'truncate_by_message': True}
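The metadata above can be cross-checked with a short sketch. It assumes that win_ratio is simply num_wins / num_battles and that the formatter templates are applied per message with Python's str.format and then concatenated. Both are assumptions, not confirmed by this record. The function name render_conversation and the sample turns are hypothetical. Truncation (truncate_by_message, max_input_tokens) is not modeled here.

```python
# Values copied from the metadata fields above.
num_battles = 3806134
num_wins = 1911732
logged_win_ratio = 0.5022765882651531

# Assumption: win_ratio is the plain fraction of battles won.
win_ratio = num_wins / num_battles

# Templates copied from the formatter field; the memory, prompt and
# response templates are empty strings, so they contribute no text.
bot_template = "{bot_name}: {message}\n"
user_template = "{user_name}: {message}\n"


def render_conversation(turns, bot_name, user_name):
    """Hypothetical helper: render (speaker, message) turns into one string.

    `speaker` is "bot" or "user". Each turn is formatted with the matching
    template and the results are concatenated in order.
    """
    parts = []
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(bot_template.format(bot_name=bot_name, message=message))
        else:
            parts.append(user_template.format(user_name=user_name, message=message))
    return "".join(parts)


if __name__ == "__main__":
    print(win_ratio)
    print(render_conversation([("user", "hi"), ("bot", "hello")], "Bot", "User"))
```

With every turn ending in "\n" and stopping_words set to ['\n'], a model that generates text would stop at the end of a single line under this reading. Here max_output_tokens is 1 and the architecture is a sequence classifier.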
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.14s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-llama31-mer-v2-44570-v61
Waiting for inference service chaiml-llama31-mer-v2-44570-v61 to be ready
Inference service chaiml-llama31-mer-v2-44570-v61 ready after 80.0526955127716s
Pipeline stage VLLMDeployer completed in 80.38s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 6.19626522064209s
Received healthy response to inference request in 4.098700523376465s
Received healthy response to inference request in 5.385828971862793s
Received healthy response to inference request in 4.094463586807251s
Received healthy response to inference request in 3.5927700996398926s
5 requests
0 failed requests
5th percentile: 3.6931087970733643
10th percentile: 3.793447494506836
20th percentile: 3.9941248893737793
30th percentile: 4.095310974121094
40th percentile: 4.09700574874878
50th percentile: 4.098700523376465
60th percentile: 4.613551902770996
70th percentile: 5.128403282165527
80th percentile: 5.547916221618652
90th percentile: 5.8720907211303714
95th percentile: 6.034177970886231
99th percentile: 6.163847770690918
mean time: 4.673605680465698
Pipeline stage StressChecker completed in 24.45s
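The StressChecker percentiles and mean above are consistent with linear interpolation between the sorted samples, which is also the default method of numpy.percentile. The sketch below recomputes them from the five logged latencies under that assumption. The helper name percentile is illustrative and is not taken from the pipeline.

```python
# Latencies (seconds) copied from the five "Received healthy response" lines.
latencies = [
    6.19626522064209,
    4.098700523376465,
    5.385828971862793,
    4.094463586807251,
    3.5927700996398926,
]


def percentile(values, p):
    """Return the p-th percentile (0-100) using linear interpolation.

    The fractional rank p/100 * (n - 1) is located in the sorted samples
    and the result is interpolated between the two neighbouring values.
    """
    xs = sorted(values)
    rank = (p / 100) * (len(xs) - 1)
    lo = int(rank)                      # index of the lower neighbour
    hi = min(lo + 1, len(xs) - 1)       # index of the upper neighbour
    frac = rank - lo                    # distance from the lower neighbour
    return xs[lo] + (xs[hi] - xs[lo]) * frac


mean_time = sum(latencies) / len(latencies)

if __name__ == "__main__":
    for p in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{p}th percentile: {percentile(latencies, p)}")
    print(f"mean time: {mean_time}")
```

With only five samples, every percentile above the 75th is interpolated between the two slowest requests, so the tail figures mostly reflect the single 6.2 s response.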
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
chaiml-llama31-mer-v2-_44570_v61 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Received signal 15, running shutdown handler
Shutdown handler de-registered
chaiml-llama31-mer-v2-_44570_v61 status is now inactive due to system request