developer_uid: chai_backend_admin
submission_id: chaiml-2fe5-c13f-linear_99554_v1
model_name: chaiml-2fe5-c13f-linear_99554_v1
model_group: ChaiML/2fe5-c13f-linear-
status: torndown
timestamp: 2025-12-29T22:56:44+00:00
num_battles: 10226
num_wins: 5206
celo_rating: 1297.22
family_friendly_score: 0.5272
family_friendly_standard_error: 0.00706059714188538
submission_type: basic
model_repo: ChaiML/2fe5-c13f-linear-w01-W4A16-G128-AutoRound
model_architecture: MistralForCausalLM
model_num_parameters: 12772070400.0
best_of: 10
max_input_tokens: 1024
max_output_tokens: 64
reward_model: default
display_name: chaiml-2fe5-c13f-linear_99554_v1
is_internal_developer: True
language_model: ChaiML/2fe5-c13f-linear-w01-W4A16-G128-AutoRound
model_size: 13B
ranking_group: single
us_pacific_date: 2025-12-26
win_ratio: 0.5090944650889888
generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['</s>', '\n', '<|eot_id|>', 'User:', '####', 'You:', 'Bot:', '<|im_end|>'], 'max_input_tokens': 1024, 'best_of': 10, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': True}
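A minimal sketch of how the `formatter` templates and the `stopping_words` list above might be applied. The template strings and stop sequences are copied from this record; the function names `build_prompt` and `cut_at_stop`, the `turns` structure, and the character budget `max_chars` are hypothetical. The real pipeline budgets by tokens (`max_input_tokens: 1024`) and most likely enforces stop sequences during decoding, neither of which this record shows in detail. The empty `prompt_template` is skipped.

```python
# Template strings copied from the `formatter` entry of this record.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
    "truncate_by_message": True,
}

# Stop sequences copied from `generation_params`.
STOPPING_WORDS = ["</s>", "\n", "<|eot_id|>", "User:", "####", "You:", "Bot:", "<|im_end|>"]


def build_prompt(bot_name, user_name, memory, turns, max_chars=4000):
    """Assemble a prompt from (speaker, message) pairs.

    `max_chars` is a stand-in (assumption) for the token budget, since no
    tokenizer is available in this sketch.
    """
    head = FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory)
    tail = FORMATTER["response_template"].format(bot_name=bot_name)
    lines = []
    for speaker, message in turns:
        if speaker == "bot":
            lines.append(FORMATTER["bot_template"].format(bot_name=bot_name, message=message))
        else:
            lines.append(FORMATTER["user_template"].format(user_name=user_name, message=message))
    # One reading of truncate_by_message: drop whole messages, oldest first,
    # until the prompt fits, rather than cutting a message in the middle.
    while lines and len(head) + sum(len(line) for line in lines) + len(tail) > max_chars:
        lines.pop(0)
    return head + "".join(lines) + tail


def cut_at_stop(text, stops=STOPPING_WORDS):
    """Truncate generated text at the earliest occurrence of any stop sequence."""
    cut = len(text)
    for stop in stops:
        idx = text.find(stop)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]
```

Because `'\n'` is among the stop sequences, each reply is at most a single line, which fits the one-line-per-message shape of `bot_template` and `user_template`.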
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Connection pool is full, discarding connection: %s. Connection pool size: %s
Pipeline stage VLLMTemplater completed in 0.81s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-2fe5-c13f-linear-99554-v1
Waiting for inference service chaiml-2fe5-c13f-linear-99554-v1 to be ready
Inference service chaiml-2fe5-c13f-linear-99554-v1 ready after 140.66665816307068s
Pipeline stage VLLMDeployer completed in 141.11s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 0.9055490493774414s
Received healthy response to inference request in 0.7897448539733887s
Received healthy response to inference request in 0.8437857627868652s
Received healthy response to inference request in 0.7580313682556152s
Received healthy response to inference request in 0.7498612403869629s
Received healthy response to inference request in 0.8029699325561523s
Received healthy response to inference request in 0.6840424537658691s
Received healthy response to inference request in 0.83127760887146s
Received healthy response to inference request in 0.7140703201293945s
Received healthy response to inference request in 1.09334135055542s
Received healthy response to inference request in 0.7186195850372314s
Received healthy response to inference request in 0.7086126804351807s
Received healthy response to inference request in 0.737809419631958s
Received healthy response to inference request in 0.8629043102264404s
Received healthy response to inference request in 0.7526881694793701s
Received healthy response to inference request in 0.7243638038635254s
Received healthy response to inference request in 0.8065273761749268s
Received healthy response to inference request in 0.7099993228912354s
Received healthy response to inference request in 0.7086770534515381s
Received healthy response to inference request in 0.9153943061828613s
Received healthy response to inference request in 0.7829174995422363s
Received healthy response to inference request in 0.739269495010376s
Received healthy response to inference request in 1.0957517623901367s
Received healthy response to inference request in 0.7104542255401611s
Received healthy response to inference request in 1.018394947052002s
Received healthy response to inference request in 0.7969179153442383s
Received healthy response to inference request in 0.7282705307006836s
Received healthy response to inference request in 0.9760441780090332s
Received healthy response to inference request in 0.7418582439422607s
Received healthy response to inference request in 0.7067151069641113s
30 requests
0 failed requests
5th percentile: 0.7075690150260925
10th percentile: 0.7086706161499023
20th percentile: 0.7133471012115479
30th percentile: 0.7270985126495362
40th percentile: 0.7408227443695068
50th percentile: 0.7553597688674927
60th percentile: 0.7926140785217285
70th percentile: 0.8139524459838866
80th percentile: 0.8714332580566407
90th percentile: 0.9802792549133301
95th percentile: 1.0596154689788817
99th percentile: 1.0950527429580688
mean time: 0.8038287957509359
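The summary figures above can be reproduced from the 30 logged response times. The sketch below assumes percentiles are computed by linear interpolation between order statistics (the default method of NumPy's `percentile`); the log does not say which tool produced them, so that is an inference from the numbers. The helper name `percentile` is hypothetical.

```python
# Response times in seconds, copied from the StressChecker lines above.
LATENCIES = [
    0.9055490493774414, 0.7897448539733887, 0.8437857627868652,
    0.7580313682556152, 0.7498612403869629, 0.8029699325561523,
    0.6840424537658691, 0.83127760887146, 0.7140703201293945,
    1.09334135055542, 0.7186195850372314, 0.7086126804351807,
    0.737809419631958, 0.8629043102264404, 0.7526881694793701,
    0.7243638038635254, 0.8065273761749268, 0.7099993228912354,
    0.7086770534515381, 0.9153943061828613, 0.7829174995422363,
    0.739269495010376, 1.0957517623901367, 0.7104542255401611,
    1.018394947052002, 0.7969179153442383, 0.7282705307006836,
    0.9760441780090332, 0.7418582439422607, 0.7067151069641113,
]


def percentile(values, q):
    """q-th percentile (0-100) using linear interpolation between neighbours."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0   # fractional index into the sorted list
    lo = int(rank)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (rank - lo)


if __name__ == "__main__":
    print(f"{len(LATENCIES)} requests")
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(LATENCIES, q)}")
    print(f"mean time: {sum(LATENCIES) / len(LATENCIES)}")
```

With only 30 samples, the 95th and 99th percentiles are interpolated between the two or three slowest requests, so they are best read as rough indicators of tail latency.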
Pipeline stage StressChecker completed in 26.56s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.65s
Shutdown handler de-registered
chaiml-2fe5-c13f-linear_99554_v1 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 1820.39s
Shutdown handler de-registered
chaiml-2fe5-c13f-linear_99554_v1 status is now torndown due to DeploymentManager action