chaiml-kimid-v9-opusdv_83165

developer_uid: richhx

submission_id: chaiml-kimid-v9-opusdv_83165_v11

model_name: chaiml-kimid-v9-opusdv_83165_v11

model_group: ChaiML/kimid-v9-opusdv1-

status: torndown

timestamp: 2026-01-17T00:30:36+00:00

num_battles: 14775

num_wins: 8172

celo_rating: 1334.06

family_friendly_score: 0.5464

family_friendly_standard_error: 0.007040554523615309

submission_type: basic

model_repo: ChaiML/kimid-v9-opusdv1-lr5e6ep2r64g4b01-int4-mixed

model_architecture: Qwen3MoeForCausalLM

model_num_parameters: 18790207488.0

best_of: 4

max_input_tokens: 2048

max_output_tokens: 80

reward_model: default

display_name: chaiml-kimid-v9-opusdv_83165_v11

ineligible_reason: max_output_tokens!=64

is_internal_developer: True

language_model: ChaiML/kimid-v9-opusdv1-lr5e6ep2r64g4b01-int4-mixed

model_size: 19B

ranking_group: single

us_pacific_date: 2026-01-13

win_ratio: 0.5530964467005076

generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['<|user|>', '<|im_end|>', '<|assistant|>', '</think>', '####', '</s>'], 'max_input_tokens': 2048, 'best_of': 4, 'max_output_tokens': 80}

formatter: {'memory_template': "<|im_start|>system\n{bot_name}'s persona: {memory}<|im_end|>\n", 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': True}

Resubmit model

Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.65s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-kimid-v9-opusdv-83165-v11
Waiting for inference service chaiml-kimid-v9-opusdv-83165-v11 to be ready
HTTP Request: %s %s "%s %d %s"
Inference service chaiml-kimid-v9-opusdv-83165-v11 ready after 201.20544147491455s
Pipeline stage VLLMDeployer completed in 201.54s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.942305326461792s
Received healthy response to inference request in 1.7511699199676514s
Received healthy response to inference request in 1.8811352252960205s
Received healthy response to inference request in 2.1964361667633057s
Received healthy response to inference request in 1.8444628715515137s
Received healthy response to inference request in 2.0622503757476807s
Received healthy response to inference request in 1.7031922340393066s
Received healthy response to inference request in 2.393108367919922s
Received healthy response to inference request in 1.934751272201538s
Received healthy response to inference request in 2.172069787979126s
Received healthy response to inference request in 2.0413601398468018s
Received healthy response to inference request in 1.7502644062042236s
Received healthy response to inference request in 2.1479899883270264s
Received healthy response to inference request in 2.0817089080810547s
Received healthy response to inference request in 2.5489349365234375s
Received healthy response to inference request in 1.7008192539215088s
Received healthy response to inference request in 1.8292815685272217s
Received healthy response to inference request in 2.167592763900757s
Received healthy response to inference request in 1.7215831279754639s
Received healthy response to inference request in 2.3878540992736816s
Received healthy response to inference request in 2.256096839904785s
Received healthy response to inference request in 1.8489282131195068s
Received healthy response to inference request in 1.7971627712249756s
Received healthy response to inference request in 2.323948860168457s
Received healthy response to inference request in 1.789039134979248s
Received healthy response to inference request in 2.100214719772339s
Received healthy response to inference request in 1.716554880142212s
Received healthy response to inference request in 1.747901439666748s
Received healthy response to inference request in 1.8846538066864014s
Received healthy response to inference request in 1.6736230850219727s
30 requests
0 failed requests
5th percentile: 1.7018870949745177
10th percentile: 1.7152186155319213
20th percentile: 1.7497918128967285
30th percentile: 1.7947256803512572
40th percentile: 1.8471420764923097
50th percentile: 1.9097025394439697
60th percentile: 2.0497162342071533
70th percentile: 2.114547300338745
80th percentile: 2.176943063735962
90th percentile: 2.3303393840789797
95th percentile: 2.390743947029114
99th percentile: 2.503745231628418
mean time: 1.9798798163731892
Pipeline stage StressChecker completed in 62.62s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
chaiml-kimid-v9-opusdv_83165_v11 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 5343.60s
Shutdown handler de-registered
chaiml-kimid-v9-opusdv_83165_v11 status is now torndown due to DeploymentManager action