developer_uid: chai_backend_admin
submission_id: chaiml-kimid-v8b-kimidv_63800_v1
model_name: chaiml-kimid-v8b-kimidv_63800_v1
model_group: ChaiML/kimid-v8b-kimidv5
status: torndown
timestamp: 2025-12-23T18:14:25+00:00
num_battles: 7924
num_wins: 4424
celo_rating: 1333.89
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/kimid-v8b-kimidv5a-lr5e6ep2r64g4b01-int4-mixed
model_architecture: Qwen3MoeForCausalLM
model_num_parameters: 18790207488.0
best_of: 6
max_input_tokens: 2048
max_output_tokens: 72
reward_model: default
display_name: chaiml-kimid-v8b-kimidv_63800_v1
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/kimid-v8b-kimidv5a-lr5e6ep2r64g4b01-int4-mixed
model_size: 19B
ranking_group: single
us_pacific_date: 2025-12-23
win_ratio: 0.558303886925795
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['</think>', '<|im_end|>', '####', '<|assistant|>', '</s>', '<|user|>'], 'max_input_tokens': 2048, 'best_of': 6, 'max_output_tokens': 72}
formatter: {'memory_template': "<|im_start|>system\n{bot_name}'s persona: {memory}<|im_end|>\n", 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.13s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-kimid-v8b-kimidv-63800-v1
Waiting for inference service chaiml-kimid-v8b-kimidv-63800-v1 to be ready
Failed to get response for submission chaiml-4d70-fd43-linear-w01_v8: ('http://guanaco-model-mesh-load-balancer.model-mesh.k2.chaiverse.com/models/chaiml-4d70-fd43-linear-w01_v8/predict', '{"detail":"1 validation error for RuntimeResponse\\npredictions\\n Field required [type=missing, input_value={\'detail\': \\"503, message=...linear-w01_v8/predict\'\\"}, input_type=dict]\\n For further information visit https://errors.pydantic.dev/2.11/v/missing"}')
Inference service chaiml-kimid-v8b-kimidv-63800-v1 ready after 492.7998733520508s
Pipeline stage VLLMDeployer completed in 493.33s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.0476620197296143s
Received healthy response to inference request in 2.0755813121795654s
Received healthy response to inference request in 2.1321945190429688s
Received healthy response to inference request in 1.9116508960723877s
Received healthy response to inference request in 2.4197118282318115s
Received healthy response to inference request in 2.003931999206543s
Received healthy response to inference request in 1.8618335723876953s
Received healthy response to inference request in 2.3191773891448975s
Received healthy response to inference request in 2.340325117111206s
Received healthy response to inference request in 1.9342982769012451s
Received healthy response to inference request in 2.4177663326263428s
Received healthy response to inference request in 2.712271213531494s
Received healthy response to inference request in 1.937851905822754s
Received healthy response to inference request in 2.1022145748138428s
Received healthy response to inference request in 2.272135019302368s
Received healthy response to inference request in 2.178070068359375s
Received healthy response to inference request in 1.9738388061523438s
Received healthy response to inference request in 1.943277359008789s
Received healthy response to inference request in 1.9675514698028564s
Received healthy response to inference request in 1.902339220046997s
Received healthy response to inference request in 1.9589390754699707s
Received healthy response to inference request in 1.9544365406036377s
Received healthy response to inference request in 1.8197717666625977s
Received healthy response to inference request in 2.1011552810668945s
Received healthy response to inference request in 2.0696828365325928s
Received healthy response to inference request in 2.2260780334472656s
Received healthy response to inference request in 2.3904011249542236s
Received healthy response to inference request in 2.124558448791504s
Received healthy response to inference request in 1.87568998336792s
Received healthy response to inference request in 2.0299932956695557s
30 requests
0 failed requests
5th percentile: 1.8680689573287963
10th percentile: 1.8996742963790894
20th percentile: 1.9371411800384521
30th percentile: 1.9575883150100708
40th percentile: 1.9918947219848633
50th percentile: 2.0586724281311035
60th percentile: 2.1015789985656737
70th percentile: 2.1459571838378904
80th percentile: 2.281543493270874
90th percentile: 2.3931376457214357
95th percentile: 2.4188363552093506
99th percentile: 2.6274289917945866
mean time: 2.1001463095347086
Pipeline stage StressChecker completed in 65.78s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.66s
Shutdown handler de-registered
chaiml-kimid-v8b-kimidv_63800_v1 status is now deployed due to DeploymentManager action
chaiml-kimid-v8b-kimidv_63800_v1 status is now inactive due to auto deactivation removed underperforming models
chaiml-kimid-v8b-kimidv_63800_v1 status is now torndown due to DeploymentManager action