intel-qwen3-235b-a22b-i_29740

developer_uid: chai_backend_admin

submission_id: intel-qwen3-235b-a22b-i_29740_v4

model_name: intel-qwen3-235b-a22b-i_29740_v4

model_group: Intel/Qwen3-235B-A22B-In

status: torndown

timestamp: 2025-12-19T18:33:43+00:00

num_battles: 1312

num_wins: 690

celo_rating: 1307.14

family_friendly_score: 0.0

family_friendly_standard_error: 0.0

submission_type: basic

model_repo: Intel/Qwen3-235B-A22B-Instruct-2507-int4-mixed-AutoRound

model_architecture: Qwen3MoeForCausalLM

model_num_parameters: 18790207488.0

best_of: 8

max_input_tokens: 1992

max_output_tokens: 80

reward_model: default

display_name: intel-qwen3-235b-a22b-i_29740_v4

ineligible_reason: max_output_tokens!=64

is_internal_developer: True

language_model: Intel/Qwen3-235B-A22B-Instruct-2507-int4-mixed-AutoRound

model_size: 19B

ranking_group: single

us_pacific_date: 2025-12-19

win_ratio: 0.5259146341463414
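
The derived fields above appear to follow from the raw ones. A minimal sketch, assuming `win_ratio` is simply `num_wins / num_battles` and that `model_size` is the parameter count rounded to the nearest billion (both rules are inferred from the values in this record, not confirmed by it; the helper names are hypothetical):

```python
# Raw values copied from the record above.
num_battles = 1312
num_wins = 690
model_num_parameters = 18790207488.0


def compute_win_ratio(wins, battles):
    """Fraction of battles won (assumed definition of win_ratio)."""
    return wins / battles


def format_model_size(num_parameters):
    """Round a parameter count to whole billions, e.g. '19B' (assumed rule)."""
    return f"{round(num_parameters / 1e9)}B"


win_ratio = compute_win_ratio(num_wins, num_battles)
model_size = format_model_size(model_num_parameters)
print(win_ratio, model_size)
```

Note that the 19B figure reflects the stored int4 checkpoint's reported parameter count, not the 235B nominal size in the repo name.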

generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['</s>', '####', '<|assistant|>', '<|im_end|>', '<|user|>', '</think>'], 'max_input_tokens': 1992, 'best_of': 8, 'max_output_tokens': 80}

formatter: {'memory_template': '<|im_start|>system\nYou are {bot_name} engaged in a roleplay with user.<|im_end|>\n', 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n', 'truncate_by_message': True}
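
The formatter templates describe a ChatML-style prompt. The sketch below shows one plausible way they could be assembled into a single prompt string; `build_prompt` and the `(speaker, message)` turn format are hypothetical, and the effect of `truncate_by_message` and `max_input_tokens` is not modelled because the record does not say how truncation is applied.

```python
# Templates copied from the formatter field above.
FORMATTER = {
    "memory_template": "<|im_start|>system\nYou are {bot_name} engaged in a roleplay with user.<|im_end|>\n",
    "prompt_template": "",
    "bot_template": "<|im_start|>assistant\n{message}<|im_end|>\n",
    "user_template": "<|im_start|>user\n{message}<|im_end|>\n",
    "response_template": "<|im_start|>assistant\n",
}


def build_prompt(bot_name, turns):
    """Assemble a prompt from (speaker, message) pairs; speaker is 'bot' or 'user'.

    Hypothetical helper: the ordering (memory, prompt, turns, response) is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name),
        FORMATTER["prompt_template"],
    ]
    for speaker, message in turns:
        key = "bot_template" if speaker == "bot" else "user_template"
        parts.append(FORMATTER[key].format(message=message))
    # The response template leaves the assistant turn open for generation.
    parts.append(FORMATTER["response_template"])
    return "".join(parts)


example_prompt = build_prompt("Ava", [("user", "Hi"), ("bot", "Hello!"), ("user", "How are you?")])
print(example_prompt)
```

Because the prompt ends with an open assistant turn, `<|im_end|>` in `stopping_words` is what would terminate the generated reply under this reading.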

Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.25s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service intel-qwen3-235b-a22b-i-29740-v4
Waiting for inference service intel-qwen3-235b-a22b-i-29740-v4 to be ready
Inference service intel-qwen3-235b-a22b-i-29740-v4 ready after 395.9256718158722s
Pipeline stage VLLMDeployer completed in 397.29s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.004685163497925s
Received healthy response to inference request in 2.197620153427124s
Received healthy response to inference request in 2.184422016143799s
Received healthy response to inference request in 2.1552538871765137s
Received healthy response to inference request in 2.0152764320373535s
Received healthy response to inference request in 2.1602859497070312s
Received healthy response to inference request in 2.1453757286071777s
Received healthy response to inference request in 2.0077438354492188s
Received healthy response to inference request in 2.543323516845703s
Received healthy response to inference request in 2.8221123218536377s
Received healthy response to inference request in 2.11215877532959s
Received healthy response to inference request in 2.1211774349212646s
Received healthy response to inference request in 2.5187089443206787s
Received healthy response to inference request in 1.9574284553527832s
Received healthy response to inference request in 1.9373815059661865s
Received healthy response to inference request in 2.865373134613037s
Received healthy response to inference request in 2.3606555461883545s
Received healthy response to inference request in 2.1026477813720703s
Received healthy response to inference request in 1.9284858703613281s
Received healthy response to inference request in 2.1297104358673096s
Received healthy response to inference request in 2.252575635910034s
Received healthy response to inference request in 2.4293854236602783s
Received healthy response to inference request in 2.308884620666504s
Received healthy response to inference request in 2.1228859424591064s
Received healthy response to inference request in 2.319584608078003s
Received healthy response to inference request in 2.5253920555114746s
Received healthy response to inference request in 2.394927978515625s
Received healthy response to inference request in 2.7363133430480957s
Received healthy response to inference request in 2.02775239944458s
Received healthy response to inference request in 2.467470169067383s
30 requests
0 failed requests
5th percentile: 1.946402633190155
10th percentile: 1.9999594926834106
20th percentile: 2.025257205963135
30th percentile: 2.1184718370437623
40th percentile: 2.1391096115112305
50th percentile: 2.172353982925415
60th percentile: 2.275099229812622
70th percentile: 2.3709372758865355
80th percentile: 2.4777179241180423
90th percentile: 2.5626224994659426
95th percentile: 2.7835027813911437
99th percentile: 2.8528274989128115
mean time: 2.2618333021799724
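
The percentile and mean figures above are consistent with linear interpolation between sorted samples. A minimal sketch that recomputes them from the 30 logged latencies, assuming that interpolation rule (the StressChecker's actual implementation is not shown in this log, and `percentile` is a hypothetical helper):

```python
# Response times in seconds, copied from the StressChecker log lines above.
latencies = [
    2.004685163497925, 2.197620153427124, 2.184422016143799,
    2.1552538871765137, 2.0152764320373535, 2.1602859497070312,
    2.1453757286071777, 2.0077438354492188, 2.543323516845703,
    2.8221123218536377, 2.11215877532959, 2.1211774349212646,
    2.5187089443206787, 1.9574284553527832, 1.9373815059661865,
    2.865373134613037, 2.3606555461883545, 2.1026477813720703,
    1.9284858703613281, 2.1297104358673096, 2.252575635910034,
    2.4293854236602783, 2.308884620666504, 2.1228859424591064,
    2.319584608078003, 2.5253920555114746, 2.394927978515625,
    2.7363133430480957, 2.02775239944458, 2.467470169067383,
]


def percentile(values, q):
    """q-th percentile (0-100) using linear interpolation between order statistics."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)                        # index of the sample at or below the rank
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)
for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

Last-digit differences from the logged values are possible because of floating-point rounding order.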
Pipeline stage StressChecker completed in 71.95s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.34s
Shutdown handler de-registered
intel-qwen3-235b-a22b-i_29740_v4 status is now deployed due to DeploymentManager action
intel-qwen3-235b-a22b-i_29740_v4 status is now torndown due to DeploymentManager action