developer_uid: chai_backend_admin
submission_id: qwen-qwen3-235b-a22b-i_47730_v10
model_name: qwen-qwen3-235b-a22b-i_47730_v10
model_group: Qwen/Qwen3-235B-A22B-Ins
status: torndown
timestamp: 2025-12-09T07:23:02+00:00
num_battles: 6314
num_wins: 3178
celo_rating: 1295.33
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: Qwen/Qwen3-235B-A22B-Instruct-2507
model_architecture: Qwen3MoeForCausalLM
model_num_parameters: 18790207488.0
best_of: 8
max_input_tokens: 3000
max_output_tokens: 70
reward_model: default
display_name: qwen-qwen3-235b-a22b-i_47730_v10
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: Qwen/Qwen3-235B-A22B-Instruct-2507
model_size: 19B
ranking_group: single
us_pacific_date: 2025-12-08
win_ratio: 0.5033259423503326
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['<|assistant|>', '####', '</think>', '<|user|>', '</s>', '<|im_end|>'], 'max_input_tokens': 3000, 'best_of': 8, 'max_output_tokens': 70}
formatter: {'memory_template': "<|im_start|>system\nYou are {bot_name}. Engaged in a highly engaging roleplay with user. NSFW allowed. Be descriptive and creative.\n{bot_name}'s persona: {memory}<|im_end|>\n", 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.24s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service qwen-qwen3-235b-a22b-i-47730-v10
Waiting for inference service qwen-qwen3-235b-a22b-i-47730-v10 to be ready
Inference service qwen-qwen3-235b-a22b-i-47730-v10 ready after 426.9995265007019s
Pipeline stage VLLMDeployer completed in 428.07s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.647287607192993s
Received healthy response to inference request in 2.867872714996338s
Received healthy response to inference request in 2.8104820251464844s
Received healthy response to inference request in 1.9033796787261963s
Received healthy response to inference request in 2.581038475036621s
Received healthy response to inference request in 1.8005220890045166s
Received healthy response to inference request in 1.5954596996307373s
Received healthy response to inference request in 1.7743034362792969s
Received healthy response to inference request in 2.254096746444702s
Received healthy response to inference request in 1.9513568878173828s
Received healthy response to inference request in 2.2277469635009766s
Received healthy response to inference request in 1.9323523044586182s
Received healthy response to inference request in 2.4738705158233643s
Received healthy response to inference request in 2.039041519165039s
Received healthy response to inference request in 2.497328996658325s
Received healthy response to inference request in 1.8432574272155762s
Received healthy response to inference request in 1.6127543449401855s
Received healthy response to inference request in 2.888411521911621s
Received healthy response to inference request in 2.004469871520996s
Received healthy response to inference request in 1.6986019611358643s
Received healthy response to inference request in 1.6179938316345215s
Received healthy response to inference request in 1.7543272972106934s
Received healthy response to inference request in 1.6791338920593262s
Received healthy response to inference request in 2.310152292251587s
Received healthy response to inference request in 1.699885606765747s
Received healthy response to inference request in 1.7773547172546387s
Received healthy response to inference request in 1.7399682998657227s
Received healthy response to inference request in 1.6348495483398438s
Received healthy response to inference request in 2.115325689315796s
Received healthy response to inference request in 1.6548786163330078s
30 requests
0 failed requests
5th percentile: 1.6151121139526368
10th percentile: 1.6331639766693116
20th percentile: 1.6947083473205566
30th percentile: 1.7500195980072022
40th percentile: 1.7912551403045656
50th percentile: 1.9178659915924072
60th percentile: 2.018298530578613
70th percentile: 2.235651898384094
80th percentile: 2.4785622119903565
90th percentile: 2.6636070489883426
95th percentile: 2.8420469045639036
99th percentile: 2.882455267906189
mean time: 2.0462501525878904
Pipeline stage StressChecker completed in 77.65s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.15s
Shutdown handler de-registered
qwen-qwen3-235b-a22b-i_47730_v10 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
clean up pipeline due to error=DeploymentChecksError("('http://qwen-qwen3-235b-a22b-i-47730-v10-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/completions', '')")
Shutdown handler de-registered
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
clean up pipeline due to error=DeploymentChecksError("('http://qwen-qwen3-235b-a22b-i-47730-v10-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/completions', '')")
Shutdown handler de-registered
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
clean up pipeline due to error=DeploymentChecksError("('http://qwen-qwen3-235b-a22b-i-47730-v10-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/completions', '')")
Shutdown handler de-registered