qwen-qwen3-4b-instruct-2507

developer_uid: zonemercy

submission_id: qwen-qwen3-4b-instruct-2507_v9

model_name: qwen-qwen3-4b-instruct-2507_v7

model_group: Qwen/Qwen3-4B-Instruct-2

status: inactive

timestamp: 2026-02-15T18:47:28+00:00

num_battles: 10518

num_wins: 3700

celo_rating: 1187.73

family_friendly_score: 0.0

family_friendly_standard_error: 0.0

submission_type: basic

model_repo: Qwen/Qwen3-4B-Instruct-2507

model_architecture: Qwen3ForCausalLM

model_num_parameters: 4057520640.0

best_of: 8

max_input_tokens: 2048

max_output_tokens: 64

reward_model: default

display_name: qwen-qwen3-4b-instruct-2507_v7

is_internal_developer: True

language_model: Qwen/Qwen3-4B-Instruct-2507

model_size: 4B

ranking_group: single

us_pacific_date: 2026-02-15

win_ratio: 0.35177790454459024
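
The win_ratio value is consistent with num_wins divided by num_battles. A minimal check, assuming that is how the field is derived (the record itself does not state the formula):

```python
# Values copied from the num_wins and num_battles fields of this record.
num_wins = 3700
num_battles = 10518

# Assumption: win_ratio is plain wins / battles, with no smoothing or weighting.
win_ratio = num_wins / num_battles
print(win_ratio)
```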

generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 2048, 'best_of': 8, 'max_output_tokens': 64}

formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': True}
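
As a rough illustration of how the formatter templates might combine into a single prompt, here is a sketch. The function build_prompt, its arguments, and the assembly order (memory, prompt, chat turns, response prefix) are assumptions made for illustration, not the platform's actual implementation. Truncation by truncate_by_message and max_input_tokens is not modeled.

```python
# Template strings copied from the formatter field of this record.
formatter = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
    "truncate_by_message": True,  # not used in this sketch
}


def build_prompt(bot_name, user_name, memory, turns):
    """Hypothetical helper: join the templates into one prompt string.

    turns is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [formatter["memory_template"].format(bot_name=bot_name, memory=memory)]
    parts.append(formatter["prompt_template"])  # empty for this submission
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(formatter["bot_template"].format(bot_name=bot_name, message=message))
        else:
            parts.append(formatter["user_template"].format(user_name=user_name, message=message))
    # The prompt ends with the bot's name so the model writes the next bot line.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


prompt = build_prompt("Ava", "Sam", "A friendly guide.", [("user", "Hi"), ("bot", "Hello!")])
print(prompt)
```

Because stopping_words is ['\n'] and each chat template ends with a newline, a completion under this layout would stop at the end of a single bot line.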


Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name qwen-qwen3-4b-instruct-2507-v9-uploader
Waiting for job on qwen-qwen3-4b-instruct-2507-v9-uploader to finish
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
qwen-qwen3-4b-instruct-2507-v9-uploader: Using quantization_mode: none
qwen-qwen3-4b-instruct-2507-v9-uploader: Downloading snapshot of Qwen/Qwen3-4B-Instruct-2507...
qwen-qwen3-4b-instruct-2507-v9-uploader: Downloaded in 3.887s
qwen-qwen3-4b-instruct-2507-v9-uploader: Processed model Qwen/Qwen3-4B-Instruct-2507 in 7.186s
qwen-qwen3-4b-instruct-2507-v9-uploader: creating bucket guanaco-vllm-models
qwen-qwen3-4b-instruct-2507-v9-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
qwen-qwen3-4b-instruct-2507-v9-uploader:   RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
qwen-qwen3-4b-instruct-2507-v9-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
qwen-qwen3-4b-instruct-2507-v9-uploader:   RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
qwen-qwen3-4b-instruct-2507-v9-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
qwen-qwen3-4b-instruct-2507-v9-uploader:   invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
qwen-qwen3-4b-instruct-2507-v9-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
qwen-qwen3-4b-instruct-2507-v9-uploader:   invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
qwen-qwen3-4b-instruct-2507-v9-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
qwen-qwen3-4b-instruct-2507-v9-uploader:   if re.search("-\.", bucket, re.UNICODE):
qwen-qwen3-4b-instruct-2507-v9-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
qwen-qwen3-4b-instruct-2507-v9-uploader:   if re.search("\.\.", bucket, re.UNICODE):
qwen-qwen3-4b-instruct-2507-v9-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
qwen-qwen3-4b-instruct-2507-v9-uploader:   _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
qwen-qwen3-4b-instruct-2507-v9-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
qwen-qwen3-4b-instruct-2507-v9-uploader:   wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
qwen-qwen3-4b-instruct-2507-v9-uploader: Bucket 's3://guanaco-vllm-models/' created
qwen-qwen3-4b-instruct-2507-v9-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/qwen-qwen3-4b-instruct-2507-v9/default
qwen-qwen3-4b-instruct-2507-v9-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/qwen-qwen3-4b-instruct-2507-v9/default/.gitattributes
qwen-qwen3-4b-instruct-2507-v9-uploader: cp /dev/shm/model_output/LICENSE s3://guanaco-vllm-models/qwen-qwen3-4b-instruct-2507-v9/default/LICENSE
qwen-qwen3-4b-instruct-2507-v9-uploader: cp /dev/shm/model_output/README.md s3://guanaco-vllm-models/qwen-qwen3-4b-instruct-2507-v9/default/README.md
qwen-qwen3-4b-instruct-2507-v9-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/qwen-qwen3-4b-instruct-2507-v9/default/tokenizer_config.json
qwen-qwen3-4b-instruct-2507-v9-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/qwen-qwen3-4b-instruct-2507-v9/default/config.json
qwen-qwen3-4b-instruct-2507-v9-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/qwen-qwen3-4b-instruct-2507-v9/default/generation_config.json
qwen-qwen3-4b-instruct-2507-v9-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/qwen-qwen3-4b-instruct-2507-v9/default/model.safetensors.index.json
qwen-qwen3-4b-instruct-2507-v9-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/qwen-qwen3-4b-instruct-2507-v9/default/vocab.json
qwen-qwen3-4b-instruct-2507-v9-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/qwen-qwen3-4b-instruct-2507-v9/default/merges.txt
qwen-qwen3-4b-instruct-2507-v9-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/qwen-qwen3-4b-instruct-2507-v9/default/tokenizer.json
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
qwen-qwen3-4b-instruct-2507-v9-uploader: cp /dev/shm/model_output/model-00003-of-00003.safetensors s3://guanaco-vllm-models/qwen-qwen3-4b-instruct-2507-v9/default/model-00003-of-00003.safetensors
qwen-qwen3-4b-instruct-2507-v9-uploader: cp /dev/shm/model_output/model-00002-of-00003.safetensors s3://guanaco-vllm-models/qwen-qwen3-4b-instruct-2507-v9/default/model-00002-of-00003.safetensors
qwen-qwen3-4b-instruct-2507-v9-uploader: cp /dev/shm/model_output/model-00001-of-00003.safetensors s3://guanaco-vllm-models/qwen-qwen3-4b-instruct-2507-v9/default/model-00001-of-00003.safetensors
Job qwen-qwen3-4b-instruct-2507-v9-uploader completed after 83.78s with status: succeeded
Stopping job with name qwen-qwen3-4b-instruct-2507-v9-uploader
Pipeline stage VLLMUploader completed in 84.30s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.16s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service qwen-qwen3-4b-instruct-2507-v9
Waiting for inference service qwen-qwen3-4b-instruct-2507-v9 to be ready
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Inference service qwen-qwen3-4b-instruct-2507-v9 ready after 160.98072576522827s
Pipeline stage VLLMDeployer completed in 161.92s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.5484387874603271s
Received healthy response to inference request in 1.1283378601074219s
Received healthy response to inference request in 1.190378189086914s
Received healthy response to inference request in 0.9563589096069336s
Received healthy response to inference request in 1.345306634902954s
Received healthy response to inference request in 1.457184076309204s
Received healthy response to inference request in 1.0854313373565674s
Received healthy response to inference request in 1.0352444648742676s
Received healthy response to inference request in 1.2496378421783447s
Received healthy response to inference request in 1.3782968521118164s
Received healthy response to inference request in 0.9927332401275635s
Received healthy response to inference request in 1.6495885848999023s
Received healthy response to inference request in 1.1296882629394531s
Received healthy response to inference request in 1.0697062015533447s
Received healthy response to inference request in 1.3358168601989746s
Received healthy response to inference request in 1.4277980327606201s
Received healthy response to inference request in 1.1466217041015625s
Received healthy response to inference request in 1.1102957725524902s
Received healthy response to inference request in 1.1544380187988281s
Received healthy response to inference request in 2.195909023284912s
Received healthy response to inference request in 1.3368232250213623s
Received healthy response to inference request in 1.3584883213043213s
Received healthy response to inference request in 1.1390371322631836s
Received healthy response to inference request in 1.0187220573425293s
Received healthy response to inference request in 1.499568223953247s
Received healthy response to inference request in 1.37324857711792s
Received healthy response to inference request in 1.4217314720153809s
Received healthy response to inference request in 1.3387811183929443s
Received healthy response to inference request in 1.6908080577850342s
Received healthy response to inference request in 1.1301140785217285s
30 requests
0 failed requests
5th percentile: 1.004428207874298
10th percentile: 1.0335922241210938
20th percentile: 1.1053228855133057
30th percentile: 1.129986333847046
40th percentile: 1.151311492919922
50th percentile: 1.2927273511886597
60th percentile: 1.3413913249969482
70th percentile: 1.374763059616089
80th percentile: 1.433675241470337
90th percentile: 1.5585537672042848
95th percentile: 1.6722592949867248
99th percentile: 2.0494297432899478
mean time: 1.2964844306310017
Pipeline stage StressChecker completed in 45.45s
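
The StressChecker summary above can be reproduced from the 30 logged response times. The sketch below assumes the percentiles use linear interpolation between closest ranks (the default in NumPy's percentile function). The log does not say which method StressChecker uses, and the helper name percentile is chosen here for illustration.

```python
# Response times in seconds, copied from the StressChecker log lines above.
latencies = [
    1.5484387874603271, 1.1283378601074219, 1.190378189086914,
    0.9563589096069336, 1.345306634902954, 1.457184076309204,
    1.0854313373565674, 1.0352444648742676, 1.2496378421783447,
    1.3782968521118164, 0.9927332401275635, 1.6495885848999023,
    1.1296882629394531, 1.0697062015533447, 1.3358168601989746,
    1.4277980327606201, 1.1466217041015625, 1.1102957725524902,
    1.1544380187988281, 2.195909023284912, 1.3368232250213623,
    1.3584883213043213, 1.1390371322631836, 1.0187220573425293,
    1.499568223953247, 1.37324857711792, 1.4217314720153809,
    1.3387811183929443, 1.6908080577850342, 1.1301140785217285,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


mean_time = sum(latencies) / len(latencies)
for q in (5, 50, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```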
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.67s
Shutdown handler de-registered
qwen-qwen3-4b-instruct-2507_v9 status is now deployed due to DeploymentManager action
qwen-qwen3-4b-instruct-2507_v9 status is now inactive due to auto deactivation removed underperforming models