developer_uid: chai_backend_admin
submission_id: chaiml-7b07-69d4-linear_87172_v4
model_name: chaiml-7b07-69d4-linear_87172_v4
model_group: ChaiML/7b07-69d4-linear-
status: torndown
timestamp: 2026-02-10T00:48:18+00:00
num_battles: 10719
num_wins: 4968
celo_rating: 9999.0
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/7b07-69d4-linear-w01-W4A16-G128-AutoRound
model_architecture: MistralForCausalLM
model_num_parameters: 24096691200.0
best_of: 6
max_input_tokens: 1440
max_output_tokens: 60
reward_model: default
display_name: chaiml-7b07-69d4-linear_87172_v4
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/7b07-69d4-linear-w01-W4A16-G128-AutoRound
model_size: 24B
ranking_group: single
us_pacific_date: 2026-02-06
win_ratio: 0.4634760705289673
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.1, 'frequency_penalty': 0.1, 'stopping_words': ['\n', '####\n', '####', 'You:', '</s>'], 'max_input_tokens': 1440, 'best_of': 6, 'max_output_tokens': 60}
formatter: {'memory_template': '', 'prompt_template': '', 'bot_template': '{bot_name}: {message}</s>\n', 'user_template': 'You: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Starting job with name chaiml-7b07-69d4-linear-87172-v4-uploader
Connection pool is full, discarding connection: %s. Connection pool size: %s
Waiting for job on chaiml-7b07-69d4-linear-87172-v4-uploader to finish
Failed to get response for submission chaiml-mistral-24b-2048_25909_v3: ('http://chaiml-llama-1-8b-user-48416-v4-predictor.tenant-chaiml-guanaco.k2.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'upstream connect error or disconnect/reset before headers. reset reason: connection timeout')
chaiml-7b07-69d4-linear-87172-v4-uploader: Using quantization_mode: none
chaiml-7b07-69d4-linear-87172-v4-uploader: Downloading snapshot of ChaiML/7b07-69d4-linear-w01-W4A16-G128-AutoRound...
chaiml-7b07-69d4-linear-87172-v4-uploader: Fetching 12 files: 0%| | 0/12 [00:00<?, ?it/s] Fetching 12 files: 8%|▊ | 1/12 [00:00<00:03, 3.07it/s] Fetching 12 files: 42%|████▏ | 5/12 [00:07<00:10, 1.56s/it] Fetching 12 files: 50%|█████ | 6/12 [00:08<00:08, 1.48s/it] Fetching 12 files: 100%|██████████| 12/12 [00:08<00:00, 1.39it/s]
chaiml-7b07-69d4-linear-87172-v4-uploader: Downloaded in 8.785s
chaiml-7b07-69d4-linear-87172-v4-uploader: Processed model ChaiML/7b07-69d4-linear-w01-W4A16-G128-AutoRound in 14.482s
chaiml-7b07-69d4-linear-87172-v4-uploader: creating bucket guanaco-vllm-models
chaiml-7b07-69d4-linear-87172-v4-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-7b07-69d4-linear-87172-v4-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-7b07-69d4-linear-87172-v4-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-7b07-69d4-linear-87172-v4-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-7b07-69d4-linear-87172-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-7b07-69d4-linear-87172-v4-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-7b07-69d4-linear-87172-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-7b07-69d4-linear-87172-v4-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-7b07-69d4-linear-87172-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-7b07-69d4-linear-87172-v4-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-7b07-69d4-linear-87172-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-7b07-69d4-linear-87172-v4-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-7b07-69d4-linear-87172-v4-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-7b07-69d4-linear-87172-v4-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-7b07-69d4-linear-87172-v4-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-7b07-69d4-linear-87172-v4-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-7b07-69d4-linear-87172-v4-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-7b07-69d4-linear-87172-v4-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-87172-v4
chaiml-7b07-69d4-linear-87172-v4-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-87172-v4/.gitattributes
chaiml-7b07-69d4-linear-87172-v4-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-87172-v4/recipe.yaml
chaiml-7b07-69d4-linear-87172-v4-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-87172-v4/config.json
chaiml-7b07-69d4-linear-87172-v4-uploader: cp /dev/shm/model_output/README.md s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-87172-v4/README.md
chaiml-7b07-69d4-linear-87172-v4-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-87172-v4/generation_config.json
chaiml-7b07-69d4-linear-87172-v4-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-87172-v4/special_tokens_map.json
chaiml-7b07-69d4-linear-87172-v4-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-87172-v4/model.safetensors.index.json
chaiml-7b07-69d4-linear-87172-v4-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-87172-v4/tokenizer_config.json
chaiml-7b07-69d4-linear-87172-v4-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-87172-v4/tokenizer.json
chaiml-7b07-69d4-linear-87172-v4-uploader: cp /dev/shm/model_output/model-00003-of-00003.safetensors s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-87172-v4/model-00003-of-00003.safetensors
chaiml-7b07-69d4-linear-87172-v4-uploader: cp /dev/shm/model_output/model-00002-of-00003.safetensors s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-87172-v4/model-00002-of-00003.safetensors
chaiml-7b07-69d4-linear-87172-v4-uploader: cp /dev/shm/model_output/model-00001-of-00003.safetensors s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-87172-v4/model-00001-of-00003.safetensors
Job chaiml-7b07-69d4-linear-87172-v4-uploader completed after 83.23s with status: succeeded
Stopping job with name chaiml-7b07-69d4-linear-87172-v4-uploader
Pipeline stage VLLMUploader completed in 84.38s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.14s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-7b07-69d4-linear-87172-v4
Waiting for inference service chaiml-7b07-69d4-linear-87172-v4 to be ready
Inference service chaiml-7b07-69d4-linear-87172-v4 ready after 160.95988416671753s
Pipeline stage VLLMDeployer completed in 162.20s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 0.927839994430542s
Received healthy response to inference request in 1.5303924083709717s
Received healthy response to inference request in 1.0515849590301514s
Received healthy response to inference request in 0.9077177047729492s
Received healthy response to inference request in 0.9261713027954102s
Received healthy response to inference request in 0.9310092926025391s
Received healthy response to inference request in 1.1623303890228271s
Received healthy response to inference request in 0.9527585506439209s
Received healthy response to inference request in 1.2727980613708496s
Received healthy response to inference request in 1.3603127002716064s
Received healthy response to inference request in 0.9178810119628906s
Received healthy response to inference request in 1.023664951324463s
Received healthy response to inference request in 1.7078306674957275s
Received healthy response to inference request in 1.0599365234375s
Received healthy response to inference request in 1.5321338176727295s
Received healthy response to inference request in 1.197727918624878s
Received healthy response to inference request in 1.1533143520355225s
Received healthy response to inference request in 1.0096783638000488s
Received healthy response to inference request in 1.1375491619110107s
Received healthy response to inference request in 1.1594419479370117s
Received healthy response to inference request in 1.34206223487854s
Received healthy response to inference request in 0.9563884735107422s
Received healthy response to inference request in 1.5796151161193848s
Received healthy response to inference request in 1.0988121032714844s
Received healthy response to inference request in 1.1623430252075195s
Received healthy response to inference request in 0.9296996593475342s
Received healthy response to inference request in 1.266503095626831s
Received healthy response to inference request in 0.9271245002746582s
Received healthy response to inference request in 0.970409631729126s
Received healthy response to inference request in 1.009908676147461s
30 requests
0 failed requests
5th percentile: 0.9216116428375244
10th percentile: 0.9270291805267334
20th percentile: 0.9307473659515381
30th percentile: 0.9662032842636108
40th percentile: 1.018162441253662
50th percentile: 1.0793743133544922
60th percentile: 1.1557653903961183
70th percentile: 1.172958493232727
80th percentile: 1.286650896072388
90th percentile: 1.5305665493011475
95th percentile: 1.5582485318183898
99th percentile: 1.6706481575965881
mean time: 1.1388313531875611
Pipeline stage StressChecker completed in 37.81s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.67s
Shutdown handler de-registered
chaiml-7b07-69d4-linear_87172_v4 status is now deployed due to DeploymentManager action
chaiml-7b07-69d4-linear_87172_v4 status is now inactive due to auto deactivation removed underperforming models
chaiml-7b07-69d4-linear_87172_v4 status is now torndown due to DeploymentManager action