developer_uid: chai_backend_admin
submission_id: chaiml-02f4-69d4-linear_30131_v4
model_name: chaiml-02f4-69d4-linear_30131_v4
model_group: ChaiML/02f4-69d4-linear-
status: torndown
timestamp: 2026-02-10T20:21:35+00:00
num_battles: 10453
num_wins: 5344
celo_rating: 1311.87
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/02f4-69d4-linear-w01-FP8
model_architecture: MistralForCausalLM
model_num_parameters: 24096691200.0
best_of: 8
max_input_tokens: 1024
max_output_tokens: 64
reward_model: default
display_name: chaiml-02f4-69d4-linear_30131_v4
is_internal_developer: True
language_model: ChaiML/02f4-69d4-linear-w01-FP8
model_size: 24B
ranking_group: single
us_pacific_date: 2026-02-07
win_ratio: 0.5112407921170956
generation_params: {'temperature': 0.7, 'top_p': 0.95, 'min_p': 0.025, 'top_k': 80, 'presence_penalty': 0.4, 'frequency_penalty': 0.4, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '<|im_start|>system\n{memory}<|im_end|>\n', 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{user_name}: {message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': True}
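The formatter templates above can be read as a recipe for assembling a ChatML-style prompt. The following is a minimal sketch of that assembly, not the platform's actual code: the function name `render_prompt`, the `(speaker, message)` turn representation, and the sample names are hypothetical. It skips `prompt_template` because it is empty in this submission, and it does not model `truncate_by_message` or the 1024-token input limit.

```python
# Template strings copied from the formatter field of this submission.
FORMATTER = {
    "memory_template": "<|im_start|>system\n{memory}<|im_end|>\n",
    "prompt_template": "",
    "bot_template": "<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n",
    "user_template": "<|im_start|>user\n{user_name}: {message}<|im_end|>\n",
    "response_template": "<|im_start|>assistant\n{bot_name}:",
}


def render_prompt(memory, turns, bot_name, user_name, formatter=FORMATTER):
    """Assemble a prompt from (speaker, message) turns.

    Hypothetical helper: `speaker` is "bot" or "user". The real pipeline
    may order or truncate the pieces differently.
    """
    parts = [formatter["memory_template"].format(memory=memory)]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response template leaves the assistant turn open for the model.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


prompt = render_prompt(
    memory="You are Ava.",
    turns=[("user", "hi"), ("bot", "hello")],
    bot_name="Ava",
    user_name="Sam",
)
print(prompt)
```

Because `stopping_words` is `['\n']` in the generation parameters, a completion following the open `{bot_name}:` prefix would presumably end at the first newline, yielding a single-line reply.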
Shutdown handler not registered because Python interpreter is not running in the main thread
Running pipeline stage VLLMUploader
Starting job with name chaiml-02f4-69d4-linear-30131-v4-uploader
Waiting for job on chaiml-02f4-69d4-linear-30131-v4-uploader to finish
chaiml-02f4-69d4-linear-30131-v4-uploader: Using quantization_mode: none
chaiml-02f4-69d4-linear-30131-v4-uploader: Downloading snapshot of ChaiML/02f4-69d4-linear-w01-FP8...
chaiml-02f4-69d4-linear-30131-v4-uploader: Fetching 14 files: 100%|██████████| 14/14 [00:11<00:00, 1.19it/s]
chaiml-02f4-69d4-linear-30131-v4-uploader: Downloaded in 11.884s
chaiml-02f4-69d4-linear-30131-v4-uploader: Processed model ChaiML/02f4-69d4-linear-w01-FP8 in 20.816s
chaiml-02f4-69d4-linear-30131-v4-uploader: creating bucket guanaco-vllm-models
chaiml-02f4-69d4-linear-30131-v4-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-02f4-69d4-linear-30131-v4-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-02f4-69d4-linear-30131-v4-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-02f4-69d4-linear-30131-v4-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-02f4-69d4-linear-30131-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-02f4-69d4-linear-30131-v4-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-02f4-69d4-linear-30131-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-02f4-69d4-linear-30131-v4-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-02f4-69d4-linear-30131-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-02f4-69d4-linear-30131-v4-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-02f4-69d4-linear-30131-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-02f4-69d4-linear-30131-v4-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-02f4-69d4-linear-30131-v4-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-02f4-69d4-linear-30131-v4-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-02f4-69d4-linear-30131-v4-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-02f4-69d4-linear-30131-v4-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
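The SyntaxWarning lines above all come from regular expressions written as ordinary string literals, where sequences such as `\.` or `\w` are not valid Python string escapes. They did not stop the upload, which continued and succeeded. The usual remedy is a raw string literal. The sketch below illustrates this on two of the patterns quoted in the warnings; the function names are hypothetical and this is not a patch to the installed S3 package.

```python
import re

# A raw string keeps the backslash literally, so r"\." is the same two
# characters as the explicitly escaped "\\." and triggers no warning.
assert r"\." == "\\."


def has_double_dot(bucket):
    """Raw-string form of the check quoted as re.search("\\.\\.", bucket, re.UNICODE)."""
    return re.search(r"\.\.", bucket, re.UNICODE) is not None


def has_dash_dot(bucket):
    """Raw-string form of the check quoted as re.search("-\\.", bucket, re.UNICODE)."""
    return re.search(r"-\.", bucket, re.UNICODE) is not None


print(has_double_dot("my..bucket"))           # two consecutive dots
print(has_double_dot("guanaco-vllm-models"))  # no dots at all
print(has_dash_dot("my-.bucket"))             # dash followed by a dot
```

The compiled patterns are identical either way; only the source spelling changes, so behavior is preserved while the warning goes away.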
chaiml-02f4-69d4-linear-30131-v4-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-02f4-69d4-linear-30131-v4-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-02f4-69d4-linear-30131-v4
chaiml-02f4-69d4-linear-30131-v4-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-02f4-69d4-linear-30131-v4/.gitattributes
chaiml-02f4-69d4-linear-30131-v4-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-02f4-69d4-linear-30131-v4/generation_config.json
chaiml-02f4-69d4-linear-30131-v4-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-02f4-69d4-linear-30131-v4/config.json
chaiml-02f4-69d4-linear-30131-v4-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-02f4-69d4-linear-30131-v4/recipe.yaml
chaiml-02f4-69d4-linear-30131-v4-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-02f4-69d4-linear-30131-v4/special_tokens_map.json
chaiml-02f4-69d4-linear-30131-v4-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-02f4-69d4-linear-30131-v4/model.safetensors.index.json
chaiml-02f4-69d4-linear-30131-v4-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-02f4-69d4-linear-30131-v4/tokenizer_config.json
chaiml-02f4-69d4-linear-30131-v4-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-02f4-69d4-linear-30131-v4/tokenizer.json
chaiml-02f4-69d4-linear-30131-v4-uploader: cp /dev/shm/model_output/model-00006-of-00006.safetensors s3://guanaco-vllm-models/chaiml-02f4-69d4-linear-30131-v4/model-00006-of-00006.safetensors
chaiml-02f4-69d4-linear-30131-v4-uploader: cp /dev/shm/model_output/model-00005-of-00006.safetensors s3://guanaco-vllm-models/chaiml-02f4-69d4-linear-30131-v4/model-00005-of-00006.safetensors
chaiml-02f4-69d4-linear-30131-v4-uploader: cp /dev/shm/model_output/model-00004-of-00006.safetensors s3://guanaco-vllm-models/chaiml-02f4-69d4-linear-30131-v4/model-00004-of-00006.safetensors
chaiml-02f4-69d4-linear-30131-v4-uploader: cp /dev/shm/model_output/model-00001-of-00006.safetensors s3://guanaco-vllm-models/chaiml-02f4-69d4-linear-30131-v4/model-00001-of-00006.safetensors
chaiml-02f4-69d4-linear-30131-v4-uploader: cp /dev/shm/model_output/model-00003-of-00006.safetensors s3://guanaco-vllm-models/chaiml-02f4-69d4-linear-30131-v4/model-00003-of-00006.safetensors
chaiml-02f4-69d4-linear-30131-v4-uploader: cp /dev/shm/model_output/model-00002-of-00006.safetensors s3://guanaco-vllm-models/chaiml-02f4-69d4-linear-30131-v4/model-00002-of-00006.safetensors
Job chaiml-02f4-69d4-linear-30131-v4-uploader completed after 133.99s with status: succeeded
Stopping job with name chaiml-02f4-69d4-linear-30131-v4-uploader
Pipeline stage VLLMUploader completed in 134.45s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.14s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-02f4-69d4-linear-30131-v4
Waiting for inference service chaiml-02f4-69d4-linear-30131-v4 to be ready
HTTP Request: %s %s "%s %d %s"
Inference service chaiml-02f4-69d4-linear-30131-v4 ready after 170.8380470275879s
Pipeline stage VLLMDeployer completed in 171.54s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.5100317001342773s
Received healthy response to inference request in 1.4879562854766846s
Received healthy response to inference request in 1.394521951675415s
Received healthy response to inference request in 1.5271315574645996s
Received healthy response to inference request in 1.3705825805664062s
Received healthy response to inference request in 1.6843559741973877s
Received healthy response to inference request in 1.4174528121948242s
Received healthy response to inference request in 1.4442968368530273s
Received healthy response to inference request in 1.494788646697998s
Received healthy response to inference request in 1.4876580238342285s
Received healthy response to inference request in 1.3876960277557373s
Received healthy response to inference request in 1.450622797012329s
Received healthy response to inference request in 1.6466553211212158s
Received healthy response to inference request in 1.5418431758880615s
Received healthy response to inference request in 1.9251134395599365s
Received healthy response to inference request in 1.743302583694458s
Received healthy response to inference request in 1.534841775894165s
Received healthy response to inference request in 1.4272739887237549s
Received healthy response to inference request in 1.4496006965637207s
Received healthy response to inference request in 1.646878957748413s
Received healthy response to inference request in 2.088557004928589s
Received healthy response to inference request in 1.5758919715881348s
Received healthy response to inference request in 1.762779951095581s
Received healthy response to inference request in 1.4674153327941895s
Received healthy response to inference request in 1.4290077686309814s
Received healthy response to inference request in 1.4078192710876465s
Received healthy response to inference request in 1.4733448028564453s
Received healthy response to inference request in 1.6356501579284668s
Received healthy response to inference request in 1.7023189067840576s
Received healthy response to inference request in 1.4346511363983154s
30 requests
0 failed requests
5th percentile: 1.3907676935195923
10th percentile: 1.4064895391464234
20th percentile: 1.428661012649536
30th percentile: 1.4480095386505127
40th percentile: 1.470973014831543
50th percentile: 1.4913724660873413
60th percentile: 1.5302156448364257
70th percentile: 1.5938194274902342
80th percentile: 1.654374361038208
90th percentile: 1.7452503204345704
95th percentile: 1.852063369750976
99th percentile: 2.04115837097168
mean time: 1.5516680479049683
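The percentile and mean figures above can be reproduced from the 30 logged response times. The sketch below assumes linear interpolation between order statistics (the default behavior of `numpy.percentile`), which matches the reported values; the StressChecker's actual implementation is not shown in this log, and the helper name `percentile` is hypothetical.

```python
import math

# Response times in seconds, copied in order from the StressChecker lines.
LATENCIES = [
    1.5100317001342773, 1.4879562854766846, 1.394521951675415,
    1.5271315574645996, 1.3705825805664062, 1.6843559741973877,
    1.4174528121948242, 1.4442968368530273, 1.494788646697998,
    1.4876580238342285, 1.3876960277557373, 1.450622797012329,
    1.6466553211212158, 1.5418431758880615, 1.9251134395599365,
    1.743302583694458, 1.534841775894165, 1.4272739887237549,
    1.4496006965637207, 1.646878957748413, 2.088557004928589,
    1.5758919715881348, 1.762779951095581, 1.4674153327941895,
    1.4290077686309814, 1.4078192710876465, 1.4733448028564453,
    1.6356501579284668, 1.7023189067840576, 1.4346511363983154,
]


def percentile(values, p):
    """Return the p-th percentile using linear interpolation."""
    ordered = sorted(values)
    rank = (p / 100) * (len(ordered) - 1)      # fractional index into ordered
    lower = math.floor(rank)
    upper = min(lower + 1, len(ordered) - 1)   # clamp at the last element
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


for p in (5, 50, 90, 99):
    print(f"{p}th percentile: {percentile(LATENCIES, p):.6f}")
print(f"mean time: {sum(LATENCIES) / len(LATENCIES):.6f}")
```

With 30 samples, the 99th percentile falls between the two slowest requests (1.925 s and 2.089 s), so it is sensitive to a single outlier.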
Pipeline stage StressChecker completed in 51.49s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.86s
Shutdown handler de-registered
chaiml-02f4-69d4-linear_30131_v4 status is now deployed due to DeploymentManager action
chaiml-02f4-69d4-linear_30131_v4 status is now inactive due to auto deactivation removed underperforming models
chaiml-02f4-69d4-linear_30131_v4 status is now torndown due to DeploymentManager action