developer_uid: chai_backend_admin
submission_id: chaiml-2fe5-c13f-linear_57126_v2
model_name: chaiml-2fe5-c13f-linear_57126_v2
model_group: ChaiML/2fe5-c13f-linear-
status: torndown
timestamp: 2026-02-10T18:31:41+00:00
num_battles: 11128
num_wins: 5479
celo_rating: 1300.29
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/2fe5-c13f-linear-w01-FP8
model_architecture: MistralForCausalLM
model_num_parameters: 12772070400.0
best_of: 10
max_input_tokens: 1024
max_output_tokens: 64
reward_model: default
display_name: chaiml-2fe5-c13f-linear_57126_v2
is_internal_developer: True
language_model: ChaiML/2fe5-c13f-linear-w01-FP8
model_size: 13B
ranking_group: single
us_pacific_date: 2026-02-07
win_ratio: 0.49236161035226456
generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['</s>', '####', '<|im_end|>', 'User:', 'You:', 'Bot:', '\n', '<|eot_id|>'], 'max_input_tokens': 1024, 'best_of': 10, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-2fe5-c13f-linear-57126-v2-uploader
Waiting for job on chaiml-2fe5-c13f-linear-57126-v2-uploader to finish
%s, retrying in %s seconds...
chaiml-2fe5-c13f-linear-57126-v2-uploader: Using quantization_mode: none
chaiml-2fe5-c13f-linear-57126-v2-uploader: Downloading snapshot of ChaiML/2fe5-c13f-linear-w01-FP8...
chaiml-2fe5-c13f-linear-57126-v2-uploader: Fetching 12 files: 0%| | 0/12 [00:00<?, ?it/s] Fetching 12 files: 8%|▊ | 1/12 [00:00<00:03, 3.54it/s] Fetching 12 files: 42%|████▏ | 5/12 [00:09<00:13, 1.93s/it] Fetching 12 files: 100%|██████████| 12/12 [00:09<00:00, 1.31it/s]
chaiml-2fe5-c13f-linear-57126-v2-uploader: Downloaded in 9.281s
chaiml-2fe5-c13f-linear-57126-v2-uploader: Processed model ChaiML/2fe5-c13f-linear-w01-FP8 in 14.686s
chaiml-2fe5-c13f-linear-57126-v2-uploader: creating bucket guanaco-vllm-models
chaiml-2fe5-c13f-linear-57126-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-2fe5-c13f-linear-57126-v2-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-2fe5-c13f-linear-57126-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-2fe5-c13f-linear-57126-v2-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-2fe5-c13f-linear-57126-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-2fe5-c13f-linear-57126-v2-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-2fe5-c13f-linear-57126-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-2fe5-c13f-linear-57126-v2-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-2fe5-c13f-linear-57126-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-2fe5-c13f-linear-57126-v2-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-2fe5-c13f-linear-57126-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-2fe5-c13f-linear-57126-v2-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-2fe5-c13f-linear-57126-v2-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-2fe5-c13f-linear-57126-v2-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-2fe5-c13f-linear-57126-v2-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-2fe5-c13f-linear-57126-v2-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-2fe5-c13f-linear-57126-v2-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-2fe5-c13f-linear-57126-v2-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-2fe5-c13f-linear-57126-v2
chaiml-2fe5-c13f-linear-57126-v2-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-2fe5-c13f-linear-57126-v2/.gitattributes
chaiml-2fe5-c13f-linear-57126-v2-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-2fe5-c13f-linear-57126-v2/config.json
chaiml-2fe5-c13f-linear-57126-v2-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-2fe5-c13f-linear-57126-v2/generation_config.json
chaiml-2fe5-c13f-linear-57126-v2-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-2fe5-c13f-linear-57126-v2/special_tokens_map.json
chaiml-2fe5-c13f-linear-57126-v2-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-2fe5-c13f-linear-57126-v2/recipe.yaml
chaiml-2fe5-c13f-linear-57126-v2-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-2fe5-c13f-linear-57126-v2/chat_template.jinja
chaiml-2fe5-c13f-linear-57126-v2-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-2fe5-c13f-linear-57126-v2/model.safetensors.index.json
chaiml-2fe5-c13f-linear-57126-v2-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-2fe5-c13f-linear-57126-v2/tokenizer_config.json
chaiml-2fe5-c13f-linear-57126-v2-uploader: cp /dev/shm/model_output/model-00003-of-00003.safetensors s3://guanaco-vllm-models/chaiml-2fe5-c13f-linear-57126-v2/model-00003-of-00003.safetensors
chaiml-2fe5-c13f-linear-57126-v2-uploader: cp /dev/shm/model_output/model-00001-of-00003.safetensors s3://guanaco-vllm-models/chaiml-2fe5-c13f-linear-57126-v2/model-00001-of-00003.safetensors
chaiml-2fe5-c13f-linear-57126-v2-uploader: cp /dev/shm/model_output/model-00002-of-00003.safetensors s3://guanaco-vllm-models/chaiml-2fe5-c13f-linear-57126-v2/model-00002-of-00003.safetensors
Job chaiml-2fe5-c13f-linear-57126-v2-uploader completed after 180.99s with status: succeeded
Stopping job with name chaiml-2fe5-c13f-linear-57126-v2-uploader
Pipeline stage VLLMUploader completed in 183.87s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.38s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-2fe5-c13f-linear-57126-v2
Waiting for inference service chaiml-2fe5-c13f-linear-57126-v2 to be ready
Inference service chaiml-2fe5-c13f-linear-57126-v2 ready after 160.73598790168762s
Pipeline stage VLLMDeployer completed in 163.23s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2454230785369873s
Received healthy response to inference request in 1.6666607856750488s
Received healthy response to inference request in 2.2458953857421875s
Received healthy response to inference request in 3.0615789890289307s
Received healthy response to inference request in 1.8413233757019043s
Received healthy response to inference request in 1.8714182376861572s
Received healthy response to inference request in 1.4999055862426758s
Received healthy response to inference request in 2.0841779708862305s
Received healthy response to inference request in 1.2383308410644531s
Received healthy response to inference request in 1.4297354221343994s
Received healthy response to inference request in 2.0740787982940674s
Received healthy response to inference request in 2.561772346496582s
Received healthy response to inference request in 2.09598445892334s
Received healthy response to inference request in 2.7856106758117676s
Received healthy response to inference request in 3.252784490585327s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 2.853729724884033s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 2.462613344192505s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 1.3504548072814941s
Received healthy response to inference request in 2.0305042266845703s
Received healthy response to inference request in 2.199293613433838s
Received healthy response to inference request in 2.2328453063964844s
Received healthy response to inference request in 1.5829825401306152s
Received healthy response to inference request in 2.5337717533111572s
Received healthy response to inference request in 1.306260585784912s
Received healthy response to inference request in 1.2832696437835693s
Received healthy response to inference request in 1.8572382926940918s
Received healthy response to inference request in 1.8298068046569824s
Received healthy response to inference request in 1.3697021007537842s
Received healthy response to inference request in 1.4978041648864746s
Received healthy response to inference request in 1.5173885822296143s
30 requests
0 failed requests
5th percentile: 1.2936155676841736
10th percentile: 1.3460353851318358
20th percentile: 1.4841904163360595
30th percentile: 1.563304352760315
40th percentile: 1.8367167472839356
50th percentile: 1.9509612321853638
60th percentile: 2.088900566101074
70th percentile: 2.2366186380386353
80th percentile: 2.4768450260162354
90th percentile: 2.7924225807189944
95th percentile: 2.968046820163726
99th percentile: 3.1973348951339724
mean time: 1.9954115311304728
Pipeline stage StressChecker completed in 123.57s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.64s
Shutdown handler de-registered
chaiml-2fe5-c13f-linear_57126_v2 status is now deployed due to DeploymentManager action
chaiml-2fe5-c13f-linear_57126_v2 status is now inactive due to auto deactivation removed underperforming models
chaiml-2fe5-c13f-linear_57126_v2 status is now torndown due to DeploymentManager action