developer_uid: chai_backend_admin
submission_id: chaiml-4d70-fd43-linear_51732_v2
model_name: chaiml-4d70-fd43-linear_51732_v2
model_group: ChaiML/4d70-fd43-linear-
status: torndown
timestamp: 2026-02-10T18:41:17+00:00
num_battles: 10709
num_wins: 5425
celo_rating: 1308.66
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/4d70-fd43-linear-w01-FP8
model_architecture: MistralForCausalLM
model_num_parameters: 12772070400.0
best_of: 8
max_input_tokens: 1800
max_output_tokens: 74
reward_model: default
display_name: chaiml-4d70-fd43-linear_51732_v2
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/4d70-fd43-linear-w01-FP8
model_size: 13B
ranking_group: single
us_pacific_date: 2026-02-07
win_ratio: 0.5065832477355495
generation_params: {'temperature': 0.85, 'top_p': 1.0, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.08, 'frequency_penalty': 0.08, 'stopping_words': ['</s>', '###', '\n', '<|im_end|>', '<|im_start|>'], 'max_input_tokens': 1800, 'best_of': 8, 'max_output_tokens': 74}
formatter: {'memory_template': '', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': True}
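
Note: the formatter above can be read as a recipe for assembling the model input. The following is a minimal sketch of that assembly, not the pipeline's actual implementation. The `build_prompt` and `count_tokens` names are hypothetical, the whitespace token counter is a stand-in for a real tokenizer, and the interpretation of `truncate_by_message` (drop the oldest whole messages until the input fits `max_input_tokens`) is an assumption. The empty `memory_template` and `prompt_template` contribute nothing for this submission and are omitted.

```python
# Formatter values copied from the submission metadata.
FORMATTER = {
    "memory_template": "",
    "prompt_template": "",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
    "truncate_by_message": True,
}


def count_tokens(text):
    # Hypothetical stand-in for a real tokenizer: counts whitespace-separated words.
    return len(text.split())


def build_prompt(turns, bot_name, user_name, max_input_tokens=1800):
    """Render a conversation into a single prompt string.

    turns is a list of (speaker, message) pairs, speaker being "bot" or "user".
    """
    rendered = []
    for speaker, message in turns:
        if speaker == "bot":
            rendered.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            rendered.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    suffix = FORMATTER["response_template"].format(bot_name=bot_name)
    # Assumed meaning of truncate_by_message: remove whole messages, oldest
    # first, until the prompt fits the input budget.
    while rendered and count_tokens("".join(rendered) + suffix) > max_input_tokens:
        rendered.pop(0)
    return "".join(rendered) + suffix


example = build_prompt([("user", "Hi"), ("bot", "Hello")], "Bot", "Ann")
```

The trailing `{bot_name}:` with no newline leaves the model to complete the bot's next line, which is consistent with `'\n'` appearing in `stopping_words`.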
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-4d70-fd43-linear-51732-v2-uploader
Waiting for job on chaiml-4d70-fd43-linear-51732-v2-uploader to finish
chaiml-4d70-fd43-linear-51732-v2-uploader: Using quantization_mode: none
chaiml-4d70-fd43-linear-51732-v2-uploader: Downloading snapshot of ChaiML/4d70-fd43-linear-w01-FP8...
chaiml-4d70-fd43-linear-51732-v2-uploader: Fetching 12 files: 100%|██████████| 12/12 [00:08<00:00, 1.44it/s]
chaiml-4d70-fd43-linear-51732-v2-uploader: Downloaded in 8.465s
chaiml-4d70-fd43-linear-51732-v2-uploader: Processed model ChaiML/4d70-fd43-linear-w01-FP8 in 13.461s
chaiml-4d70-fd43-linear-51732-v2-uploader: creating bucket guanaco-vllm-models
chaiml-4d70-fd43-linear-51732-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-4d70-fd43-linear-51732-v2-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-4d70-fd43-linear-51732-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-4d70-fd43-linear-51732-v2-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-4d70-fd43-linear-51732-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-4d70-fd43-linear-51732-v2-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-4d70-fd43-linear-51732-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-4d70-fd43-linear-51732-v2-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-4d70-fd43-linear-51732-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-4d70-fd43-linear-51732-v2-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-4d70-fd43-linear-51732-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-4d70-fd43-linear-51732-v2-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-4d70-fd43-linear-51732-v2-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-4d70-fd43-linear-51732-v2-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-4d70-fd43-linear-51732-v2-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-4d70-fd43-linear-51732-v2-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-4d70-fd43-linear-51732-v2-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-4d70-fd43-linear-51732-v2-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v2
chaiml-4d70-fd43-linear-51732-v2-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v2/config.json
chaiml-4d70-fd43-linear-51732-v2-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v2/.gitattributes
chaiml-4d70-fd43-linear-51732-v2-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v2/generation_config.json
chaiml-4d70-fd43-linear-51732-v2-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v2/chat_template.jinja
chaiml-4d70-fd43-linear-51732-v2-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v2/special_tokens_map.json
chaiml-4d70-fd43-linear-51732-v2-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v2/recipe.yaml
chaiml-4d70-fd43-linear-51732-v2-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v2/model.safetensors.index.json
chaiml-4d70-fd43-linear-51732-v2-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v2/tokenizer_config.json
chaiml-4d70-fd43-linear-51732-v2-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v2/tokenizer.json
chaiml-4d70-fd43-linear-51732-v2-uploader: cp /dev/shm/model_output/model-00002-of-00003.safetensors s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v2/model-00002-of-00003.safetensors
chaiml-4d70-fd43-linear-51732-v2-uploader: cp /dev/shm/model_output/model-00001-of-00003.safetensors s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v2/model-00001-of-00003.safetensors
Job chaiml-4d70-fd43-linear-51732-v2-uploader completed after 116.09s with status: succeeded
Stopping job with name chaiml-4d70-fd43-linear-51732-v2-uploader
Pipeline stage VLLMUploader completed in 124.00s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 4.68s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-4d70-fd43-linear-51732-v2
Waiting for inference service chaiml-4d70-fd43-linear-51732-v2 to be ready
Inference service chaiml-4d70-fd43-linear-51732-v2 ready after 170.67375659942627s
Pipeline stage VLLMDeployer completed in 179.09s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.5461361408233643s
Received healthy response to inference request in 1.2579703330993652s
Received healthy response to inference request in 1.3516578674316406s
Received healthy response to inference request in 1.310131311416626s
Received healthy response to inference request in 1.328965663909912s
Received healthy response to inference request in 1.1109795570373535s
Received healthy response to inference request in 1.777043342590332s
Received healthy response to inference request in 1.6044731140136719s
Received healthy response to inference request in 1.6699223518371582s
Received healthy response to inference request in 3.078413963317871s
Received healthy response to inference request in 2.5614051818847656s
Received healthy response to inference request in 2.3604960441589355s
Received healthy response to inference request in 2.0938987731933594s
Received healthy response to inference request in 1.6102180480957031s
Received healthy response to inference request in 1.0260241031646729s
Received healthy response to inference request in 1.8357031345367432s
Received healthy response to inference request in 1.7619502544403076s
Received healthy response to inference request in 3.155703544616699s
Received healthy response to inference request in 3.1746554374694824s
Received healthy response to inference request in 2.2683560848236084s
Received healthy response to inference request in 3.3719534873962402s
Received healthy response to inference request in 1.6961967945098877s
Received healthy response to inference request in 2.0202982425689697s
Received healthy response to inference request in 2.0039641857147217s
Received healthy response to inference request in 2.3637914657592773s
Received healthy response to inference request in 1.9407427310943604s
Received healthy response to inference request in 1.9646296501159668s
Received healthy response to inference request in 1.7284562587738037s
Received healthy response to inference request in 1.4826292991638184s
Received healthy response to inference request in 1.8471858501434326s
30 requests
0 failed requests
5th percentile: 1.1771254062652587
10th percentile: 1.3049152135848998
20th percentile: 1.4564350128173829
30th percentile: 1.6084945678710938
40th percentile: 1.7155524730682374
50th percentile: 1.8063732385635376
60th percentile: 1.9502974987030028
70th percentile: 2.0423784017562863
80th percentile: 2.361155128479004
90th percentile: 3.086142921447754
95th percentile: 3.16612708568573
99th percentile: 3.3147370529174807
mean time: 1.9434650739034016
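
Note: the percentile and mean figures above are consistent with linear interpolation over the 30 logged response times. The following is a minimal sketch that recomputes them from the logged values. The `percentile` helper is hypothetical and written here for illustration; whether the StressChecker uses this method or a library routine is an assumption.

```python
import math

# Response times in seconds, copied in order from the StressChecker log lines.
latencies = [
    1.5461361408233643, 1.2579703330993652, 1.3516578674316406,
    1.310131311416626, 1.328965663909912, 1.1109795570373535,
    1.777043342590332, 1.6044731140136719, 1.6699223518371582,
    3.078413963317871, 2.5614051818847656, 2.3604960441589355,
    2.0938987731933594, 1.6102180480957031, 1.0260241031646729,
    1.8357031345367432, 1.7619502544403076, 3.155703544616699,
    3.1746554374694824, 2.2683560848236084, 3.3719534873962402,
    1.6961967945098877, 2.0202982425689697, 2.0039641857147217,
    2.3637914657592773, 1.9407427310943604, 1.9646296501159668,
    1.7284562587738037, 1.4826292991638184, 1.8471858501434326,
]


def percentile(samples, q):
    """Linearly interpolated percentile (q in [0, 100]) of a non-empty list."""
    ordered = sorted(samples)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


summary = {q: percentile(latencies, q) for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99)}
mean_time = sum(latencies) / len(latencies)

for q, value in summary.items():
    print(f"{q}th percentile: {value}")
print(f"mean time: {mean_time}")
```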
Pipeline stage StressChecker completed in 109.12s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 9.58s
Shutdown handler de-registered
chaiml-4d70-fd43-linear_51732_v2 status is now deployed due to DeploymentManager action
chaiml-4d70-fd43-linear_51732_v2 status is now inactive due to auto deactivation removed underperforming models
chaiml-4d70-fd43-linear_51732_v2 status is now torndown due to DeploymentManager action