developer_uid: chai_backend_admin
submission_id: chaiml-4d70-fd43-linear_51732_v7
model_name: chaiml-4d70-fd43-linear_51732_v7
model_group: ChaiML/4d70-fd43-linear-
status: torndown
timestamp: 2026-03-08T18:17:11+00:00
num_battles: 3237
num_wins: 1626
celo_rating: 9299.83
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/4d70-fd43-linear-w01-FP8
model_architecture: MistralForCausalLM
model_num_parameters: 12772070400.0
best_of: 8
max_input_tokens: 1800
max_output_tokens: 74
reward_model: default
display_name: chaiml-4d70-fd43-linear_51732_v7
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/4d70-fd43-linear-w01-FP8
model_size: 13B
ranking_group: single
us_pacific_date: 2026-03-05
win_ratio: 0.5023169601482854
generation_params: {'temperature': 0.85, 'top_p': 1.0, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.08, 'frequency_penalty': 0.08, 'stopping_words': ['###', '<|im_end|>', '</s>', '<|im_start|>', '\n'], 'max_input_tokens': 1800, 'best_of': 8, 'max_output_tokens': 74}
formatter: {'memory_template': '', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-4d70-fd43-linear-51732-v7-uploader
Waiting for job on chaiml-4d70-fd43-linear-51732-v7-uploader to finish
chaiml-4d70-fd43-linear-51732-v7-uploader: Using quantization_mode: fp8
chaiml-4d70-fd43-linear-51732-v7-uploader: Repo ChaiML/4d70-fd43-linear-w01-FP8 already ends in FP8. Skipping...
chaiml-4d70-fd43-linear-51732-v7-uploader: Checking if ChaiML/4d70-fd43-linear-w01-FP8 already exists in ChaiML
chaiml-4d70-fd43-linear-51732-v7-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-4d70-fd43-linear-51732-v7-uploader: Downloading snapshot of ChaiML/4d70-fd43-linear-w01-FP8...
chaiml-4d70-fd43-linear-51732-v7-uploader: Downloaded in 8.832s
chaiml-4d70-fd43-linear-51732-v7-uploader: Processed model ChaiML/4d70-fd43-linear-w01-FP8 in 12.312s
chaiml-4d70-fd43-linear-51732-v7-uploader: creating bucket guanaco-vllm-models
chaiml-4d70-fd43-linear-51732-v7-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-4d70-fd43-linear-51732-v7-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-4d70-fd43-linear-51732-v7-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-4d70-fd43-linear-51732-v7-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-4d70-fd43-linear-51732-v7-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-4d70-fd43-linear-51732-v7-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-4d70-fd43-linear-51732-v7-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-4d70-fd43-linear-51732-v7-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-4d70-fd43-linear-51732-v7-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-4d70-fd43-linear-51732-v7-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-4d70-fd43-linear-51732-v7-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-4d70-fd43-linear-51732-v7-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-4d70-fd43-linear-51732-v7-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-4d70-fd43-linear-51732-v7-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-4d70-fd43-linear-51732-v7-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-4d70-fd43-linear-51732-v7-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-4d70-fd43-linear-51732-v7-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-4d70-fd43-linear-51732-v7-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v7/default
chaiml-4d70-fd43-linear-51732-v7-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v7/default/recipe.yaml
chaiml-4d70-fd43-linear-51732-v7-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v7/default/special_tokens_map.json
chaiml-4d70-fd43-linear-51732-v7-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v7/default/generation_config.json
chaiml-4d70-fd43-linear-51732-v7-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v7/default/.gitattributes
chaiml-4d70-fd43-linear-51732-v7-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v7/default/model.safetensors.index.json
chaiml-4d70-fd43-linear-51732-v7-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v7/default/tokenizer_config.json
chaiml-4d70-fd43-linear-51732-v7-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v7/default/chat_template.jinja
chaiml-4d70-fd43-linear-51732-v7-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v7/default/config.json
chaiml-4d70-fd43-linear-51732-v7-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v7/default/tokenizer.json
chaiml-4d70-fd43-linear-51732-v7-uploader: cp /dev/shm/model_output/model-00002-of-00003.safetensors s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v7/default/model-00002-of-00003.safetensors
chaiml-4d70-fd43-linear-51732-v7-uploader: cp /dev/shm/model_output/model-00001-of-00003.safetensors s3://guanaco-vllm-models/chaiml-4d70-fd43-linear-51732-v7/default/model-00001-of-00003.safetensors
Job chaiml-4d70-fd43-linear-51732-v7-uploader completed after 72.73s with status: succeeded
Stopping job with name chaiml-4d70-fd43-linear-51732-v7-uploader
Pipeline stage VLLMUploader completed in 73.71s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.92s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-4d70-fd43-linear-51732-v7
Waiting for inference service chaiml-4d70-fd43-linear-51732-v7 to be ready
Inference service chaiml-4d70-fd43-linear-51732-v7 ready after 120.16124892234802s
Pipeline stage VLLMDeployer completed in 120.58s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.7204606533050537s
Received healthy response to inference request in 2.142700433731079s
Received healthy response to inference request in 2.023728847503662s
Received healthy response to inference request in 2.181511878967285s
Received healthy response to inference request in 2.069551467895508s
Received healthy response to inference request in 1.9097299575805664s
Received healthy response to inference request in 2.334888219833374s
Received healthy response to inference request in 1.907928466796875s
Received healthy response to inference request in 1.997619867324829s
Received healthy response to inference request in 1.9952712059020996s
Received healthy response to inference request in 1.7994301319122314s
Received healthy response to inference request in 1.844081163406372s
Received healthy response to inference request in 1.9344661235809326s
Received healthy response to inference request in 1.819368600845337s
Received healthy response to inference request in 2.2854490280151367s
Received healthy response to inference request in 1.8783454895019531s
Received healthy response to inference request in 1.9223463535308838s
Received healthy response to inference request in 2.025028705596924s
Received healthy response to inference request in 2.184422731399536s
Received healthy response to inference request in 4.103582143783569s
Received healthy response to inference request in 1.995152235031128s
Received healthy response to inference request in 1.9843380451202393s
Received healthy response to inference request in 1.9688115119934082s
Received healthy response to inference request in 1.9028639793395996s
Received healthy response to inference request in 2.485758066177368s
Received healthy response to inference request in 1.8918523788452148s
Received healthy response to inference request in 1.8293540477752686s
Received healthy response to inference request in 2.1349663734436035s
Received healthy response to inference request in 2.5014634132385254s
Received healthy response to inference request in 1.8539376258850098s
30 requests
0 failed requests
5th percentile: 1.8238620519638062
10th percentile: 1.8426084518432617
20th percentile: 1.8891510009765624
30th percentile: 1.909189510345459
40th percentile: 1.955073356628418
50th percentile: 1.9952117204666138
60th percentile: 2.024248790740967
70th percentile: 2.137286591529846
80th percentile: 2.2046279907226567
90th percentile: 2.487328600883484
95th percentile: 2.6219118952751153
99th percentile: 3.702476911544801
mean time: 2.1209469715754192
Pipeline stage StressChecker completed in 67.78s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.64s
Shutdown handler de-registered
chaiml-4d70-fd43-linear_51732_v7 status is now deployed due to DeploymentManager action
chaiml-4d70-fd43-linear_51732_v7 status is now inactive due to system request
chaiml-4d70-fd43-linear_51732_v7 status is now torndown due to DeploymentManager action