developer_uid: richhx
submission_id: chaiml-7b07-69d4-linear_27339_v9
model_name: chaiml-7b07-69d4-linear_27339_v9
model_group: ChaiML/7b07-69d4-linear-
status: inactive
timestamp: 2026-03-05T18:57:12+00:00
num_battles: 2027
num_wins: 955
celo_rating: 9276.65
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/7b07-69d4-linear-w01-FP8
model_architecture: MistralForCausalLM
model_num_parameters: 24096691200.0
best_of: 6
max_input_tokens: 1440
max_output_tokens: 60
reward_model: default
display_name: chaiml-7b07-69d4-linear_27339_v9
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/7b07-69d4-linear-w01-FP8
model_size: 24B
ranking_group: single
us_pacific_date: 2026-03-05
win_ratio: 0.47113961519486924
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.1, 'frequency_penalty': 0.1, 'stopping_words': ['####', '</s>', 'You:', '####\n', '\n'], 'max_input_tokens': 1440, 'best_of': 6, 'max_output_tokens': 60}
formatter: {'memory_template': '', 'prompt_template': '', 'bot_template': '{bot_name}: {message}</s>\n', 'user_template': 'You: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-7b07-69d4-linear-27339-v9-uploader
Waiting for job on chaiml-7b07-69d4-linear-27339-v9-uploader to finish
chaiml-7b07-69d4-linear-27339-v9-uploader: Using quantization_mode: fp8
chaiml-7b07-69d4-linear-27339-v9-uploader: Repo ChaiML/7b07-69d4-linear-w01-FP8 already ends in FP8. Skipping...
chaiml-7b07-69d4-linear-27339-v9-uploader: Checking if ChaiML/7b07-69d4-linear-w01-FP8 already exists in ChaiML
chaiml-7b07-69d4-linear-27339-v9-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-7b07-69d4-linear-27339-v9-uploader: Downloading snapshot of ChaiML/7b07-69d4-linear-w01-FP8...
chaiml-7b07-69d4-linear-27339-v9-uploader: Downloaded in 12.976s
chaiml-7b07-69d4-linear-27339-v9-uploader: Processed model ChaiML/7b07-69d4-linear-w01-FP8 in 16.528s
chaiml-7b07-69d4-linear-27339-v9-uploader: creating bucket guanaco-vllm-models
chaiml-7b07-69d4-linear-27339-v9-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-7b07-69d4-linear-27339-v9-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-7b07-69d4-linear-27339-v9-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-7b07-69d4-linear-27339-v9-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-7b07-69d4-linear-27339-v9-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-7b07-69d4-linear-27339-v9-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-7b07-69d4-linear-27339-v9-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-7b07-69d4-linear-27339-v9-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-7b07-69d4-linear-27339-v9-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-7b07-69d4-linear-27339-v9-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-7b07-69d4-linear-27339-v9-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-7b07-69d4-linear-27339-v9-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-7b07-69d4-linear-27339-v9-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-7b07-69d4-linear-27339-v9-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-7b07-69d4-linear-27339-v9-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-7b07-69d4-linear-27339-v9-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-7b07-69d4-linear-27339-v9-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-7b07-69d4-linear-27339-v9-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-27339-v9/default
chaiml-7b07-69d4-linear-27339-v9-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-27339-v9/default/.gitattributes
chaiml-7b07-69d4-linear-27339-v9-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-27339-v9/default/model.safetensors.index.json
chaiml-7b07-69d4-linear-27339-v9-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-27339-v9/default/tokenizer_config.json
chaiml-7b07-69d4-linear-27339-v9-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-27339-v9/default/special_tokens_map.json
chaiml-7b07-69d4-linear-27339-v9-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-27339-v9/default/recipe.yaml
chaiml-7b07-69d4-linear-27339-v9-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-27339-v9/default/generation_config.json
chaiml-7b07-69d4-linear-27339-v9-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-27339-v9/default/config.json
chaiml-7b07-69d4-linear-27339-v9-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-27339-v9/default/tokenizer.json
chaiml-7b07-69d4-linear-27339-v9-uploader: cp /dev/shm/model_output/model-00006-of-00006.safetensors s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-27339-v9/default/model-00006-of-00006.safetensors
chaiml-7b07-69d4-linear-27339-v9-uploader: cp /dev/shm/model_output/model-00002-of-00006.safetensors s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-27339-v9/default/model-00002-of-00006.safetensors
chaiml-7b07-69d4-linear-27339-v9-uploader: cp /dev/shm/model_output/model-00001-of-00006.safetensors s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-27339-v9/default/model-00001-of-00006.safetensors
chaiml-7b07-69d4-linear-27339-v9-uploader: cp /dev/shm/model_output/model-00004-of-00006.safetensors s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-27339-v9/default/model-00004-of-00006.safetensors
chaiml-7b07-69d4-linear-27339-v9-uploader: cp /dev/shm/model_output/model-00003-of-00006.safetensors s3://guanaco-vllm-models/chaiml-7b07-69d4-linear-27339-v9/default/model-00003-of-00006.safetensors
Job chaiml-7b07-69d4-linear-27339-v9-uploader completed after 74.14s with status: succeeded
Stopping job with name chaiml-7b07-69d4-linear-27339-v9-uploader
Pipeline stage VLLMUploader completed in 74.65s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 2.37s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-7b07-69d4-linear-27339-v9
Waiting for inference service chaiml-7b07-69d4-linear-27339-v9 to be ready
Inference service chaiml-7b07-69d4-linear-27339-v9 ready after 150.8546862602234s
Pipeline stage VLLMDeployer completed in 151.36s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.8377630710601807s
Received healthy response to inference request in 2.5918097496032715s
Received healthy response to inference request in 2.6304996013641357s
Received healthy response to inference request in 2.6635537147521973s
Received healthy response to inference request in 2.53558349609375s
Received healthy response to inference request in 2.6258082389831543s
Received healthy response to inference request in 3.266758918762207s
Received healthy response to inference request in 3.258836269378662s
Received healthy response to inference request in 2.9084999561309814s
Received healthy response to inference request in 2.8678388595581055s
Received healthy response to inference request in 2.6391243934631348s
Received healthy response to inference request in 2.8511979579925537s
Received healthy response to inference request in 2.6807472705841064s
read tcp 127.0.0.1:41244->127.0.0.1:8080: read: connection reset by peer
Received unhealthy response to inference request!
Received healthy response to inference request in 2.673981189727783s
Received healthy response to inference request in 2.884302854537964s
Received healthy response to inference request in 2.7693264484405518s
Received healthy response to inference request in 2.5416157245635986s
Received healthy response to inference request in 2.5293562412261963s
Received healthy response to inference request in 2.577733039855957s
Received healthy response to inference request in 3.1252026557922363s
Received healthy response to inference request in 2.7881369590759277s
Received healthy response to inference request in 2.6236493587493896s
Received healthy response to inference request in 2.7598681449890137s
Received healthy response to inference request in 2.5360465049743652s
Received healthy response to inference request in 2.8825109004974365s
Received healthy response to inference request in 2.6033639907836914s
Received healthy response to inference request in 2.5793657302856445s
Received healthy response to inference request in 2.58083438873291s
Received healthy response to inference request in 2.550159215927124s
30 requests
1 failed requests
5th percentile: 2.5321585059165956
10th percentile: 2.536000204086304
20th percentile: 2.5722182750701905
30th percentile: 2.5885171413421633
40th percentile: 2.6249446868896484
50th percentile: 2.651339054107666
60th percentile: 2.712395620346069
70th percentile: 2.8030247926712035
80th percentile: 2.870773267745972
90th percentile: 2.930170226097107
95th percentile: 3.19870114326477
99th percentile: 3.264461350440979
mean time: 2.6500701904296875
%s, retrying in %s seconds...
Received healthy response to inference request in 2.6493475437164307s
Received healthy response to inference request in 2.985229253768921s
Received healthy response to inference request in 2.804115056991577s
Received healthy response to inference request in 3.3471240997314453s
Received healthy response to inference request in 2.6172537803649902s
Received healthy response to inference request in 3.028465986251831s
Received healthy response to inference request in 2.6089205741882324s
Received healthy response to inference request in 2.531794786453247s
Received healthy response to inference request in 2.6159188747406006s
Received healthy response to inference request in 3.3613245487213135s
Received healthy response to inference request in 2.654219627380371s
Received healthy response to inference request in 2.5227460861206055s
Received healthy response to inference request in 2.7628185749053955s
Received healthy response to inference request in 2.604494094848633s
Received healthy response to inference request in 2.640110492706299s
Received healthy response to inference request in 2.6314265727996826s
Received healthy response to inference request in 2.9434075355529785s
Received healthy response to inference request in 3.079052448272705s
Received healthy response to inference request in 2.5891647338867188s
Received healthy response to inference request in 2.589336395263672s
Received healthy response to inference request in 2.818685531616211s
Received healthy response to inference request in 3.08766770362854s
Received healthy response to inference request in 2.676941394805908s
Received healthy response to inference request in 2.5078365802764893s
Received healthy response to inference request in 2.5208263397216797s
Received healthy response to inference request in 2.840752363204956s
Received healthy response to inference request in 2.885160207748413s
Received healthy response to inference request in 3.016618251800537s
Received healthy response to inference request in 2.477294921875s
Received healthy response to inference request in 3.0603091716766357s
30 requests
0 failed requests
5th percentile: 2.513681972026825
10th percentile: 2.522554111480713
20th percentile: 2.5893020629882812
30th percentile: 2.61381938457489
40th percentile: 2.6366369247436525
50th percentile: 2.6655805110931396
60th percentile: 2.8099432468414305
70th percentile: 2.9026344060897826
80th percentile: 3.018987798690796
90th percentile: 3.0799139738082886
95th percentile: 3.2303687214851373
99th percentile: 3.357206418514252
mean time: 2.7819454511006674
Pipeline stage StressChecker completed in 171.70s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.69s
Shutdown handler de-registered
chaiml-7b07-69d4-linear_27339_v9 status is now deployed due to DeploymentManager action
chaiml-7b07-69d4-linear_27339_v9 status is now inactive due to system request