developer_uid: acehao-chai
submission_id: chaiml-kimid-v5a-q235-l_30124_v3
model_name: chaiml-kimid-v5a-q235-l_30124_v3
model_group: ChaiML/kimid-v5a-q235-lr
status: inactive
timestamp: 2026-02-14T08:18:12+00:00
num_battles: 13784
num_wins: 7059
celo_rating: 1307.59
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/kimid-v5a-q235-lr1e4ep2r64
model_architecture: Qwen3MoeForCausalLM
model_num_parameters: 18790207488.0
best_of: 8
max_input_tokens: 2048
max_output_tokens: 80
reward_model: default
display_name: chaiml-kimid-v5a-q235-l_30124_v3
ineligible_reason: max_output_tokens!=64
is_internal_developer: False
language_model: ChaiML/kimid-v5a-q235-lr1e4ep2r64
model_size: 19B
ranking_group: single
us_pacific_date: 2026-02-14
win_ratio: 0.5121154962275102
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['</think>', '<|user|>', '<|assistant|>', '<|im_end|>', '####', '</s>'], 'max_input_tokens': 2048, 'best_of': 8, 'max_output_tokens': 80}
formatter: {'memory_template': "<|im_start|>system\n{bot_name}'s persona: {memory}<|im_end|>\n", 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-kimid-v5a-q235-l-30124-v3-uploader
Waiting for job on chaiml-kimid-v5a-q235-l-30124-v3-uploader to finish
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Using quantization_mode: w4a16
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Checking if ChaiML/kimid-v5a-q235-lr1e4ep2r64-W4A16 already exists in ChaiML
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Downloading snapshot of ChaiML/kimid-v5a-q235-lr1e4ep2r64...
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Downloaded in 161.391s
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Applying quantization...
chaiml-kimid-v5a-q235-l-30124-v3-uploader: 2026-02-13 21:03:00 WARNING modeling_utils.py L4670: `torch_dtype` is deprecated! Use `dtype` instead!
chaiml-kimid-v5a-q235-l-30124-v3-uploader: 2026-02-13 21:03:16 INFO base.py L366: using torch.bfloat16 for quantization tuning
chaiml-kimid-v5a-q235-l-30124-v3-uploader: 2026-02-13 21:03:21 INFO base.py L1145: start to compute imatrix
chaiml-kimid-v5a-q235-l-30124-v3-uploader: /usr/local/lib/python3.12/dist-packages/torch/backends/cuda/__init__.py:131: UserWarning: Please use the new API settings to control TF32 behavior, such as torch.backends.cudnn.conv.fp32_precision = 'tf32' or torch.backends.cuda.matmul.fp32_precision = 'ieee'. Old settings, e.g, torch.backends.cuda.matmul.allow_tf32 = True, torch.backends.cudnn.allow_tf32 = True, allowTF32CuDNN() and allowTF32CuBLAS() will be deprecated after Pytorch 2.9. Please see https://pytorch.org/docs/main/notes/cuda.html#tensorfloat-32-tf32-on-ampere-and-later-devices (Triggered internally at /pytorch/aten/src/ATen/Context.cpp:80.)
chaiml-kimid-v5a-q235-l-30124-v3-uploader: return torch._C._get_cublas_allow_tf32()
chaiml-kimid-v5a-q235-l-30124-v3-uploader: W0213 21:04:28.387000 7 torch/_dynamo/convert_frame.py:1358] [6/8] torch._dynamo hit config.recompile_limit (8)
chaiml-kimid-v5a-q235-l-30124-v3-uploader: W0213 21:04:28.387000 7 torch/_dynamo/convert_frame.py:1358] [6/8] function: 'forward' (/usr/local/lib/python3.12/dist-packages/transformers/models/qwen3_moe/modeling_qwen3_moe.py:208)
chaiml-kimid-v5a-q235-l-30124-v3-uploader: W0213 21:04:28.387000 7 torch/_dynamo/convert_frame.py:1358] [6/8] last reason: 6/7: self._modules['up_proj'].imatrix_cnt == 1343 # module.imatrix_cnt += input.shape[0] # auto_round/compressors/base.py:1179 in get_imatrix_hook (HINT: torch.compile considers integer attributes of the nn.Module to be static. If you are observing recompilation, you might want to make this integer dynamic using torch._dynamo.config.allow_unspec_int_on_nn_module = True, or convert this integer into a tensor.)
chaiml-kimid-v5a-q235-l-30124-v3-uploader: W0213 21:04:28.387000 7 torch/_dynamo/convert_frame.py:1358] [6/8] To log all recompilation reasons, use TORCH_LOGS="recompiles".
chaiml-kimid-v5a-q235-l-30124-v3-uploader: W0213 21:04:28.387000 7 torch/_dynamo/convert_frame.py:1358] [6/8] To diagnose recompilation issues, see https://pytorch.org/docs/main/torch.compiler_troubleshooting.html
chaiml-kimid-v5a-q235-l-30124-v3-uploader: W0213 21:04:31.413000 7 torch/_dynamo/convert_frame.py:1358] [3/8] torch._dynamo hit config.recompile_limit (8)
chaiml-kimid-v5a-q235-l-30124-v3-uploader: W0213 21:04:31.413000 7 torch/_dynamo/convert_frame.py:1358] [3/8] function: 'forward' (/usr/local/lib/python3.12/dist-packages/transformers/models/qwen3_moe/modeling_qwen3_moe.py:305)
chaiml-kimid-v5a-q235-l-30124-v3-uploader: W0213 21:04:31.413000 7 torch/_dynamo/convert_frame.py:1358] [3/8] last reason: 3/7: self._modules['self_attn']._modules['k_proj'].imatrix_cnt == 56 # module.imatrix_cnt += input.shape[0] # auto_round/compressors/base.py:1179 in get_imatrix_hook (HINT: torch.compile considers integer attributes of the nn.Module to be static. If you are observing recompilation, you might want to make this integer dynamic using torch._dynamo.config.allow_unspec_int_on_nn_module = True, or convert this integer into a tensor.)
chaiml-kimid-v5a-q235-l-30124-v3-uploader: W0213 21:04:31.413000 7 torch/_dynamo/convert_frame.py:1358] [3/8] To log all recompilation reasons, use TORCH_LOGS="recompiles".
chaiml-kimid-v5a-q235-l-30124-v3-uploader: W0213 21:04:31.413000 7 torch/_dynamo/convert_frame.py:1358] [3/8] To diagnose recompilation issues, see https://pytorch.org/docs/main/torch.compiler_troubleshooting.html
chaiml-kimid-v5a-q235-l-30124-v3-uploader: 2026-02-13 21:04:50 WARNING gguf.py L297: please use more data via setting `nsamples` to improve accuracy as calibration activations contain 0
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Retrying (%r) after connection broken by '%r': %s
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Checking if ChaiML/kimid-v5a-q235-lr1e4ep2r64-W4A16 already exists in ChaiML
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Creating repo ChaiML/kimid-v5a-q235-lr1e4ep2r64-W4A16 and uploading /dev/shm/model_output to it
chaiml-kimid-v5a-q235-l-30124-v3-uploader: ---------- 2026-02-13 22:21:37 (0:00:00) ----------
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Files: hashed 11/38 (26.1M/131.9G) | pre-uploaded: 0/1 (0.0/131.9G) (+27 unsure) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Workers: hashing: 27 | get upload mode: 0 | pre-uploading: 1 | committing: 0 | waiting: 98
chaiml-kimid-v5a-q235-l-30124-v3-uploader: ---------------------------------------------------
chaiml-kimid-v5a-q235-l-30124-v3-uploader:       
chaiml-kimid-v5a-q235-l-30124-v3-uploader: ---------- 2026-02-13 22:22:37 (0:01:00) ----------
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 2/28 (1.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 26 | committing: 0 | waiting: 100
chaiml-kimid-v5a-q235-l-30124-v3-uploader: ---------------------------------------------------
chaiml-kimid-v5a-q235-l-30124-v3-uploader:       
chaiml-kimid-v5a-q235-l-30124-v3-uploader: ---------- 2026-02-13 22:23:37 (0:02:00) ----------
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 10/28 (41.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 18 | committing: 0 | waiting: 108
chaiml-kimid-v5a-q235-l-30124-v3-uploader: ---------------------------------------------------
chaiml-kimid-v5a-q235-l-30124-v3-uploader:       
chaiml-kimid-v5a-q235-l-30124-v3-uploader: ---------- 2026-02-13 22:24:37 (0:03:00) ----------
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 18/28 (81.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 10 | committing: 0 | waiting: 116
chaiml-kimid-v5a-q235-l-30124-v3-uploader: ---------------------------------------------------
chaiml-kimid-v5a-q235-l-30124-v3-uploader:       
chaiml-kimid-v5a-q235-l-30124-v3-uploader: ---------- 2026-02-13 22:25:37 (0:04:00) ----------
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 27/28 (126.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 1 | committing: 0 | waiting: 125
chaiml-kimid-v5a-q235-l-30124-v3-uploader: ---------------------------------------------------
Retrying (%r) after connection broken by '%r': %s
Failed to get response for submission chaiml-mistral-24b-2048_90555_v1: ('http://chaiml-mistral-24b-2048-90555-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'read tcp 127.0.0.1:34662->127.0.0.1:8080: read: connection reset by peer\n')
chaiml-kimid-v5a-q235-l-30124-v3-uploader:       
chaiml-kimid-v5a-q235-l-30124-v3-uploader: ---------- 2026-02-13 22:26:37 (0:05:00) ----------
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Files: hashed 38/38 (131.9G/131.9G) | pre-uploaded: 28/28 (131.9G/131.9G) | committed: 0/38 (0.0/131.9G) | ignored: 0
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Workers: hashing: 0 | get upload mode: 0 | pre-uploading: 0 | committing: 1 | waiting: 125
chaiml-kimid-v5a-q235-l-30124-v3-uploader: ---------------------------------------------------
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Processed model ChaiML/kimid-v5a-q235-lr1e4ep2r64 in 5193.164s
chaiml-kimid-v5a-q235-l-30124-v3-uploader: creating bucket guanaco-vllm-models
chaiml-kimid-v5a-q235-l-30124-v3-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v5a-q235-l-30124-v3-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-kimid-v5a-q235-l-30124-v3-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-kimid-v5a-q235-l-30124-v3-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-kimid-v5a-q235-l-30124-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v5a-q235-l-30124-v3-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-kimid-v5a-q235-l-30124-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v5a-q235-l-30124-v3-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-kimid-v5a-q235-l-30124-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v5a-q235-l-30124-v3-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-kimid-v5a-q235-l-30124-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v5a-q235-l-30124-v3-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-kimid-v5a-q235-l-30124-v3-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-kimid-v5a-q235-l-30124-v3-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-kimid-v5a-q235-l-30124-v3-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-kimid-v5a-q235-l-30124-v3-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-kimid-v5a-q235-l-30124-v3-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-kimid-v5a-q235-l-30124-v3-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/added_tokens.json
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/generation_config.json
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/chat_template.jinja
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/tokenizer_config.json
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/special_tokens_map.json
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/config.json
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/quantization_config.json
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/merges.txt
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/vocab.json
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model.safetensors.index.json
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/tokenizer.json
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00027-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00007-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00021-of-00027.safetensors
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00026-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00002-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00025-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00013-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00015-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00001-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00017-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00014-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00003-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00009-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00019-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00006-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00004-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00008-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00022-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00011-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00020-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00024-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00018-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00010-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00016-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00005-of-00027.safetensors
chaiml-kimid-v5a-q235-l-30124-v3-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v5a-q235-l-30124-v3/default/model-00023-of-00027.safetensors
Job chaiml-kimid-v5a-q235-l-30124-v3-uploader completed after 5464.3s with status: succeeded
Stopping job with name chaiml-kimid-v5a-q235-l-30124-v3-uploader
Pipeline stage VLLMUploader completed in 5464.81s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.17s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-kimid-v5a-q235-l-30124-v3
Waiting for inference service chaiml-kimid-v5a-q235-l-30124-v3 to be ready
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
HTTP Request: %s %s "%s %d %s"
Inference service chaiml-kimid-v5a-q235-l-30124-v3 ready after 632.3334426879883s
Pipeline stage VLLMDeployer completed in 633.75s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2923648357391357s
Received healthy response to inference request in 2.0111255645751953s
Received healthy response to inference request in 1.9847688674926758s
Received healthy response to inference request in 2.182328462600708s
Received healthy response to inference request in 2.1970691680908203s
Received healthy response to inference request in 2.1233248710632324s
Received healthy response to inference request in 2.6782071590423584s
Received healthy response to inference request in 2.202911615371704s
Received healthy response to inference request in 1.9806671142578125s
Received healthy response to inference request in 2.1409285068511963s
Received healthy response to inference request in 2.1620426177978516s
Received healthy response to inference request in 2.027721881866455s
Received healthy response to inference request in 2.3113725185394287s
Received healthy response to inference request in 2.022120237350464s
Received healthy response to inference request in 2.0228750705718994s
Received healthy response to inference request in 2.0463767051696777s
Received healthy response to inference request in 2.115917921066284s
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Received healthy response to inference request in 1.9755072593688965s
Received healthy response to inference request in 2.3456904888153076s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 2.0101091861724854s
Received healthy response to inference request in 2.296998977661133s
Received healthy response to inference request in 2.2111976146698s
Received healthy response to inference request in 1.9135935306549072s
Received healthy response to inference request in 2.0018484592437744s
Received healthy response to inference request in 1.966012716293335s
Received healthy response to inference request in 2.030278444290161s
Received healthy response to inference request in 1.9925007820129395s
Received healthy response to inference request in 2.120256185531616s
Received healthy response to inference request in 2.198608875274658s
Received healthy response to inference request in 2.0607664585113525s
30 requests
0 failed requests
5th percentile: 1.9702852606773376
10th percentile: 1.9801511287689209
20th percentile: 1.9999789237976073
30th percentile: 2.0188218355178833
40th percentile: 2.029255819320679
50th percentile: 2.0883421897888184
60th percentile: 2.130366325378418
70th percentile: 2.1867506742477416
80th percentile: 2.2045688152313234
90th percentile: 2.2984363317489622
95th percentile: 2.330247402191162
99th percentile: 2.581777324676514
mean time: 2.1208497365315755
Pipeline stage StressChecker completed in 70.14s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.72s
Shutdown handler de-registered
chaiml-kimid-v5a-q235-l_30124_v3 status is now deployed due to DeploymentManager action
chaiml-kimid-v5a-q235-l_30124_v3 status is now inactive due to auto deactivation removed underperforming models