submission_id: chaiml-kimid-v8b-kimidv_77693_v4
developer_uid: richhx
status: inactive
model_repo: ChaiML/kimid-v8b-kimidv5a-lr5e6ep2r64g4b01
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['</s>', '<|assistant|>', '####', '</think>', '<|user|>', '<|im_end|>'], 'max_input_tokens': 2048, 'best_of': 8, 'max_output_tokens': 80}
formatter: {'memory_template': "<|im_start|>system\n{bot_name}'s persona: {memory}<|im_end|>\n", 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{bot_name}: {message}<|im_end|>\n', 'user_template': '<|im_start|>user\n{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n{bot_name}:', 'truncate_by_message': True}
timestamp: 2026-03-23T16:12:57+00:00
model_name: chaiml-kimid-v8b-kimidv_77693_v4
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-kimid-v8b-kimidv-77693-v4-uploader
Waiting for job on chaiml-kimid-v8b-kimidv-77693-v4-uploader to finish
chaiml-kimid-v8b-kimidv-77693-v4-uploader: Using quantization_mode: w4a16
chaiml-kimid-v8b-kimidv-77693-v4-uploader: Checking if ChaiML/kimid-v8b-kimidv5a-lr5e6ep2r64g4b01-W4A16 already exists in ChaiML
chaiml-kimid-v8b-kimidv-77693-v4-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-kimid-v8b-kimidv-77693-v4-uploader: Downloading snapshot of ChaiML/kimid-v8b-kimidv5a-lr5e6ep2r64g4b01-W4A16...
2026-03-23T16:05:58.470273+00:00 monitor updated for chaiml-kimid-v8b-kimidv_77693_v4
chaiml-kimid-v8b-kimidv-77693-v4-uploader: Downloaded in 64.891s
chaiml-kimid-v8b-kimidv-77693-v4-uploader: Processed model ChaiML/kimid-v8b-kimidv5a-lr5e6ep2r64g4b01 in 65.503s
chaiml-kimid-v8b-kimidv-77693-v4-uploader: creating bucket guanaco-vllm-models
chaiml-kimid-v8b-kimidv-77693-v4-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v8b-kimidv-77693-v4-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-kimid-v8b-kimidv-77693-v4-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-kimid-v8b-kimidv-77693-v4-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-kimid-v8b-kimidv-77693-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v8b-kimidv-77693-v4-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-kimid-v8b-kimidv-77693-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v8b-kimidv-77693-v4-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-kimid-v8b-kimidv-77693-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v8b-kimidv-77693-v4-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-kimid-v8b-kimidv-77693-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v8b-kimidv-77693-v4-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-kimid-v8b-kimidv-77693-v4-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-kimid-v8b-kimidv-77693-v4-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-kimid-v8b-kimidv-77693-v4-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-kimid-v8b-kimidv-77693-v4-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-kimid-v8b-kimidv-77693-v4-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-kimid-v8b-kimidv-77693-v4-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/added_tokens.json
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/config.json
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/chat_template.jinja
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/.gitattributes
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/generation_config.json
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/special_tokens_map.json
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/quantization_config.json
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/tokenizer_config.json
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/merges.txt
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model.safetensors.index.json
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/vocab.json
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/tokenizer.json
2026-03-23T16:06:58.561118+00:00 monitor updated for chaiml-kimid-v8b-kimidv_77693_v4
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00027-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00022-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00017-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00006-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00008-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00001-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00004-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00026-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00005-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00019-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00014-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00020-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00002-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00021-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00013-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00010-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00012-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00009-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00025-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00015-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00023-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00018-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00016-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00007-of-00027.safetensors
chaiml-kimid-v8b-kimidv-77693-v4-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimidv-77693-v4/default/model-00003-of-00027.safetensors
Job chaiml-kimid-v8b-kimidv-77693-v4-uploader completed after 155.65s with status: succeeded
Stopping job with name chaiml-kimid-v8b-kimidv-77693-v4-uploader
Pipeline stage VLLMUploader completed in 156.15s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 5.26s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-kimid-v8b-kimidv-77693-v4
Waiting for inference service chaiml-kimid-v8b-kimidv-77693-v4 to be ready
2026-03-23T16:07:58.654809+00:00 monitor updated for chaiml-kimid-v8b-kimidv_77693_v4
2026-03-23T16:08:58.753570+00:00 monitor updated for chaiml-kimid-v8b-kimidv_77693_v4
2026-03-23T16:09:58.855142+00:00 monitor updated for chaiml-kimid-v8b-kimidv_77693_v4
2026-03-23T16:10:58.954861+00:00 monitor updated for chaiml-kimid-v8b-kimidv_77693_v4
Inference service chaiml-kimid-v8b-kimidv-77693-v4 ready after 240.96945810317993s
Pipeline stage VLLMDeployer completed in 242.31s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.175175905227661s
Received healthy response to inference request in 3.8492677211761475s
Received healthy response to inference request in 4.052347183227539s
Received healthy response to inference request in 2.1296229362487793s
2026-03-23T16:11:59.057135+00:00 monitor updated for chaiml-kimid-v8b-kimidv_77693_v4
Received healthy response to inference request in 4.651668071746826s
Received healthy response to inference request in 1.9232335090637207s
Received healthy response to inference request in 1.867915391921997s
Received healthy response to inference request in 1.9807615280151367s
Received healthy response to inference request in 1.964198112487793s
Received healthy response to inference request in 1.958852767944336s
Received healthy response to inference request in 1.8697378635406494s
Received healthy response to inference request in 2.360598564147949s
Received healthy response to inference request in 1.8583545684814453s
Received healthy response to inference request in 1.9687683582305908s
Received healthy response to inference request in 1.8428666591644287s
Received healthy response to inference request in 1.9723916053771973s
Received healthy response to inference request in 1.9283924102783203s
Received healthy response to inference request in 1.882948637008667s
Received healthy response to inference request in 1.9692485332489014s
Received healthy response to inference request in 1.9586906433105469s
Received healthy response to inference request in 2.0409371852874756s
Received healthy response to inference request in 2.0359079837799072s
Received healthy response to inference request in 1.998077630996704s
Received healthy response to inference request in 2.3747787475585938s
Received healthy response to inference request in 1.948700189590454s
Received healthy response to inference request in 2.0398619174957275s
Received healthy response to inference request in 1.9655671119689941s
Received healthy response to inference request in 2.104652166366577s
Received healthy response to inference request in 1.957308292388916s
Received healthy response to inference request in 2.004903554916382s
30 requests
0 failed requests
5th percentile: 1.8626569390296936
10th percentile: 1.869555616378784
20th percentile: 1.9273606300354005
30th percentile: 1.9582759380340575
40th percentile: 1.9650195121765137
50th percentile: 1.9708200693130493
60th percentile: 2.000808000564575
70th percentile: 2.040184497833252
80th percentile: 2.175818061828614
90th percentile: 3.869575667381287
95th percentile: 4.1199029803276055
99th percentile: 4.513485343456269
mean time: 2.2878578583399456
Pipeline stage StressChecker completed in 71.58s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.53s
Shutdown handler de-registered
chaiml-kimid-v8b-kimidv_77693_v4 status is now deployed due to DeploymentManager action
chaiml-kimid-v8b-kimidv_77693_v4 status is now inactive due to admin request