Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-kimid-v8b-kimid-77693-v13-uploader
Waiting for job on chaiml-kimid-v8b-kimid-77693-v13-uploader to finish
chaiml-kimid-v8b-kimid-77693-v13-uploader: Using quantization_mode: w4a16
chaiml-kimid-v8b-kimid-77693-v13-uploader: Checking if ChaiML/kimid-v8b-kimidv5a-lr5e6ep2r64g4b01-W4A16 already exists in ChaiML
chaiml-kimid-v8b-kimid-77693-v13-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-kimid-v8b-kimid-77693-v13-uploader: Downloading snapshot of ChaiML/kimid-v8b-kimidv5a-lr5e6ep2r64g4b01-W4A16...
2026-04-02T20:35:59.942578+00:00 monitor updated for chaiml-kimid-v8b-kimid_77693_v13
chaiml-kimid-v8b-kimid-77693-v13-uploader: Downloaded in 56.340s
chaiml-kimid-v8b-kimid-77693-v13-uploader: Processed model ChaiML/kimid-v8b-kimidv5a-lr5e6ep2r64g4b01 in 58.963s
chaiml-kimid-v8b-kimid-77693-v13-uploader: creating bucket guanaco-vllm-models
chaiml-kimid-v8b-kimid-77693-v13-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v8b-kimid-77693-v13-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-kimid-v8b-kimid-77693-v13-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-kimid-v8b-kimid-77693-v13-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-kimid-v8b-kimid-77693-v13-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v8b-kimid-77693-v13-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-kimid-v8b-kimid-77693-v13-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v8b-kimid-77693-v13-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-kimid-v8b-kimid-77693-v13-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v8b-kimid-77693-v13-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-kimid-v8b-kimid-77693-v13-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v8b-kimid-77693-v13-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-kimid-v8b-kimid-77693-v13-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-kimid-v8b-kimid-77693-v13-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-kimid-v8b-kimid-77693-v13-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-kimid-v8b-kimid-77693-v13-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-kimid-v8b-kimid-77693-v13-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-kimid-v8b-kimid-77693-v13-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/generation_config.json
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/added_tokens.json
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/config.json
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/quantization_config.json
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/chat_template.jinja
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/special_tokens_map.json
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/tokenizer_config.json
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/.gitattributes
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/merges.txt
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/tokenizer.json
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/vocab.json
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model.safetensors.index.json
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00027-of-00027.safetensors
2026-04-02T20:37:00.033395+00:00 monitor updated for chaiml-kimid-v8b-kimid_77693_v13
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00018-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00021-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00025-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00017-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00010-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00014-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00003-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00024-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00009-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00012-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00001-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00007-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00023-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00008-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00016-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00004-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00013-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00026-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00011-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00006-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00020-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00022-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00005-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00015-of-00027.safetensors
chaiml-kimid-v8b-kimid-77693-v13-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v8b-kimid-77693-v13/default/model-00019-of-00027.safetensors
Job chaiml-kimid-v8b-kimid-77693-v13-uploader completed after 144.96s with status: succeeded
Stopping job with name chaiml-kimid-v8b-kimid-77693-v13-uploader
Pipeline stage VLLMUploader completed in 145.48s
run pipeline stage %s
Running pipeline stage VLLMUploaderAMD
Pipeline stage vllm_upload_amd skipped, reason=not amd cluster
Pipeline stage VLLMUploaderAMD completed in 0.11s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 3.96s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-kimid-v8b-kimid-77693-v13
Waiting for inference service chaiml-kimid-v8b-kimid-77693-v13 to be ready
2026-04-02T20:38:00.153320+00:00 monitor updated for chaiml-kimid-v8b-kimid_77693_v13
2026-04-02T20:39:00.261934+00:00 monitor updated for chaiml-kimid-v8b-kimid_77693_v13
2026-04-02T20:40:00.366307+00:00 monitor updated for chaiml-kimid-v8b-kimid_77693_v13
2026-04-02T20:41:00.490967+00:00 monitor updated for chaiml-kimid-v8b-kimid_77693_v13
2026-04-02T20:42:00.588173+00:00 monitor updated for chaiml-kimid-v8b-kimid_77693_v13
2026-04-02T20:43:00.692958+00:00 monitor updated for chaiml-kimid-v8b-kimid_77693_v13
2026-04-02T20:44:00.798126+00:00 monitor updated for chaiml-kimid-v8b-kimid_77693_v13
2026-04-02T20:45:00.901217+00:00 monitor updated for chaiml-kimid-v8b-kimid_77693_v13
2026-04-02T20:46:01.012732+00:00 monitor updated for chaiml-kimid-v8b-kimid_77693_v13
Failed to get response for submission chaiml-q235b-opus-judge_50366_v2: HTTPConnectionPool(host='chaiml-q235b-opus-judge-50366-v2-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=20.0)
2026-04-02T20:47:01.115050+00:00 monitor updated for chaiml-kimid-v8b-kimid_77693_v13
2026-04-02T20:48:01.239929+00:00 monitor updated for chaiml-kimid-v8b-kimid_77693_v13
2026-04-02T20:49:01.461597+00:00 monitor updated for chaiml-kimid-v8b-kimid_77693_v13
2026-04-02T20:50:01.564572+00:00 monitor updated for chaiml-kimid-v8b-kimid_77693_v13
2026-04-02T20:51:01.681238+00:00 monitor updated for chaiml-kimid-v8b-kimid_77693_v13
2026-04-02T20:52:01.805478+00:00 monitor updated for chaiml-kimid-v8b-kimid_77693_v13
Inference service chaiml-kimid-v8b-kimid-77693-v13 ready after 908.3391349315643s
Pipeline stage VLLMDeployer completed in 908.80s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.501953601837158s
Received healthy response to inference request in 1.4482977390289307s
Received healthy response to inference request in 1.457214593887329s
Received healthy response to inference request in 1.932157039642334s
Received healthy response to inference request in 2.2437078952789307s
Received healthy response to inference request in 1.5582828521728516s
Received healthy response to inference request in 1.5629074573516846s
Received healthy response to inference request in 1.4694325923919678s
Received healthy response to inference request in 1.4614858627319336s
Received healthy response to inference request in 1.5017693042755127s
Received healthy response to inference request in 1.4601771831512451s
Received healthy response to inference request in 1.4746215343475342s
2026-04-02T20:53:01.909531+00:00 monitor updated for chaiml-kimid-v8b-kimid_77693_v13
Received healthy response to inference request in 1.9969580173492432s
Received healthy response to inference request in 1.5436246395111084s
Received healthy response to inference request in 1.541576623916626s
Received healthy response to inference request in 1.609457015991211s
Received healthy response to inference request in 1.5373084545135498s
Received healthy response to inference request in 1.496692180633545s
Received healthy response to inference request in 1.5789093971252441s
Received healthy response to inference request in 1.9095773696899414s
Received healthy response to inference request in 1.472569465637207s
Received healthy response to inference request in 1.5673534870147705s
Received healthy response to inference request in 1.9946765899658203s
Received healthy response to inference request in 1.5800304412841797s
Received healthy response to inference request in 1.5981550216674805s
Received healthy response to inference request in 1.4744257926940918s
Received healthy response to inference request in 1.5052940845489502s
Received healthy response to inference request in 1.5603384971618652s
Received healthy response to inference request in 1.4663281440734863s
Received healthy response to inference request in 1.5595839023590088s
30 requests
0 failed requests
5th percentile: 1.4585477590560914
10th percentile: 1.4613549947738647
20th percentile: 1.471942090988159
30th percentile: 1.4900709867477417
40th percentile: 1.52450270652771
50th percentile: 1.55095374584198
60th percentile: 1.561366081237793
70th percentile: 1.5792457103729247
80th percentile: 1.669481086730958
90th percentile: 1.9949047327041627
95th percentile: 2.1326704502105707
99th percentile: 3.137062346935273
mean time: 1.6688288927078248
Pipeline stage StressChecker completed in 53.15s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.80s
Shutdown handler de-registered
chaiml-kimid-v8b-kimid_77693_v13 status is now deployed due to DeploymentManager action
chaiml-kimid-v8b-kimid_77693_v13 status is now inactive due to auto deactivation removed underperforming models
chaiml-kimid-v8b-kimid_77693_v13 status is now torndown due to DeploymentManager action