Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-kimid-v11-kimidv-25711-v4-uploader
Waiting for job on chaiml-kimid-v11-kimidv-25711-v4-uploader to finish
chaiml-kimid-v11-kimidv-25711-v4-uploader: Using quantization_mode: w4a16
chaiml-kimid-v11-kimidv-25711-v4-uploader: Checking if ChaiML/kimid-v11-kimidv4ann-lr1e5ep2r64g8b01-W4A16 already exists in ChaiML
chaiml-kimid-v11-kimidv-25711-v4-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-kimid-v11-kimidv-25711-v4-uploader: Downloading snapshot of ChaiML/kimid-v11-kimidv4ann-lr1e5ep2r64g8b01-W4A16...
Fetching 39 files: 100%|██████████| 39/39 [00:47<00:00, 1.22s/it]
chaiml-kimid-v11-kimidv-25711-v4-uploader: Downloaded in 47.530s
chaiml-kimid-v11-kimidv-25711-v4-uploader: Processed model ChaiML/kimid-v11-kimidv4ann-lr1e5ep2r64g8b01 in 48.064s
chaiml-kimid-v11-kimidv-25711-v4-uploader: creating bucket guanaco-vllm-models
chaiml-kimid-v11-kimidv-25711-v4-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v11-kimidv-25711-v4-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-kimid-v11-kimidv-25711-v4-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-kimid-v11-kimidv-25711-v4-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-kimid-v11-kimidv-25711-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v11-kimidv-25711-v4-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-kimid-v11-kimidv-25711-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v11-kimidv-25711-v4-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-kimid-v11-kimidv-25711-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v11-kimidv-25711-v4-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-kimid-v11-kimidv-25711-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v11-kimidv-25711-v4-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-kimid-v11-kimidv-25711-v4-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-kimid-v11-kimidv-25711-v4-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-kimid-v11-kimidv-25711-v4-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-kimid-v11-kimidv-25711-v4-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-kimid-v11-kimidv-25711-v4-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-kimid-v11-kimidv-25711-v4-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/added_tokens.json
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/tokenizer_config.json
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/generation_config.json
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/special_tokens_map.json
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/.gitattributes
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/chat_template.jinja
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/config.json
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/quantization_config.json
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/merges.txt
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/vocab.json
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model.safetensors.index.json
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/tokenizer.json
HTTP Request: %s %s "%s %d %s"
Retrying (%r) after connection broken by '%r': %s
Retrying (%r) after connection broken by '%r': %s
Retrying (%r) after connection broken by '%r': %s
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00027-of-00027.safetensors
Retrying (%r) after connection broken by '%r': %s
Retrying (%r) after connection broken by '%r': %s
HTTP Request: %s %s "%s %d %s"
Failed to get response for submission blend_beren_2026-01-30: HTTPConnectionPool(host='chaiml-kimid-v8b-kimidv-63800-v5-predictor.tenant-chaiml-guanaco.k2.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00022-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00010-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00008-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00012-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00007-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00024-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00020-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00005-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00003-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00001-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00013-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00019-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00017-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00021-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00009-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00015-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00018-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00026-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00011-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00006-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00014-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00025-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00002-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00004-of-00027.safetensors
chaiml-kimid-v11-kimidv-25711-v4-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-kimid-v11-kimidv-25711-v4/model-00016-of-00027.safetensors
Job chaiml-kimid-v11-kimidv-25711-v4-uploader completed after 467.52s with status: succeeded
Stopping job with name chaiml-kimid-v11-kimidv-25711-v4-uploader
Pipeline stage VLLMUploader completed in 468.71s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 1.10s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-kimid-v11-kimidv-25711-v4
Waiting for inference service chaiml-kimid-v11-kimidv-25711-v4 to be ready
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
HTTP Request: %s %s "%s %d %s"
Inference service chaiml-kimid-v11-kimidv-25711-v4 ready after 581.4144642353058s
Pipeline stage VLLMDeployer completed in 583.43s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.7895488739013672s
Received healthy response to inference request in 1.7937939167022705s
Received healthy response to inference request in 2.1357009410858154s
Received healthy response to inference request in 1.9249436855316162s
Received healthy response to inference request in 1.9379172325134277s
Received healthy response to inference request in 1.9358494281768799s
Received healthy response to inference request in 2.476315975189209s
Received healthy response to inference request in 1.7340824604034424s
Received healthy response to inference request in 1.9365875720977783s
Received healthy response to inference request in 1.893866777420044s
Received healthy response to inference request in 2.0178160667419434s
Received healthy response to inference request in 2.0679516792297363s
Received healthy response to inference request in 2.0512523651123047s
Received healthy response to inference request in 1.7074522972106934s
Received healthy response to inference request in 1.681241750717163s
Received healthy response to inference request in 1.8224208354949951s
Received healthy response to inference request in 2.1196396350860596s
Received healthy response to inference request in 2.104626178741455s
Received healthy response to inference request in 1.7183635234832764s
Received healthy response to inference request in 1.6659529209136963s
Received healthy response to inference request in 1.8109748363494873s
Received healthy response to inference request in 1.700225830078125s
Received healthy response to inference request in 2.0357418060302734s
Received healthy response to inference request in 1.7769725322723389s
Received healthy response to inference request in 1.9058279991149902s
Received healthy response to inference request in 1.9436447620391846s
Received healthy response to inference request in 1.7540340423583984s
Received healthy response to inference request in 1.7269456386566162s
Received healthy response to inference request in 2.00105619430542s
Received healthy response to inference request in 1.7454166412353516s
30 requests
0 failed requests
5th percentile: 1.689784586429596
10th percentile: 1.7067296504974365
20th percentile: 1.7326550960540772
30th percentile: 1.7700909852981568
40th percentile: 1.8041024684906006
50th percentile: 1.899847388267517
60th percentile: 1.9361446857452393
70th percentile: 1.960868191719055
80th percentile: 2.0388439178466795
90th percentile: 2.1061275243759154
95th percentile: 2.128473353385925
99th percentile: 2.377537615299225
mean time: 1.8972054799397786
Pipeline stage StressChecker completed in 61.65s
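
Note on the StressChecker summary above: the reported percentiles are consistent with linear interpolation between sorted samples, which is the default behaviour of `numpy.percentile`. The log does not say how StressChecker computes them, so the sketch below is an assumption that happens to reproduce the logged figures from the 30 logged latencies. The names `percentile` and `latencies` are hypothetical and do not come from the pipeline code.

```python
from statistics import fmean


def percentile(samples, q):
    """Return the q-th percentile (0-100) using linear interpolation
    between the two nearest order statistics."""
    ordered = sorted(samples)
    rank = (len(ordered) - 1) * q / 100.0   # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


# The 30 healthy-response latencies (seconds) copied from the log, in logged order.
latencies = [
    1.7895488739013672, 1.7937939167022705, 2.1357009410858154,
    1.9249436855316162, 1.9379172325134277, 1.9358494281768799,
    2.476315975189209, 1.7340824604034424, 1.9365875720977783,
    1.893866777420044, 2.0178160667419434, 2.0679516792297363,
    2.0512523651123047, 1.7074522972106934, 1.681241750717163,
    1.8224208354949951, 2.1196396350860596, 2.104626178741455,
    1.7183635234832764, 1.6659529209136963, 1.8109748363494873,
    1.700225830078125, 2.0357418060302734, 1.7769725322723389,
    1.9058279991149902, 1.9436447620391846, 1.7540340423583984,
    1.7269456386566162, 2.00105619430542, 1.7454166412353516,
]

if __name__ == "__main__":
    print(f"{len(latencies)} requests")
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {fmean(latencies)}")
```

Run against the logged samples, this should agree with the summary to within floating-point rounding, for example the 50th percentile is the midpoint of the 15th and 16th sorted values.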
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.70s
Shutdown handler de-registered
chaiml-kimid-v11-kimidv_25711_v4 status is now deployed due to DeploymentManager action
chaiml-kimid-v11-kimidv_25711_v4 status is now inactive due to auto deactivation removed underperforming models
chaiml-kimid-v11-kimidv_25711_v4 status is now torndown due to DeploymentManager action