Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-kimid-v12-mv1-wi-79693-v2-uploader
Waiting for job on chaiml-kimid-v12-mv1-wi-79693-v2-uploader to finish
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: Using quantization_mode: none
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: Downloading snapshot of ChaiML/kimid-v12-mv1-winall-q35b-lr5e6ep2g8...
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: Downloaded in 26.537s
2026-03-25T17:32:28.407558+00:00 monitor updated for chaiml-kimid-v12-mv1-wi_79693_v2
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: Processed model ChaiML/kimid-v12-mv1-winall-q35b-lr5e6ep2g8 in 52.825s
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: creating bucket guanaco-vllm-models
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/.gitattributes
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/chat_template.jinja
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/special_tokens_map.json
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/args.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/args.json
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/added_tokens.json
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/config.json
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/preprocessor_config.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/preprocessor_config.json
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/processor_config.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/processor_config.json
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/tokenizer_config.json
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/README.md s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/README.md
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/generation_config.json
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/merges.txt
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/model.safetensors.index.json
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/vocab.json
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/model-00016-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/model-00016-of-00016.safetensors
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/model-00010-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/model-00010-of-00016.safetensors
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/model-00004-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/model-00004-of-00016.safetensors
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/model-00013-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/model-00013-of-00016.safetensors
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/model-00007-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/model-00007-of-00016.safetensors
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/model-00011-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/model-00011-of-00016.safetensors
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/model-00001-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/model-00001-of-00016.safetensors
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/model-00008-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/model-00008-of-00016.safetensors
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/model-00002-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/model-00002-of-00016.safetensors
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/model-00014-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/model-00014-of-00016.safetensors
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/model-00005-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/model-00005-of-00016.safetensors
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/model-00012-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/model-00012-of-00016.safetensors
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/model-00015-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/model-00015-of-00016.safetensors
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/model-00009-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/model-00009-of-00016.safetensors
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/model-00006-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/model-00006-of-00016.safetensors
chaiml-kimid-v12-mv1-wi-79693-v2-uploader: cp /dev/shm/model_output/model-00003-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-wi-79693-v2/default/model-00003-of-00016.safetensors
Job chaiml-kimid-v12-mv1-wi-79693-v2-uploader completed after 94.12s with status: succeeded
Stopping job with name chaiml-kimid-v12-mv1-wi-79693-v2-uploader
Pipeline stage VLLMUploader completed in 94.78s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 3.53s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-kimid-v12-mv1-wi-79693-v2
Waiting for inference service chaiml-kimid-v12-mv1-wi-79693-v2 to be ready
2026-03-25T17:33:28.494238+00:00 monitor updated for chaiml-kimid-v12-mv1-wi_79693_v2
2026-03-25T17:34:28.591558+00:00 monitor updated for chaiml-kimid-v12-mv1-wi_79693_v2
2026-03-25T17:35:28.796444+00:00 monitor updated for chaiml-kimid-v12-mv1-wi_79693_v2
Inference service chaiml-kimid-v12-mv1-wi-79693-v2 ready after 191.278165102005s
Pipeline stage VLLMDeployer completed in 193.14s
run pipeline stage %s
Running pipeline stage StressChecker
2026-03-25T17:36:28.890470+00:00 monitor updated for chaiml-kimid-v12-mv1-wi_79693_v2
Retrying (%r) after connection broken by '%r': %s
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
2026-03-25T17:37:28.990569+00:00 monitor updated for chaiml-kimid-v12-mv1-wi_79693_v2
Received healthy response to inference request in 14.744495153427124s
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 6.378064393997192s
Received healthy response to inference request in 2.673065185546875s
Received healthy response to inference request in 1.6828999519348145s
2026-03-25T17:38:29.085991+00:00 monitor updated for chaiml-kimid-v12-mv1-wi_79693_v2
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 6.766797065734863s
Received healthy response to inference request in 1.3818683624267578s
Received healthy response to inference request in 1.4517786502838135s
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.191521167755127s
Received healthy response to inference request in 7.138591527938843s
Received healthy response to inference request in 1.701202630996704s
2026-03-25T17:39:29.179248+00:00 monitor updated for chaiml-kimid-v12-mv1-wi_79693_v2
Received healthy response to inference request in 2.2819972038269043s
Received healthy response to inference request in 1.5859713554382324s
Received healthy response to inference request in 1.3989582061767578s
Received healthy response to inference request in 7.676555156707764s
Received healthy response to inference request in 1.5328340530395508s
Received healthy response to inference request in 1.5267086029052734s
Received healthy response to inference request in 2.291649341583252s
Received healthy response to inference request in 1.994542121887207s
Received healthy response to inference request in 1.4689242839813232s
Received healthy response to inference request in 1.8015246391296387s
Received healthy response to inference request in 1.6871674060821533s
Received healthy response to inference request in 1.9868462085723877s
Received healthy response to inference request in 1.4048163890838623s
30 requests
7 failed requests
5th percentile: 1.4015943884849549
10th percentile: 1.4470824241638183
20th percentile: 1.5316089630126952
30th percentile: 1.6858871698379516
40th percentile: 1.9127175807952883
50th percentile: 2.2367591857910156
60th percentile: 4.1550648689269964
70th percentile: 7.299980616569518
80th percentile: 20.125011348724364
90th percentile: 20.144272780418397
95th percentile: 20.166238045692445
99th percentile: 20.170026693344116
mean time: 7.192941951751709
%s, retrying in %s seconds...
Received healthy response to inference request in 1.4012260437011719s
Received healthy response to inference request in 1.430877447128296s
Received healthy response to inference request in 1.4417626857757568s
Received healthy response to inference request in 1.3626890182495117s
Received healthy response to inference request in 1.3213133811950684s
Received healthy response to inference request in 1.348379135131836s
Received healthy response to inference request in 1.5167219638824463s
Received healthy response to inference request in 1.4913499355316162s
Received healthy response to inference request in 1.3572947978973389s
Received healthy response to inference request in 1.537515640258789s
Received healthy response to inference request in 1.407036542892456s
Received healthy response to inference request in 1.7207322120666504s
Received healthy response to inference request in 1.5235121250152588s
Received healthy response to inference request in 1.324124813079834s
Received healthy response to inference request in 1.753274917602539s
Received healthy response to inference request in 1.6313307285308838s
Received healthy response to inference request in 1.5785796642303467s
2026-03-25T17:40:29.283787+00:00 monitor updated for chaiml-kimid-v12-mv1-wi_79693_v2
Received healthy response to inference request in 2.074376344680786s
Received healthy response to inference request in 1.521589994430542s
Received healthy response to inference request in 1.5671885013580322s
Received healthy response to inference request in 1.732391119003296s
Received healthy response to inference request in 1.6272780895233154s
Received healthy response to inference request in 1.5284881591796875s
Received healthy response to inference request in 1.6223232746124268s
Received healthy response to inference request in 1.4531476497650146s
Received healthy response to inference request in 1.478956937789917s
Received healthy response to inference request in 1.4265778064727783s
Received healthy response to inference request in 1.4510340690612793s
Received healthy response to inference request in 1.4140164852142334s
Received healthy response to inference request in 1.8207390308380127s
30 requests
0 failed requests
5th percentile: 1.3350392580032349
10th percentile: 1.3564032316207886
20th percentile: 1.4058744430541992
30th percentile: 1.4295875549316406
40th percentile: 1.4523022174835205
50th percentile: 1.5040359497070312
60th percentile: 1.5255025386810304
70th percentile: 1.5706058502197264
80th percentile: 1.628088617324829
90th percentile: 1.7344794988632202
95th percentile: 1.7903801798820493
99th percentile: 2.000821523666382
mean time: 1.5288609504699706
Pipeline stage StressChecker completed in 278.31s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.88s
Shutdown handler de-registered
chaiml-kimid-v12-mv1-wi_79693_v2 status is now deployed due to DeploymentManager action
chaiml-kimid-v12-mv1-wi_79693_v2 status is now inactive due to auto deactivation removed underperforming models
chaiml-kimid-v12-mv1-wi_79693_v2 status is now torndown due to DeploymentManager action