Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-kimid-v12-mv1-to-30309-v3-uploader
Waiting for job on chaiml-kimid-v12-mv1-to-30309-v3-uploader to finish
chaiml-kimid-v12-mv1-to-30309-v3-uploader: Using quantization_mode: none
chaiml-kimid-v12-mv1-to-30309-v3-uploader: Downloading snapshot of ChaiML/kimid-v12-mv1-top2-q35b-lr5e6ep2g8...
chaiml-kimid-v12-mv1-to-30309-v3-uploader: Downloaded in 25.655s
2026-03-25T16:57:23.668755+00:00 monitor updated for chaiml-kimid-v12-mv1-to_30309_v3
chaiml-kimid-v12-mv1-to-30309-v3-uploader: Processed model ChaiML/kimid-v12-mv1-top2-q35b-lr5e6ep2g8 in 51.607s
chaiml-kimid-v12-mv1-to-30309-v3-uploader: creating bucket guanaco-vllm-models
chaiml-kimid-v12-mv1-to-30309-v3-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v12-mv1-to-30309-v3-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-kimid-v12-mv1-to-30309-v3-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-kimid-v12-mv1-to-30309-v3-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-kimid-v12-mv1-to-30309-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v12-mv1-to-30309-v3-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-kimid-v12-mv1-to-30309-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v12-mv1-to-30309-v3-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-kimid-v12-mv1-to-30309-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v12-mv1-to-30309-v3-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-kimid-v12-mv1-to-30309-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-kimid-v12-mv1-to-30309-v3-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-kimid-v12-mv1-to-30309-v3-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-kimid-v12-mv1-to-30309-v3-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-kimid-v12-mv1-to-30309-v3-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-kimid-v12-mv1-to-30309-v3-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-kimid-v12-mv1-to-30309-v3-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-kimid-v12-mv1-to-30309-v3-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/config.json
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/processor_config.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/processor_config.json
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/added_tokens.json
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/generation_config.json
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/special_tokens_map.json
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/model.safetensors.index.json
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/chat_template.jinja
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/args.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/args.json
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/.gitattributes
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/README.md s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/README.md
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/preprocessor_config.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/preprocessor_config.json
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/merges.txt
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/tokenizer_config.json
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/vocab.json
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/tokenizer.json
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/model-00016-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/model-00016-of-00016.safetensors
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/model-00007-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/model-00007-of-00016.safetensors
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/model-00004-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/model-00004-of-00016.safetensors
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/model-00013-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/model-00013-of-00016.safetensors
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/model-00010-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/model-00010-of-00016.safetensors
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/model-00002-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/model-00002-of-00016.safetensors
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/model-00001-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/model-00001-of-00016.safetensors
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/model-00011-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/model-00011-of-00016.safetensors
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/model-00014-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/model-00014-of-00016.safetensors
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/model-00005-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/model-00005-of-00016.safetensors
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/model-00008-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/model-00008-of-00016.safetensors
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/model-00015-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/model-00015-of-00016.safetensors
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/model-00012-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/model-00012-of-00016.safetensors
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/model-00006-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/model-00006-of-00016.safetensors
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/model-00009-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/model-00009-of-00016.safetensors
chaiml-kimid-v12-mv1-to-30309-v3-uploader: cp /dev/shm/model_output/model-00003-of-00016.safetensors s3://guanaco-vllm-models/chaiml-kimid-v12-mv1-to-30309-v3/default/model-00003-of-00016.safetensors
Job chaiml-kimid-v12-mv1-to-30309-v3-uploader completed after 86.75s with status: succeeded
Stopping job with name chaiml-kimid-v12-mv1-to-30309-v3-uploader
Pipeline stage VLLMUploader completed in 87.22s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.85s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-kimid-v12-mv1-to-30309-v3
Waiting for inference service chaiml-kimid-v12-mv1-to-30309-v3 to be ready
2026-03-25T16:58:23.807275+00:00 monitor updated for chaiml-kimid-v12-mv1-to_30309_v3
2026-03-25T16:59:23.936252+00:00 monitor updated for chaiml-kimid-v12-mv1-to_30309_v3
Failed to get response for submission chaiml-pony-d3a-mv1-plc_30375_v1: ('http://chaiml-pony-d3a-mv1-plc-30375-v1-predictor.tenant-chaiml-guanaco.k2.chaiverse.com/v1/completions', 'request timeout')
Connection pool is full, discarding connection: %s. Connection pool size: %s
2026-03-25T17:00:24.887974+00:00 monitor updated for chaiml-kimid-v12-mv1-to_30309_v3
Inference service chaiml-kimid-v12-mv1-to-30309-v3 ready after 171.07689595222473s
Pipeline stage VLLMDeployer completed in 171.58s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
2026-03-25T17:01:25.067628+00:00 monitor updated for chaiml-kimid-v12-mv1-to_30309_v3
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
2026-03-25T17:02:25.315189+00:00 monitor updated for chaiml-kimid-v12-mv1-to_30309_v3
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 6.664383411407471s
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 6.539663553237915s
Received healthy response to inference request in 2.049499273300171s
2026-03-25T17:03:25.433015+00:00 monitor updated for chaiml-kimid-v12-mv1-to_30309_v3
Received healthy response to inference request in 6.355379343032837s
Received healthy response to inference request in 1.891247272491455s
Received healthy response to inference request in 1.3629112243652344s
Received healthy response to inference request in 2.616058349609375s
Received healthy response to inference request in 1.7176082134246826s
Received healthy response to inference request in 1.66780686378479s
Received healthy response to inference request in 2.0024421215057373s
HTTPConnectionPool(host='guanaco-submitter-v2.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 7.1126930713653564s
Received healthy response to inference request in 1.469719409942627s
Received healthy response to inference request in 2.9385955333709717s
Received healthy response to inference request in 1.868058443069458s
Received healthy response to inference request in 1.5024197101593018s
Received healthy response to inference request in 1.4250164031982422s
Received healthy response to inference request in 1.524569034576416s
Received healthy response to inference request in 1.5556178092956543s
Received healthy response to inference request in 1.6738450527191162s
Received healthy response to inference request in 1.5642879009246826s
Received healthy response to inference request in 1.524156093597412s
Received healthy response to inference request in 1.6020216941833496s
2026-03-25T17:04:26.023571+00:00 monitor updated for chaiml-kimid-v12-mv1-to_30309_v3
30 requests
8 failed requests
5th percentile: 1.4451327562332152
10th percentile: 1.4991496801376343
20th percentile: 1.5494080543518067
30th percentile: 1.6480713129043578
40th percentile: 1.8078783512115482
50th percentile: 2.025970697402954
60th percentile: 4.3053090572357124
70th percentile: 6.798876309394835
80th percentile: 20.139075613021852
90th percentile: 20.199912095069884
95th percentile: 20.2191011428833
99th percentile: 20.327551674842834
mean time: 7.3407999197642
%s, retrying in %s seconds...
Received healthy response to inference request in 1.3611187934875488s
Received healthy response to inference request in 1.5007760524749756s
Received healthy response to inference request in 1.4100277423858643s
Received healthy response to inference request in 1.4990968704223633s
Received healthy response to inference request in 1.4742403030395508s
Received healthy response to inference request in 1.3437156677246094s
Received healthy response to inference request in 1.4837567806243896s
Received healthy response to inference request in 2.0230891704559326s
Received healthy response to inference request in 1.3715581893920898s
Received healthy response to inference request in 2.5618631839752197s
Received healthy response to inference request in 1.3250508308410645s
Received healthy response to inference request in 2.0463428497314453s
Received healthy response to inference request in 2.2306790351867676s
Received healthy response to inference request in 1.3388457298278809s
Received healthy response to inference request in 1.69724440574646s
Received healthy response to inference request in 1.9139492511749268s
Received healthy response to inference request in 2.000087261199951s
Received healthy response to inference request in 1.5532267093658447s
Received healthy response to inference request in 1.351428747177124s
Received healthy response to inference request in 1.6568777561187744s
Received healthy response to inference request in 1.4138774871826172s
Received healthy response to inference request in 1.3935043811798096s
Received healthy response to inference request in 1.9807775020599365s
Received healthy response to inference request in 1.8619873523712158s
Received healthy response to inference request in 2.2235355377197266s
Received healthy response to inference request in 1.3572618961334229s
Received healthy response to inference request in 1.587695598602295s
Received healthy response to inference request in 1.4727134704589844s
Received healthy response to inference request in 1.4005975723266602s
Received healthy response to inference request in 1.3785440921783447s
30 requests
0 failed requests
5th percentile: 1.3410372018814087
10th percentile: 1.3506574392318726
20th percentile: 1.3694703102111816
30th percentile: 1.398469614982605
40th percentile: 1.4491790771484376
50th percentile: 1.4914268255233765
60th percentile: 1.5670142650604248
70th percentile: 1.7466672897338862
80th percentile: 1.9846394538879395
90th percentile: 2.064062118530274
95th percentile: 2.227464461326599
99th percentile: 2.465819780826569
mean time: 1.6404490073521931
Pipeline stage StressChecker completed in 278.30s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.99s
Shutdown handler de-registered
chaiml-kimid-v12-mv1-to_30309_v3 status is now deployed due to DeploymentManager action
chaiml-kimid-v12-mv1-to_30309_v3 status is now inactive due to auto deactivation removed underperforming models
chaiml-kimid-v12-mv1-to_30309_v3 status is now torndown due to DeploymentManager action