Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name qwen-qwen3-5-27b-v10-uploader
Waiting for job on qwen-qwen3-5-27b-v10-uploader to finish
qwen-qwen3-5-27b-v10-uploader: Using quantization_mode: none
qwen-qwen3-5-27b-v10-uploader: Downloading snapshot of Qwen/Qwen3.5-27B...
qwen-qwen3-5-27b-v10-uploader: Downloaded in 18.313s
qwen-qwen3-5-27b-v10-uploader: Processed model Qwen/Qwen3.5-27B in 39.641s
qwen-qwen3-5-27b-v10-uploader: creating bucket guanaco-vllm-models
qwen-qwen3-5-27b-v10-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
qwen-qwen3-5-27b-v10-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
qwen-qwen3-5-27b-v10-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
qwen-qwen3-5-27b-v10-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
qwen-qwen3-5-27b-v10-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
qwen-qwen3-5-27b-v10-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
qwen-qwen3-5-27b-v10-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
qwen-qwen3-5-27b-v10-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
qwen-qwen3-5-27b-v10-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
qwen-qwen3-5-27b-v10-uploader: if re.search("-\.", bucket, re.UNICODE):
qwen-qwen3-5-27b-v10-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
qwen-qwen3-5-27b-v10-uploader: if re.search("\.\.", bucket, re.UNICODE):
qwen-qwen3-5-27b-v10-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
qwen-qwen3-5-27b-v10-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
qwen-qwen3-5-27b-v10-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
qwen-qwen3-5-27b-v10-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
qwen-qwen3-5-27b-v10-uploader: Bucket 's3://guanaco-vllm-models/' created
qwen-qwen3-5-27b-v10-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/config.json
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/.gitattributes
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/chat_template.jinja
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/video_preprocessor_config.json s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/video_preprocessor_config.json
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/README.md s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/README.md
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/tokenizer_config.json
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/generation_config.json
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/model.safetensors.index.json
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/preprocessor_config.json s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/preprocessor_config.json
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/vocab.json
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/merges.txt
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/tokenizer.json
2026-03-22T02:42:45.846587+00:00 monitor updated for qwen-qwen3-5-27b_v10
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/model.safetensors-00011-of-00011.safetensors s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/model.safetensors-00011-of-00011.safetensors
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/model.safetensors-00008-of-00011.safetensors s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/model.safetensors-00008-of-00011.safetensors
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/model.safetensors-00001-of-00011.safetensors s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/model.safetensors-00001-of-00011.safetensors
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/model.safetensors-00006-of-00011.safetensors s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/model.safetensors-00006-of-00011.safetensors
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/model.safetensors-00005-of-00011.safetensors s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/model.safetensors-00005-of-00011.safetensors
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/model.safetensors-00003-of-00011.safetensors s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/model.safetensors-00003-of-00011.safetensors
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/model.safetensors-00009-of-00011.safetensors s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/model.safetensors-00009-of-00011.safetensors
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/model.safetensors-00010-of-00011.safetensors s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/model.safetensors-00010-of-00011.safetensors
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/model.safetensors-00002-of-00011.safetensors s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/model.safetensors-00002-of-00011.safetensors
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/model.safetensors-00004-of-00011.safetensors s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/model.safetensors-00004-of-00011.safetensors
qwen-qwen3-5-27b-v10-uploader: cp /dev/shm/model_output/model.safetensors-00007-of-00011.safetensors s3://guanaco-vllm-models/qwen-qwen3-5-27b-v10/default/model.safetensors-00007-of-00011.safetensors
Job qwen-qwen3-5-27b-v10-uploader completed after 72.8s with status: succeeded
Stopping job with name qwen-qwen3-5-27b-v10-uploader
Pipeline stage VLLMUploader completed in 73.69s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.50s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service qwen-qwen3-5-27b-v10
Waiting for inference service qwen-qwen3-5-27b-v10 to be ready
2026-03-22T02:43:45.934085+00:00 monitor updated for qwen-qwen3-5-27b_v10
2026-03-22T02:44:46.053290+00:00 monitor updated for qwen-qwen3-5-27b_v10
2026-03-22T02:45:46.141854+00:00 monitor updated for qwen-qwen3-5-27b_v10
Inference service qwen-qwen3-5-27b-v10 ready after 200.4666347503662s
Pipeline stage VLLMDeployer completed in 201.03s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
2026-03-22T02:46:46.228569+00:00 monitor updated for qwen-qwen3-5-27b_v10
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
2026-03-22T02:47:46.325053+00:00 monitor updated for qwen-qwen3-5-27b_v10
Received healthy response to inference request in 12.988123416900635s
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 4.8212363719940186s
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 4.5320165157318115s
2026-03-22T02:48:46.412195+00:00 monitor updated for qwen-qwen3-5-27b_v10
Received healthy response to inference request in 2.6662189960479736s
Received healthy response to inference request in 2.1752617359161377s
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
2026-03-22T02:49:46.503569+00:00 monitor updated for qwen-qwen3-5-27b_v10
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.282123327255249s
Received healthy response to inference request in 2.3494060039520264s
Received healthy response to inference request in 2.4182987213134766s
Received healthy response to inference request in 2.241851329803467s
Received healthy response to inference request in 2.5209498405456543s
Received healthy response to inference request in 2.4146759510040283s
Received healthy response to inference request in 2.2918708324432373s
Received healthy response to inference request in 2.3980612754821777s
Received healthy response to inference request in 2.4860281944274902s
Received healthy response to inference request in 2.6280462741851807s
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.453272819519043s
Received healthy response to inference request in 2.350166082382202s
Received healthy response to inference request in 2.6746792793273926s
Received healthy response to inference request in 2.390503406524658s
2026-03-22T02:50:46.595923+00:00 monitor updated for qwen-qwen3-5-27b_v10
Received healthy response to inference request in 2.5309090614318848s
30 requests
10 failed requests
5th percentile: 2.2599737286567687
10th percentile: 2.2908960819244384
20th percentile: 2.382435941696167
30th percentile: 2.417211890220642
40th percentile: 2.506981182098389
50th percentile: 2.647132635116577
60th percentile: 4.647704458236694
70th percentile: 20.116890025138854
80th percentile: 20.125291538238525
90th percentile: 20.133266139030457
95th percentile: 20.13406319618225
99th percentile: 20.13973086118698
mean time: 8.829862276713053
%s, retrying in %s seconds...
Received healthy response to inference request in 2.1710965633392334s
Received healthy response to inference request in 2.1667518615722656s
Received healthy response to inference request in 2.223720073699951s
Received healthy response to inference request in 2.2115743160247803s
Received healthy response to inference request in 2.328930377960205s
Received healthy response to inference request in 2.212022304534912s
Received healthy response to inference request in 2.2336018085479736s
Received healthy response to inference request in 2.2280170917510986s
Received healthy response to inference request in 2.2394771575927734s
Received healthy response to inference request in 2.2420427799224854s
Received healthy response to inference request in 4.492876291275024s
Received healthy response to inference request in 2.2310214042663574s
Received healthy response to inference request in 2.2476770877838135s
Received healthy response to inference request in 2.3911068439483643s
Received healthy response to inference request in 2.436298131942749s
Received healthy response to inference request in 2.299806594848633s
Received healthy response to inference request in 2.3672335147857666s
Received healthy response to inference request in 2.2930243015289307s
Received healthy response to inference request in 2.2488198280334473s
Received healthy response to inference request in 2.368809938430786s
Received healthy response to inference request in 2.449953556060791s
Received healthy response to inference request in 2.246300220489502s
Received healthy response to inference request in 2.7625160217285156s
2026-03-22T02:51:46.702100+00:00 monitor updated for qwen-qwen3-5-27b_v10
Received healthy response to inference request in 2.331408739089966s
Received healthy response to inference request in 2.4182143211364746s
Received healthy response to inference request in 2.3890273571014404s
Received healthy response to inference request in 2.4275670051574707s
Received healthy response to inference request in 2.4198100566864014s
Received healthy response to inference request in 2.3333911895751953s
Received healthy response to inference request in 2.3701024055480957s
30 requests
0 failed requests
5th percentile: 2.1893115520477293
10th percentile: 2.211977505683899
20th percentile: 2.230420541763306
30th percentile: 2.2412730932235716
40th percentile: 2.248362731933594
50th percentile: 2.314368486404419
60th percentile: 2.346928119659424
70th percentile: 2.375779891014099
80th percentile: 2.41853346824646
90th percentile: 2.4376636743545532
95th percentile: 2.6218629121780386
99th percentile: 3.9910718131065384
mean time: 2.3927399714787803
Pipeline stage StressChecker completed in 341.75s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.68s
Shutdown handler de-registered
qwen-qwen3-5-27b_v10 status is now deployed due to DeploymentManager action
qwen-qwen3-5-27b_v10 status is now inactive due to auto deactivation removed underperforming models
qwen-qwen3-5-27b_v10 status is now torndown due to DeploymentManager action