Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-merged-qwen-35-39140-v11-uploader
Waiting for job on chaiml-merged-qwen-35-39140-v11-uploader to finish
chaiml-merged-qwen-35-39140-v11-uploader: Using quantization_mode: none
chaiml-merged-qwen-35-39140-v11-uploader: Downloading snapshot of ChaiML/merged_qwen_35_dpo_lower_lr_v...
chaiml-merged-qwen-35-39140-v11-uploader: '(ReadTimeoutError("HTTPSConnectionPool(host='huggingface.co', port=443): Read timed out. (read timeout=10)"), '(Request ID: ce4405b6-614b-4fd2-8527-57af69d44631)')' thrown while requesting HEAD https://huggingface.co/ChaiML/merged_qwen_35_dpo_lower_lr_v/resolve/e08d48b1a6ae955fdbc436fb9d6d0fa8ee6f950c/.gitattributes
chaiml-merged-qwen-35-39140-v11-uploader: Retrying in 1s [Retry 1/5].
chaiml-merged-qwen-35-39140-v11-uploader: Downloaded in 30.773s
2026-03-27T20:44:20.508196+00:00 monitor updated for chaiml-merged-qwen-35-_39140_v11
chaiml-merged-qwen-35-39140-v11-uploader: Processed model ChaiML/merged_qwen_35_dpo_lower_lr_v in 50.824s
chaiml-merged-qwen-35-39140-v11-uploader: creating bucket guanaco-vllm-models
chaiml-merged-qwen-35-39140-v11-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-merged-qwen-35-39140-v11-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-merged-qwen-35-39140-v11-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-merged-qwen-35-39140-v11-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-merged-qwen-35-39140-v11-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-merged-qwen-35-39140-v11-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-merged-qwen-35-39140-v11-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-merged-qwen-35-39140-v11-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-merged-qwen-35-39140-v11-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-merged-qwen-35-39140-v11-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-merged-qwen-35-39140-v11-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-merged-qwen-35-39140-v11-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-merged-qwen-35-39140-v11-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-merged-qwen-35-39140-v11-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-merged-qwen-35-39140-v11-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-merged-qwen-35-39140-v11-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-merged-qwen-35-39140-v11-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-merged-qwen-35-39140-v11-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-merged-qwen-35-39140-v11/default
chaiml-merged-qwen-35-39140-v11-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-merged-qwen-35-39140-v11/default/.gitattributes
chaiml-merged-qwen-35-39140-v11-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-merged-qwen-35-39140-v11/default/model.safetensors.index.json
chaiml-merged-qwen-35-39140-v11-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-merged-qwen-35-39140-v11/default/tokenizer_config.json
chaiml-merged-qwen-35-39140-v11-uploader: cp /dev/shm/model_output/README.md s3://guanaco-vllm-models/chaiml-merged-qwen-35-39140-v11/default/README.md
chaiml-merged-qwen-35-39140-v11-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-merged-qwen-35-39140-v11/default/generation_config.json
chaiml-merged-qwen-35-39140-v11-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-merged-qwen-35-39140-v11/default/config.json
chaiml-merged-qwen-35-39140-v11-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-merged-qwen-35-39140-v11/default/tokenizer.json
chaiml-merged-qwen-35-39140-v11-uploader: cp /dev/shm/model_output/model-00002-of-00002.safetensors s3://guanaco-vllm-models/chaiml-merged-qwen-35-39140-v11/default/model-00002-of-00002.safetensors
2026-03-27T20:45:20.604632+00:00 monitor updated for chaiml-merged-qwen-35-_39140_v11
chaiml-merged-qwen-35-39140-v11-uploader: cp /dev/shm/model_output/model-00001-of-00002.safetensors s3://guanaco-vllm-models/chaiml-merged-qwen-35-39140-v11/default/model-00001-of-00002.safetensors
Job chaiml-merged-qwen-35-39140-v11-uploader completed after 143.54s with status: succeeded
Stopping job with name chaiml-merged-qwen-35-39140-v11-uploader
Pipeline stage VLLMUploader completed in 144.11s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.36s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-merged-qwen-35-39140-v11
Waiting for inference service chaiml-merged-qwen-35-39140-v11 to be ready
2026-03-27T20:46:20.703445+00:00 monitor updated for chaiml-merged-qwen-35-_39140_v11
2026-03-27T20:47:20.800149+00:00 monitor updated for chaiml-merged-qwen-35-_39140_v11
2026-03-27T20:48:20.906505+00:00 monitor updated for chaiml-merged-qwen-35-_39140_v11
Inference service chaiml-merged-qwen-35-39140-v11 ready after 170.45552778244019s
Pipeline stage VLLMDeployer completed in 170.97s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
2026-03-27T20:49:21.013826+00:00 monitor updated for chaiml-merged-qwen-35-_39140_v11
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 18.51877474784851s
Received healthy response to inference request in 3.749572515487671s
Received healthy response to inference request in 16.553508758544922s
Received healthy response to inference request in 3.6923627853393555s
2026-03-27T20:50:21.115879+00:00 monitor updated for chaiml-merged-qwen-35-_39140_v11
Received healthy response to inference request in 16.537419319152832s
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 4.014326572418213s
Received healthy response to inference request in 3.8672564029693604s
Received healthy response to inference request in 3.5460870265960693s
Received healthy response to inference request in 3.7526402473449707s
2026-03-27T20:51:21.218621+00:00 monitor updated for chaiml-merged-qwen-35-_39140_v11
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 4.071848392486572s
Received healthy response to inference request in 3.6256766319274902s
Received healthy response to inference request in 3.7811129093170166s
Received healthy response to inference request in 4.0780956745147705s
Received healthy response to inference request in 3.6997714042663574s
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 3.8900885581970215s
2026-03-27T20:52:21.314875+00:00 monitor updated for chaiml-merged-qwen-35-_39140_v11
Received healthy response to inference request in 10.28645396232605s
Received healthy response to inference request in 3.7528235912323s
Received healthy response to inference request in 3.845370054244995s
Received healthy response to inference request in 3.902775526046753s
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.kchai-google-us-east4.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 4.194890260696411s
Received healthy response to inference request in 3.7463018894195557s
Received healthy response to inference request in 3.731337070465088s
Received healthy response to inference request in 3.900181531906128s
30 requests
7 failed requests
5th percentile: 3.6556854009628297
10th percentile: 3.699030542373657
20th percentile: 3.748918390274048
30th percentile: 3.7726261138916017
40th percentile: 3.880955696105957
50th percentile: 3.958551049232483
60th percentile: 4.124813508987427
70th percentile: 16.54224615097046
80th percentile: 20.111246824264526
90th percentile: 20.116156148910523
95th percentile: 20.13130542039871
99th percentile: 20.139704761505126
mean time: 9.186380402247112
%s, retrying in %s seconds...
Received healthy response to inference request in 3.712005376815796s
2026-03-27T20:53:21.409477+00:00 monitor updated for chaiml-merged-qwen-35-_39140_v11
Received healthy response to inference request in 3.8540942668914795s
Received healthy response to inference request in 3.7484171390533447s
Received healthy response to inference request in 3.69704270362854s
Received healthy response to inference request in 10.19629192352295s
Received healthy response to inference request in 4.350048303604126s
Received healthy response to inference request in 3.393887758255005s
Received healthy response to inference request in 3.663313865661621s
Received healthy response to inference request in 3.7135255336761475s
Received healthy response to inference request in 3.4130783081054688s
Received healthy response to inference request in 3.8484692573547363s
Received healthy response to inference request in 3.467816114425659s
Received healthy response to inference request in 3.768045425415039s
Received healthy response to inference request in 3.7371275424957275s
Received healthy response to inference request in 3.475903272628784s
Received healthy response to inference request in 3.8706493377685547s
2026-03-27T20:54:21.511234+00:00 monitor updated for chaiml-merged-qwen-35-_39140_v11
Received healthy response to inference request in 3.8196053504943848s
Received healthy response to inference request in 3.667792320251465s
Received healthy response to inference request in 3.583244562149048s
Received healthy response to inference request in 3.5728836059570312s
Received healthy response to inference request in 3.880479574203491s
Received healthy response to inference request in 4.294769048690796s
Received healthy response to inference request in 3.9007644653320312s
Received healthy response to inference request in 3.5367379188537598s
Received healthy response to inference request in 3.868813991546631s
Received healthy response to inference request in 3.4494247436523438s
Received healthy response to inference request in 3.7355973720550537s
Received healthy response to inference request in 3.7257211208343506s
Received healthy response to inference request in 3.707503318786621s
Received healthy response to inference request in 3.8412857055664062s
30 requests
0 failed requests
5th percentile: 3.4294342041015624
10th percentile: 3.4659769773483275
20th percentile: 3.565654468536377
30th percentile: 3.666448783874512
40th percentile: 3.710204553604126
50th percentile: 3.730659246444702
60th percentile: 3.7562684535980226
70th percentile: 3.8434407711029053
80th percentile: 3.8691810607910155
90th percentile: 3.940164923667908
95th percentile: 4.325172638893127
99th percentile: 8.500881273746495
mean time: 3.949811307589213
Pipeline stage StressChecker completed in 398.71s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.68s
Shutdown handler de-registered
chaiml-merged-qwen-35-_39140_v11 status is now deployed due to DeploymentManager action
chaiml-merged-qwen-35-_39140_v11 status is now inactive due to auto deactivation removed underperforming models
chaiml-merged-qwen-35-_39140_v11 status is now torndown due to DeploymentManager action