Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-q235b-opus-v3-ju-23012-v2-uploader
Waiting for job on chaiml-q235b-opus-v3-ju-23012-v2-uploader to finish
chaiml-q235b-opus-v3-ju-23012-v2-uploader: Using quantization_mode: w4a16
chaiml-q235b-opus-v3-ju-23012-v2-uploader: Checking if ChaiML/q235b_opus_v3_judging-step396-merged-W4A16 already exists in ChaiML
chaiml-q235b-opus-v3-ju-23012-v2-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-q235b-opus-v3-ju-23012-v2-uploader: Downloading snapshot of ChaiML/q235b_opus_v3_judging-step396-merged-W4A16...
2026-04-03T01:43:23.110220+00:00 monitor updated for chaiml-q235b-opus-v3-ju_23012_v2
chaiml-q235b-opus-v3-ju-23012-v2-uploader: Downloaded in 50.312s
chaiml-q235b-opus-v3-ju-23012-v2-uploader: Processed model ChaiML/q235b_opus_v3_judging-step396-merged in 53.096s
chaiml-q235b-opus-v3-ju-23012-v2-uploader: creating bucket guanaco-vllm-models
chaiml-q235b-opus-v3-ju-23012-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-q235b-opus-v3-ju-23012-v2-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-q235b-opus-v3-ju-23012-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-q235b-opus-v3-ju-23012-v2-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-q235b-opus-v3-ju-23012-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-q235b-opus-v3-ju-23012-v2-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-q235b-opus-v3-ju-23012-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-q235b-opus-v3-ju-23012-v2-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-q235b-opus-v3-ju-23012-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-q235b-opus-v3-ju-23012-v2-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-q235b-opus-v3-ju-23012-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-q235b-opus-v3-ju-23012-v2-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-q235b-opus-v3-ju-23012-v2-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-q235b-opus-v3-ju-23012-v2-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-q235b-opus-v3-ju-23012-v2-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-q235b-opus-v3-ju-23012-v2-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-q235b-opus-v3-ju-23012-v2-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-q235b-opus-v3-ju-23012-v2-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/quantization_config.json
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/tokenizer_config.json
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/config.json
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/chat_template.jinja
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/generation_config.json
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/.gitattributes
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/tokenizer.json
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model.safetensors.index.json
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00025-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00025-of-00025.safetensors
2026-04-03T01:44:23.191024+00:00 monitor updated for chaiml-q235b-opus-v3-ju_23012_v2
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00009-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00009-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00018-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00018-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00001-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00001-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00004-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00004-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00005-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00005-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00015-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00015-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00010-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00010-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00013-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00013-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00006-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00006-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00016-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00016-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00021-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00021-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00023-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00023-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00024-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00024-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00012-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00012-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00014-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00014-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00019-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00019-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00020-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00020-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00017-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00017-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00002-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00002-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00011-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00011-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00022-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00022-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00007-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00007-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00008-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00008-of-00025.safetensors
chaiml-q235b-opus-v3-ju-23012-v2-uploader: cp /dev/shm/model_output/model-00003-of-00025.safetensors s3://guanaco-vllm-models/chaiml-q235b-opus-v3-ju-23012-v2/default/model-00003-of-00025.safetensors
Job chaiml-q235b-opus-v3-ju-23012-v2-uploader completed after 134.2s with status: succeeded
Stopping job with name chaiml-q235b-opus-v3-ju-23012-v2-uploader
Pipeline stage VLLMUploader completed in 134.65s
run pipeline stage %s
Running pipeline stage VLLMUploaderAMD
Pipeline stage vllm_upload_amd skipped, reason=not amd cluster
Pipeline stage VLLMUploaderAMD completed in 0.10s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 2.06s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-q235b-opus-v3-ju-23012-v2
Waiting for inference service chaiml-q235b-opus-v3-ju-23012-v2 to be ready
2026-04-03T01:45:23.272744+00:00 monitor updated for chaiml-q235b-opus-v3-ju_23012_v2
2026-04-03T01:46:23.362393+00:00 monitor updated for chaiml-q235b-opus-v3-ju_23012_v2
2026-04-03T01:47:23.481399+00:00 monitor updated for chaiml-q235b-opus-v3-ju_23012_v2
Inference service chaiml-q235b-opus-v3-ju-23012-v2 ready after 211.27684473991394s
Pipeline stage VLLMDeployer completed in 211.84s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.629730463027954s
Received healthy response to inference request in 1.9760076999664307s
Received healthy response to inference request in 1.5214519500732422s
Received healthy response to inference request in 1.598024606704712s
2026-04-03T01:48:23.584614+00:00 monitor updated for chaiml-q235b-opus-v3-ju_23012_v2
Received healthy response to inference request in 3.4468793869018555s
Received healthy response to inference request in 3.5666310787200928s
Received healthy response to inference request in 3.5625197887420654s
Received healthy response to inference request in 1.5806784629821777s
Received healthy response to inference request in 1.4538507461547852s
Received healthy response to inference request in 1.5309593677520752s
Received healthy response to inference request in 1.9745564460754395s
Received healthy response to inference request in 1.464707612991333s
Received healthy response to inference request in 3.9729554653167725s
Received healthy response to inference request in 1.696159839630127s
Received healthy response to inference request in 1.752096176147461s
Received healthy response to inference request in 1.7854564189910889s
Received healthy response to inference request in 1.5718047618865967s
Received healthy response to inference request in 1.4494209289550781s
Received healthy response to inference request in 1.6919264793395996s
Received healthy response to inference request in 1.4625780582427979s
Received healthy response to inference request in 1.5914170742034912s
Received healthy response to inference request in 1.8875102996826172s
Received healthy response to inference request in 1.470552921295166s
Received healthy response to inference request in 1.6200392246246338s
Received healthy response to inference request in 1.6600921154022217s
Received healthy response to inference request in 1.467775583267212s
Received healthy response to inference request in 1.5566413402557373s
Received healthy response to inference request in 1.5089011192321777s
Received healthy response to inference request in 2.0597195625305176s
Received healthy response to inference request in 1.619121789932251s
30 requests
0 failed requests
5th percentile: 1.4577780365943909
10th percentile: 1.4644946575164794
20th percentile: 1.5012314796447754
30th percentile: 1.5489367485046386
40th percentile: 1.5871216297149657
50th percentile: 1.6195805072784424
60th percentile: 1.6936198234558106
70th percentile: 1.8160725831985471
80th percentile: 1.9927500724792482
90th percentile: 3.5629309177398683
95th percentile: 3.601335740089416
99th percentile: 3.8734202146530152
mean time: 1.9710055589675903
Pipeline stage StressChecker completed in 62.24s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.55s
Shutdown handler de-registered
chaiml-q235b-opus-v3-ju_23012_v2 status is now deployed due to DeploymentManager action
chaiml-q235b-opus-v3-ju_23012_v2 status is now inactive due to auto deactivation removed underperforming models