Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-mega-v2-top2-g47-13273-v4-uploader
Waiting for job on chaiml-mega-v2-top2-g47-13273-v4-uploader to finish
chaiml-mega-v2-top2-g47-13273-v4-uploader: Using quantization_mode: w4a16
chaiml-mega-v2-top2-g47-13273-v4-uploader: Checking if ChaiML/mega-v2-top2-g47-lr1e5ep2r64b32-W4A16 already exists in ChaiML
chaiml-mega-v2-top2-g47-13273-v4-uploader: Model already exists. Downloading to /tmp/model_output...
chaiml-mega-v2-top2-g47-13273-v4-uploader: Downloading snapshot of ChaiML/mega-v2-top2-g47-lr1e5ep2r64b32-W4A16...
2026-04-06T15:09:10.316121+00:00 monitor updated for chaiml-mega-v2-top2-g47_13273_v4
chaiml-mega-v2-top2-g47-13273-v4-uploader: Downloaded in 62.079s
chaiml-mega-v2-top2-g47-13273-v4-uploader: Processed model ChaiML/mega-v2-top2-g47-lr1e5ep2r64b32 in 64.693s
chaiml-mega-v2-top2-g47-13273-v4-uploader: creating bucket guanaco-vllm-models
chaiml-mega-v2-top2-g47-13273-v4-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-mega-v2-top2-g47-13273-v4-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-mega-v2-top2-g47-13273-v4-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-mega-v2-top2-g47-13273-v4-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-mega-v2-top2-g47-13273-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-mega-v2-top2-g47-13273-v4-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-mega-v2-top2-g47-13273-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-mega-v2-top2-g47-13273-v4-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-mega-v2-top2-g47-13273-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-mega-v2-top2-g47-13273-v4-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-mega-v2-top2-g47-13273-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-mega-v2-top2-g47-13273-v4-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-mega-v2-top2-g47-13273-v4-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-mega-v2-top2-g47-13273-v4-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-mega-v2-top2-g47-13273-v4-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-mega-v2-top2-g47-13273-v4-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-mega-v2-top2-g47-13273-v4-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-mega-v2-top2-g47-13273-v4-uploader: uploading /tmp/model_output to s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default
2026-04-06T15:10:10.406751+00:00 monitor updated for chaiml-mega-v2-top2-g47_13273_v4
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/generation_config.json
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/.gitattributes
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/config.json s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/config.json
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/chat_template.jinja
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/tokenizer.json
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/tokenizer_config.json
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/quantization_config.json
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model.safetensors.index.json
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00038-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00038-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00036-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00036-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00037-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00037-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00029-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00029-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00022-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00022-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00013-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00013-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00005-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00005-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00032-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00032-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00017-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00017-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00031-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00031-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00033-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00033-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00026-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00026-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00016-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00016-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00006-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00006-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00012-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00012-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00025-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00025-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00003-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00003-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00023-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00023-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00021-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00021-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00014-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00014-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00015-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00015-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00030-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00030-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00028-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00028-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00010-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00010-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00009-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00009-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00019-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00019-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00007-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00007-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00001-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00001-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00002-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00002-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00035-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00035-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00027-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00027-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00020-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00020-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00008-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00008-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00034-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00034-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00018-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00018-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00011-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00011-of-00038.safetensors
chaiml-mega-v2-top2-g47-13273-v4-uploader: cp /tmp/model_output/model-00024-of-00038.safetensors s3://guanaco-vllm-models/chaiml-mega-v2-top2-g47-13273-v4/default/model-00024-of-00038.safetensors
2026-04-06T15:11:10.504195+00:00 monitor updated for chaiml-mega-v2-top2-g47_13273_v4
Job chaiml-mega-v2-top2-g47-13273-v4-uploader completed after 186.48s with status: succeeded
Stopping job with name chaiml-mega-v2-top2-g47-13273-v4-uploader
Pipeline stage VLLMUploader completed in 187.15s
run pipeline stage %s
Running pipeline stage VLLMUploaderAMD
Pipeline stage vllm_upload_amd skipped, reason=not amd cluster
Pipeline stage VLLMUploaderAMD completed in 0.17s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 4.89s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-mega-v2-top2-g47-13273-v4
Waiting for inference service chaiml-mega-v2-top2-g47-13273-v4 to be ready
2026-04-06T15:12:10.602272+00:00 monitor updated for chaiml-mega-v2-top2-g47_13273_v4
2026-04-06T15:13:10.706030+00:00 monitor updated for chaiml-mega-v2-top2-g47_13273_v4
2026-04-06T15:14:10.798353+00:00 monitor updated for chaiml-mega-v2-top2-g47_13273_v4
Failed to get request counts for guanaco-submitter. Falling back to default
2026-04-06T15:15:10.945206+00:00 monitor updated for chaiml-mega-v2-top2-g47_13273_v4
Inference service chaiml-mega-v2-top2-g47-13273-v4 ready after 252.7262635231018s
Pipeline stage VLLMDeployer completed in 253.30s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 9.072526454925537s
Received healthy response to inference request in 2.375797748565674s
Received healthy response to inference request in 9.34496784210205s
Received healthy response to inference request in 2.0793285369873047s
Received healthy response to inference request in 2.5953803062438965s
Received healthy response to inference request in 2.0726234912872314s
Received healthy response to inference request in 2.191389799118042s
2026-04-06T15:16:11.038177+00:00 monitor updated for chaiml-mega-v2-top2-g47_13273_v4
Received healthy response to inference request in 8.69728398323059s
Received healthy response to inference request in 2.2252087593078613s
Received healthy response to inference request in 2.6504459381103516s
Received healthy response to inference request in 3.9141879081726074s
Received healthy response to inference request in 2.1399717330932617s
Received healthy response to inference request in 2.2426528930664062s
Received healthy response to inference request in 8.779966831207275s
Received healthy response to inference request in 2.3141391277313232s
Received healthy response to inference request in 9.27243185043335s
Received healthy response to inference request in 2.2156894207000732s
Received healthy response to inference request in 1.965911865234375s
Received healthy response to inference request in 2.0851826667785645s
Received healthy response to inference request in 2.2977213859558105s
Received healthy response to inference request in 2.0970075130462646s
Received healthy response to inference request in 2.0897133350372314s
Received healthy response to inference request in 2.2679238319396973s
Received healthy response to inference request in 2.3140909671783447s
Received healthy response to inference request in 2.0321645736694336s
2026-04-06T15:17:11.132575+00:00 monitor updated for chaiml-mega-v2-top2-g47_13273_v4
Received healthy response to inference request in 2.3019652366638184s
Received healthy response to inference request in 2.2013871669769287s
Received healthy response to inference request in 2.293001651763916s
Received healthy response to inference request in 2.1242847442626953s
Received healthy response to inference request in 2.152660369873047s
30 requests
0 failed requests
5th percentile: 2.0503710865974427
10th percentile: 2.078658032417297
20th percentile: 2.095548677444458
30th percentile: 2.1488537788391113
40th percentile: 2.2099685192108156
50th percentile: 2.2552883625030518
60th percentile: 2.2994189262390137
70th percentile: 2.3326367139816284
80th percentile: 2.903194332122806
90th percentile: 8.809222793579101
95th percentile: 9.182474422454833
99th percentile: 9.323932404518127
mean time: 3.4135669310887655
Pipeline stage StressChecker completed in 105.72s
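
Note on the StressChecker summary above: the log does not show how the percentiles are computed, so the following is only a sketch. It assumes linear interpolation between order statistics (the default behaviour of NumPy's percentile function), and the names `percentile`, `summarize`, and `latencies` are hypothetical, introduced here for illustration. Under that assumption, the 30 latencies logged above appear to reproduce the reported 50th percentile and mean.

```python
import math


def percentile(samples, q):
    """Return the q-th percentile (0 <= q <= 100) using linear interpolation.

    Assumption: this mirrors the interpolation the logged summary seems to
    use; the StressChecker's real implementation is not shown in the log.
    """
    if not samples:
        raise ValueError("samples must be non-empty")
    ordered = sorted(samples)
    # Fractional position of the requested percentile in the sorted list.
    rank = (q / 100) * (len(ordered) - 1)
    lower = math.floor(rank)
    upper = math.ceil(rank)
    # Interpolate between the two neighbouring order statistics.
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def summarize(samples, qs=(5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99)):
    """Build a dict with the request count, percentiles, and mean latency."""
    summary = {"requests": len(samples)}
    for q in qs:
        summary[f"p{q}"] = percentile(samples, q)
    summary["mean"] = sum(samples) / len(samples)
    return summary


# The 30 response times (seconds) copied from the log lines above.
latencies = [
    9.072526454925537, 2.375797748565674, 9.34496784210205,
    2.0793285369873047, 2.5953803062438965, 2.0726234912872314,
    2.191389799118042, 8.69728398323059, 2.2252087593078613,
    2.6504459381103516, 3.9141879081726074, 2.1399717330932617,
    2.2426528930664062, 8.779966831207275, 2.3141391277313232,
    9.27243185043335, 2.2156894207000732, 1.965911865234375,
    2.0851826667785645, 2.2977213859558105, 2.0970075130462646,
    2.0897133350372314, 2.2679238319396973, 2.3140909671783447,
    2.0321645736694336, 2.3019652366638184, 2.2013871669769287,
    2.293001651763916, 2.1242847442626953, 2.152660369873047,
]

if __name__ == "__main__":
    for key, value in summarize(latencies).items():
        print(f"{key}: {value}")
```

The gap between the 80th and 90th percentiles (about 2.9 s versus 8.8 s) comes from the five requests that took between 8.7 s and 9.3 s; they also pull the mean (about 3.41 s) well above the median (about 2.26 s).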
Shutdown handler de-registered
chaiml-mega-v2-top2-g47_13273_v4 status is now deployed due to DeploymentManager action
chaiml-mega-v2-top2-g47_13273_v4 status is now inactive due to auto deactivation removed underperforming models
chaiml-mega-v2-top2-g47_13273_v4 status is now torndown due to DeploymentManager action