Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-grpo-q235b-opusd-11130-v4-uploader
Waiting for job on chaiml-grpo-q235b-opusd-11130-v4-uploader to finish
chaiml-grpo-q235b-opusd-11130-v4-uploader: Using quantization_mode: w4a16
chaiml-grpo-q235b-opusd-11130-v4-uploader: Checking if ChaiML/grpo-q235b-opusd-v1-merged-chai-rm-step-700-W4A16 already exists in ChaiML
chaiml-grpo-q235b-opusd-11130-v4-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-grpo-q235b-opusd-11130-v4-uploader: Downloading snapshot of ChaiML/grpo-q235b-opusd-v1-merged-chai-rm-step-700-W4A16...
chaiml-grpo-q235b-opusd-11130-v4-uploader: Downloaded in 57.172s
chaiml-grpo-q235b-opusd-11130-v4-uploader: Processed model ChaiML/grpo-q235b-opusd-v1-merged-chai-rm-step-700 in 57.838s
chaiml-grpo-q235b-opusd-11130-v4-uploader: creating bucket guanaco-vllm-models
chaiml-grpo-q235b-opusd-11130-v4-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-opusd-11130-v4-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-grpo-q235b-opusd-11130-v4-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-grpo-q235b-opusd-11130-v4-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-grpo-q235b-opusd-11130-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-opusd-11130-v4-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-grpo-q235b-opusd-11130-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-opusd-11130-v4-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-grpo-q235b-opusd-11130-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-opusd-11130-v4-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-grpo-q235b-opusd-11130-v4-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-opusd-11130-v4-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-grpo-q235b-opusd-11130-v4-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-grpo-q235b-opusd-11130-v4-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-grpo-q235b-opusd-11130-v4-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-grpo-q235b-opusd-11130-v4-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-grpo-q235b-opusd-11130-v4-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-grpo-q235b-opusd-11130-v4-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/added_tokens.json
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/.gitattributes
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/quantization_config.json
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/config.json
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/chat_template.jinja
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/generation_config.json
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/tokenizer_config.json
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/merges.txt
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/special_tokens_map.json
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/vocab.json
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model.safetensors.index.json
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/tokenizer.json
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00015-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00017-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00007-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00020-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00014-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00001-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00003-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00025-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00021-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00005-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00023-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00006-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00016-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00011-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00004-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00022-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00002-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00024-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00009-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00010-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00019-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00012-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00013-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00026-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00008-of-00027.safetensors
chaiml-grpo-q235b-opusd-11130-v4-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-opusd-11130-v4/default/model-00018-of-00027.safetensors
Job chaiml-grpo-q235b-opusd-11130-v4-uploader completed after 137.83s with status: succeeded
Stopping job with name chaiml-grpo-q235b-opusd-11130-v4-uploader
Pipeline stage VLLMUploader completed in 138.32s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.30s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-grpo-q235b-opusd-11130-v4
Waiting for inference service chaiml-grpo-q235b-opusd-11130-v4 to be ready
Inference service chaiml-grpo-q235b-opusd-11130-v4 ready after 383.37205839157104s
Pipeline stage VLLMDeployer completed in 383.87s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.3473293781280518s
Received healthy response to inference request in 1.6976571083068848s
Received healthy response to inference request in 1.8987700939178467s
Received healthy response to inference request in 2.144519567489624s
Received healthy response to inference request in 1.9392459392547607s
Received healthy response to inference request in 2.0872673988342285s
Received healthy response to inference request in 1.7947287559509277s
Received healthy response to inference request in 1.8147974014282227s
Received healthy response to inference request in 1.7859926223754883s
Received healthy response to inference request in 2.0846259593963623s
Received healthy response to inference request in 2.0811007022857666s
Received healthy response to inference request in 1.9632012844085693s
Received healthy response to inference request in 2.028029441833496s
Received healthy response to inference request in 1.9461135864257812s
Received healthy response to inference request in 1.771536111831665s
Received healthy response to inference request in 2.255053997039795s
Received healthy response to inference request in 1.7091057300567627s
Received healthy response to inference request in 1.7729671001434326s
Received healthy response to inference request in 2.011808156967163s
Received healthy response to inference request in 2.011730194091797s
Received healthy response to inference request in 2.3034043312072754s
Received healthy response to inference request in 1.8517849445343018s
Received healthy response to inference request in 1.9121747016906738s
Received healthy response to inference request in 2.065009355545044s
Received healthy response to inference request in 1.960684061050415s
Received healthy response to inference request in 2.078267812728882s
Received healthy response to inference request in 1.9111733436584473s
Received healthy response to inference request in 2.2356626987457275s
Received healthy response to inference request in 1.967851161956787s
Received healthy response to inference request in 2.121666431427002s
30 requests
0 failed requests
5th percentile: 1.7371994018554688
10th percentile: 1.7728240013122558
20th percentile: 1.8107836723327637
30th percentile: 1.907452368736267
40th percentile: 1.943366527557373
50th percentile: 1.9655262231826782
60th percentile: 2.0182966709136965
70th percentile: 2.079117679595947
80th percentile: 2.094147205352783
90th percentile: 2.2376018285751345
95th percentile: 2.281646680831909
99th percentile: 2.3345911145210265
mean time: 1.9851086457570395
Pipeline stage StressChecker completed in 63.25s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.74s
Shutdown handler de-registered
chaiml-grpo-q235b-opusd_11130_v4 status is now deployed due to DeploymentManager action
chaiml-grpo-q235b-opusd_11130_v4 status is now inactive due to system request
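
Note on the StressChecker summary: the logged percentiles and mean agree with linear-interpolation percentiles computed over the 30 logged response times. This is an inference from the numbers alone; the log does not show how StressChecker computes them. The sketch below is a minimal reproduction under that assumption. The names `LATENCIES`, `percentile`, and `summarize` are hypothetical and do not come from the pipeline.

```python
import math

# Response times in seconds, copied in log order from the 30
# "Received healthy response to inference request" lines.
LATENCIES = [
    2.3473293781280518, 1.6976571083068848, 1.8987700939178467,
    2.144519567489624, 1.9392459392547607, 2.0872673988342285,
    1.7947287559509277, 1.8147974014282227, 1.7859926223754883,
    2.0846259593963623, 2.0811007022857666, 1.9632012844085693,
    2.028029441833496, 1.9461135864257812, 1.771536111831665,
    2.255053997039795, 1.7091057300567627, 1.7729671001434326,
    2.011808156967163, 2.011730194091797, 2.3034043312072754,
    1.8517849445343018, 1.9121747016906738, 2.065009355545044,
    1.960684061050415, 2.078267812728882, 1.9111733436584473,
    2.2356626987457275, 1.967851161956787, 2.121666431427002,
]


def percentile(values, q):
    """Return the q-th percentile (0 <= q <= 100) of values.

    Assumption: linear interpolation between the two closest ranks,
    which is what the logged summary appears to use.
    """
    if not values:
        raise ValueError("percentile() needs at least one value")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into ordered
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def summarize(values, quantiles=(5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99)):
    """Return (percentile dict, mean) for a list of response times."""
    table = {q: percentile(values, q) for q in quantiles}
    mean = sum(values) / len(values)
    return table, mean


if __name__ == "__main__":
    table, mean = summarize(LATENCIES)
    print(f"{len(LATENCIES)} requests")
    for q, value in table.items():
        print(f"{q}th percentile: {value}")
    print(f"mean time: {mean}")
```

The printed values can be compared against the "percentile" and "mean time" lines in the log; small differences in the last digits are possible because of floating-point rounding.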