Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-grpo-q235b-kimid-17084-v2-uploader
Waiting for job on chaiml-grpo-q235b-kimid-17084-v2-uploader to finish
chaiml-grpo-q235b-kimid-17084-v2-uploader: Checking if ChaiML/grpo-q235b-kimid-v5d-merged-chai-rm-low-kl-averaged-loras-W4A16 already exists in ChaiML
chaiml-grpo-q235b-kimid-17084-v2-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-grpo-q235b-kimid-17084-v2-uploader: Downloading snapshot of ChaiML/grpo-q235b-kimid-v5d-merged-chai-rm-low-kl-averaged-loras-W4A16...
chaiml-grpo-q235b-kimid-17084-v2-uploader: Downloaded in 50.746s
chaiml-grpo-q235b-kimid-17084-v2-uploader: Processed model ChaiML/grpo-q235b-kimid-v5d-merged-chai-rm-low-kl-averaged-loras in 51.480s
chaiml-grpo-q235b-kimid-17084-v2-uploader: creating bucket guanaco-vllm-models
chaiml-grpo-q235b-kimid-17084-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-17084-v2-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-grpo-q235b-kimid-17084-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-grpo-q235b-kimid-17084-v2-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-grpo-q235b-kimid-17084-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-17084-v2-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-grpo-q235b-kimid-17084-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-17084-v2-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-grpo-q235b-kimid-17084-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-17084-v2-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-grpo-q235b-kimid-17084-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-17084-v2-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-grpo-q235b-kimid-17084-v2-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-grpo-q235b-kimid-17084-v2-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-grpo-q235b-kimid-17084-v2-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-grpo-q235b-kimid-17084-v2-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-grpo-q235b-kimid-17084-v2-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-grpo-q235b-kimid-17084-v2-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/.gitattributes
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/generation_config.json
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/config.json
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/added_tokens.json
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/chat_template.jinja
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/quantization_config.json
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/special_tokens_map.json
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/tokenizer_config.json
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/vocab.json
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/merges.txt
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model.safetensors.index.json
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/tokenizer.json
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00020-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00009-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00012-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00022-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00015-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00023-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00024-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00011-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00014-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00021-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00007-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00026-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00010-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00004-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00005-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00016-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00006-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00003-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00025-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00019-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00018-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00002-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00017-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00001-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00013-of-00027.safetensors
chaiml-grpo-q235b-kimid-17084-v2-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-17084-v2/default/model-00008-of-00027.safetensors
Job chaiml-grpo-q235b-kimid-17084-v2-uploader completed after 147.06s with status: succeeded
Stopping job with name chaiml-grpo-q235b-kimid-17084-v2-uploader
Pipeline stage VLLMUploader completed in 147.73s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.19s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-grpo-q235b-kimid-17084-v2
Waiting for inference service chaiml-grpo-q235b-kimid-17084-v2 to be ready
Inference service chaiml-grpo-q235b-kimid-17084-v2 ready after 382.29934573173523s
Pipeline stage VLLMDeployer completed in 382.99s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2095773220062256s
Received healthy response to inference request in 2.0149736404418945s
Received healthy response to inference request in 2.023695468902588s
Received healthy response to inference request in 1.9598164558410645s
Received healthy response to inference request in 2.0644114017486572s
Received healthy response to inference request in 2.0983846187591553s
Received healthy response to inference request in 1.982222080230713s
Received healthy response to inference request in 2.003964900970459s
Received healthy response to inference request in 2.0024399757385254s
Received healthy response to inference request in 1.9839105606079102s
Received healthy response to inference request in 2.133685350418091s
Received healthy response to inference request in 2.0081119537353516s
Received healthy response to inference request in 1.9714295864105225s
Received healthy response to inference request in 2.0280864238739014s
Received healthy response to inference request in 1.8748042583465576s
Received healthy response to inference request in 2.270181894302368s
Received healthy response to inference request in 1.971876621246338s
Received healthy response to inference request in 2.074453592300415s
Received healthy response to inference request in 2.057046890258789s
Received healthy response to inference request in 2.0802412033081055s
Received healthy response to inference request in 2.0243067741394043s
Received healthy response to inference request in 1.9555139541625977s
Received healthy response to inference request in 2.208984851837158s
Received healthy response to inference request in 2.085401773452759s
Received healthy response to inference request in 2.0371766090393066s
Received healthy response to inference request in 2.001600742340088s
Received healthy response to inference request in 2.0065114498138428s
Received healthy response to inference request in 1.9354608058929443s
Received healthy response to inference request in 2.0069966316223145s
Received healthy response to inference request in 2.029250144958496s
30 requests
0 failed requests
5th percentile: 1.9444847226142883
10th percentile: 1.9593862056732179
20th percentile: 1.9801529884338378
30th percentile: 2.002188205718994
40th percentile: 2.006802558898926
50th percentile: 2.019334554672241
60th percentile: 2.0285519123077393
70th percentile: 2.0592562437057493
80th percentile: 2.081273317337036
90th percentile: 2.1412153005599976
95th percentile: 2.2093107104301453
99th percentile: 2.252606568336487
mean time: 2.036817264556885
Pipeline stage StressChecker completed in 64.42s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.70s
Shutdown handler de-registered
chaiml-grpo-q235b-kimid_17084_v2 status is now deployed due to DeploymentManager action
chaiml-grpo-q235b-kimid_17084_v2 status is now inactive due to auto deactivation removed underperforming models
chaiml-grpo-q235b-kimid_17084_v2 status is now torndown due to DeploymentManager action