Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-grpo-q235b-kimid-76483-v2-uploader
Waiting for job on chaiml-grpo-q235b-kimid-76483-v2-uploader to finish
HTTP Request: %s %s "%s %d %s"
chaiml-grpo-q235b-kimid-76483-v2-uploader: Using quantization_mode: w4a16
chaiml-grpo-q235b-kimid-76483-v2-uploader: Checking if ChaiML/grpo-q235b-kimid-v5d-merged-chai-rm-high-kl-averaged-loras-W4A16 already exists in ChaiML
chaiml-grpo-q235b-kimid-76483-v2-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-grpo-q235b-kimid-76483-v2-uploader: Downloading snapshot of ChaiML/grpo-q235b-kimid-v5d-merged-chai-rm-high-kl-averaged-loras-W4A16...
Retrying (%r) after connection broken by '%r': %s
chaiml-grpo-q235b-kimid-76483-v2-uploader: Downloaded in 56.360s
chaiml-grpo-q235b-kimid-76483-v2-uploader: Processed model ChaiML/grpo-q235b-kimid-v5d-merged-chai-rm-high-kl-averaged-loras in 56.924s
chaiml-grpo-q235b-kimid-76483-v2-uploader: creating bucket guanaco-vllm-models
chaiml-grpo-q235b-kimid-76483-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-76483-v2-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-grpo-q235b-kimid-76483-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-grpo-q235b-kimid-76483-v2-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-grpo-q235b-kimid-76483-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-76483-v2-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-grpo-q235b-kimid-76483-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-76483-v2-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-grpo-q235b-kimid-76483-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-76483-v2-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-grpo-q235b-kimid-76483-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-grpo-q235b-kimid-76483-v2-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-grpo-q235b-kimid-76483-v2-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-grpo-q235b-kimid-76483-v2-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-grpo-q235b-kimid-76483-v2-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-grpo-q235b-kimid-76483-v2-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-grpo-q235b-kimid-76483-v2-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-grpo-q235b-kimid-76483-v2-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/added_tokens.json
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/chat_template.jinja
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/merges.txt
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/generation_config.json
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/tokenizer_config.json
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/quantization_config.json
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/.gitattributes
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/config.json
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/special_tokens_map.json
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/vocab.json
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model.safetensors.index.json
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/tokenizer.json
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00027-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00015-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00019-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00021-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00010-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00024-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00026-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00008-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00012-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00005-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00022-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00016-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00020-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00025-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00001-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00003-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00017-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00018-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00007-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00004-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00013-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00009-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00011-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00002-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00014-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00006-of-00027.safetensors
chaiml-grpo-q235b-kimid-76483-v2-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-grpo-q235b-kimid-76483-v2/default/model-00023-of-00027.safetensors
Job chaiml-grpo-q235b-kimid-76483-v2-uploader completed after 166.75s with status: succeeded
Stopping job with name chaiml-grpo-q235b-kimid-76483-v2-uploader
Pipeline stage VLLMUploader completed in 167.31s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-grpo-q235b-kimid-76483-v2
Waiting for inference service chaiml-grpo-q235b-kimid-76483-v2 to be ready
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Inference service chaiml-grpo-q235b-kimid-76483-v2 ready after 412.63961601257324s
Pipeline stage VLLMDeployer completed in 413.17s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.1413774490356445s
Received healthy response to inference request in 2.074014902114868s
Received healthy response to inference request in 2.1654298305511475s
Received healthy response to inference request in 1.936931848526001s
Received healthy response to inference request in 1.870394229888916s
Received healthy response to inference request in 2.155477523803711s
Received healthy response to inference request in 1.9841134548187256s
Received healthy response to inference request in 2.0362038612365723s
Received healthy response to inference request in 1.9224917888641357s
Received healthy response to inference request in 1.8436651229858398s
Received healthy response to inference request in 2.0010576248168945s
Received healthy response to inference request in 1.8730969429016113s
Received healthy response to inference request in 1.8332891464233398s
Received healthy response to inference request in 1.7851848602294922s
Received healthy response to inference request in 1.8811674118041992s
Received healthy response to inference request in 1.934821367263794s
Received healthy response to inference request in 1.9045000076293945s
Received healthy response to inference request in 2.0559818744659424s
Received healthy response to inference request in 1.7932264804840088s
Received healthy response to inference request in 1.9024872779846191s
Received healthy response to inference request in 1.923879861831665s
Received healthy response to inference request in 1.8851985931396484s
Received healthy response to inference request in 2.157442569732666s
Received healthy response to inference request in 1.9488680362701416s
Received healthy response to inference request in 2.0186452865600586s
Received healthy response to inference request in 2.074575662612915s
Received healthy response to inference request in 5.117866277694702s
Received healthy response to inference request in 1.9160783290863037s
Received healthy response to inference request in 1.9070982933044434s
Received healthy response to inference request in 1.978783369064331s
30 requests
0 failed requests
5th percentile: 1.8112546801567078
10th percentile: 1.8426275253295898
20th percentile: 1.8795533180236816
30th percentile: 1.903896188735962
40th percentile: 1.919926404953003
50th percentile: 1.9358766078948975
60th percentile: 1.980915403366089
70th percentile: 2.0239128589630124
80th percentile: 2.0741270542144776
90th percentile: 2.1556740283966063
95th percentile: 2.161835563182831
99th percentile: 4.261659708023074
mean time: 2.0674449761708575
Pipeline stage StressChecker completed in 81.57s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.68s
Shutdown handler de-registered
chaiml-grpo-q235b-kimid_76483_v2 status is now deployed due to DeploymentManager action
chaiml-grpo-q235b-kimid_76483_v2 status is now inactive due to auto deactivation removed underperforming models
chaiml-grpo-q235b-kimid_76483_v2 status is now torndown due to DeploymentManager action