Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-reward-dpo-1c36-72316-v2-uploader
Waiting for job on chaiml-reward-dpo-1c36-72316-v2-uploader to finish
chaiml-reward-dpo-1c36-72316-v2-uploader: Using quantization_mode: w4a16
chaiml-reward-dpo-1c36-72316-v2-uploader: Repo ChaiML/reward-dpo-1c36-chaiml-235b-sft-new-rm-_42002_v1-W4A16 already ends in W4A16. Skipping...
chaiml-reward-dpo-1c36-72316-v2-uploader: Checking if ChaiML/reward-dpo-1c36-chaiml-235b-sft-new-rm-_42002_v1-W4A16 already exists in ChaiML
chaiml-reward-dpo-1c36-72316-v2-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-reward-dpo-1c36-72316-v2-uploader: Downloading snapshot of ChaiML/reward-dpo-1c36-chaiml-235b-sft-new-rm-_42002_v1-W4A16...
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-reward-dpo-1c36-72316-v2-uploader: Downloaded in 67.305s
chaiml-reward-dpo-1c36-72316-v2-uploader: Processed model ChaiML/reward-dpo-1c36-chaiml-235b-sft-new-rm-_42002_v1-W4A16 in 67.853s
chaiml-reward-dpo-1c36-72316-v2-uploader: creating bucket guanaco-vllm-models
chaiml-reward-dpo-1c36-72316-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-reward-dpo-1c36-72316-v2-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-reward-dpo-1c36-72316-v2-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-reward-dpo-1c36-72316-v2-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-reward-dpo-1c36-72316-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-reward-dpo-1c36-72316-v2-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-reward-dpo-1c36-72316-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-reward-dpo-1c36-72316-v2-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-reward-dpo-1c36-72316-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-reward-dpo-1c36-72316-v2-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-reward-dpo-1c36-72316-v2-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-reward-dpo-1c36-72316-v2-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-reward-dpo-1c36-72316-v2-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-reward-dpo-1c36-72316-v2-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-reward-dpo-1c36-72316-v2-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-reward-dpo-1c36-72316-v2-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-reward-dpo-1c36-72316-v2-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-reward-dpo-1c36-72316-v2-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/added_tokens.json s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/added_tokens.json
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/config.json
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/generation_config.json
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/quantization_config.json s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/quantization_config.json
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/tokenizer_config.json
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/merges.txt s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/merges.txt
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/special_tokens_map.json
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/chat_template.jinja
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/.gitattributes
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/vocab.json s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/vocab.json
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model.safetensors.index.json
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/tokenizer.json
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00027-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00027-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00014-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00014-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00019-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00019-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00017-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00017-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00020-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00020-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00004-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00004-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00018-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00018-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00001-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00001-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00016-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00016-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00008-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00008-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00025-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00025-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00021-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00021-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00003-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00003-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00015-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00015-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00007-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00007-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00026-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00026-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00022-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00022-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00005-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00005-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00002-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00002-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00023-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00023-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00011-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00011-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00024-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00024-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00013-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00013-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00010-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00010-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00009-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00009-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00012-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00012-of-00027.safetensors
chaiml-reward-dpo-1c36-72316-v2-uploader: cp /dev/shm/model_output/model-00006-of-00027.safetensors s3://guanaco-vllm-models/chaiml-reward-dpo-1c36-72316-v2/default/model-00006-of-00027.safetensors
Job chaiml-reward-dpo-1c36-72316-v2-uploader completed after 257.12s with status: succeeded
Stopping job with name chaiml-reward-dpo-1c36-72316-v2-uploader
Pipeline stage VLLMUploader completed in 257.58s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-reward-dpo-1c36-72316-v2
Waiting for inference service chaiml-reward-dpo-1c36-72316-v2 to be ready
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Inference service chaiml-reward-dpo-1c36-72316-v2 ready after 523.02401471138s
Pipeline stage VLLMDeployer completed in 523.48s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.057708501815796s
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Received healthy response to inference request in 2.0968785285949707s
Received healthy response to inference request in 2.066621780395508s
Received healthy response to inference request in 2.390683889389038s
Received healthy response to inference request in 1.984323501586914s
Received healthy response to inference request in 1.9591388702392578s
Received healthy response to inference request in 1.9197914600372314s
Received healthy response to inference request in 1.9638879299163818s
Received healthy response to inference request in 2.259381055831909s
Received healthy response to inference request in 1.9484813213348389s
Received healthy response to inference request in 1.9946482181549072s
Received healthy response to inference request in 1.9431190490722656s
Received healthy response to inference request in 1.9410734176635742s
Received healthy response to inference request in 1.9676637649536133s
Received healthy response to inference request in 2.016817808151245s
Received healthy response to inference request in 2.040818929672241s
Received healthy response to inference request in 2.0144572257995605s
Received healthy response to inference request in 1.9822180271148682s
Received healthy response to inference request in 2.0578653812408447s
Received healthy response to inference request in 2.1963648796081543s
Received healthy response to inference request in 2.0687456130981445s
Received healthy response to inference request in 2.0623528957366943s
Received healthy response to inference request in 2.1138529777526855s
Received healthy response to inference request in 2.085237979888916s
Failed to get response for submission chaiml-grpo-q235b-kimid_37540_v1: HTTPConnectionPool(host='chaiml-grpo-q235b-kimid-37540-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Received healthy response to inference request in 1.9581325054168701s
Received healthy response to inference request in 1.9847056865692139s
Received healthy response to inference request in 1.9340476989746094s
Received healthy response to inference request in 2.035355806350708s
Received healthy response to inference request in 2.154454231262207s
Received healthy response to inference request in 2.3315796852111816s
30 requests
0 failed requests
5th percentile: 1.9372092723846435
10th percentile: 1.9429144859313965
20th percentile: 1.9589375972747802
30th percentile: 1.9778517484664917
40th percentile: 1.99067120552063
50th percentile: 2.0260868072509766
60th percentile: 2.0577712535858153
70th percentile: 2.067258930206299
80th percentile: 2.100273418426514
90th percentile: 2.20266649723053
95th percentile: 2.2990903019905087
99th percentile: 2.37354367017746
mean time: 2.0510136206944782
Pipeline stage StressChecker completed in 67.25s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.73s
Shutdown handler de-registered
chaiml-reward-dpo-1c36-_72316_v2 status is now deployed due to DeploymentManager action
chaiml-reward-dpo-1c36-_72316_v2 status is now inactive due to auto deactivation removed underperforming models
chaiml-reward-dpo-1c36-_72316_v2 status is now torndown due to DeploymentManager action