Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-csfs-v3-3-dpo-l-86358-v24-uploader
Waiting for job on chaiml-csfs-v3-3-dpo-l-86358-v24-uploader to finish
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: Using quantization_mode: fp8
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: Checking if ChaiML/csfs-v3-3-dpo-lr5e6b01-lora-FP8 already exists in ChaiML
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: Downloading snapshot of ChaiML/csfs-v3-3-dpo-lr5e6b01-lora-FP8...
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: Downloaded in 12.421s
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: Processed model ChaiML/csfs-v3-3-dpo-lr5e6b01-lora in 15.989s
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: creating bucket guanaco-vllm-models
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-l-86358-v24/default
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-l-86358-v24/default/.gitattributes
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-l-86358-v24/default/config.json
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-l-86358-v24/default/model.safetensors.index.json
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-l-86358-v24/default/generation_config.json
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-l-86358-v24/default/recipe.yaml
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-l-86358-v24/default/special_tokens_map.json
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-l-86358-v24/default/tokenizer_config.json
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-l-86358-v24/default/tokenizer.json
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: cp /dev/shm/model_output/model-00006-of-00006.safetensors s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-l-86358-v24/default/model-00006-of-00006.safetensors
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: cp /dev/shm/model_output/model-00005-of-00006.safetensors s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-l-86358-v24/default/model-00005-of-00006.safetensors
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: cp /dev/shm/model_output/model-00002-of-00006.safetensors s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-l-86358-v24/default/model-00002-of-00006.safetensors
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: cp /dev/shm/model_output/model-00003-of-00006.safetensors s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-l-86358-v24/default/model-00003-of-00006.safetensors
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: cp /dev/shm/model_output/model-00001-of-00006.safetensors s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-l-86358-v24/default/model-00001-of-00006.safetensors
chaiml-csfs-v3-3-dpo-l-86358-v24-uploader: cp /dev/shm/model_output/model-00004-of-00006.safetensors s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-l-86358-v24/default/model-00004-of-00006.safetensors
Job chaiml-csfs-v3-3-dpo-l-86358-v24-uploader completed after 115.11s with status: succeeded
Stopping job with name chaiml-csfs-v3-3-dpo-l-86358-v24-uploader
Pipeline stage VLLMUploader completed in 115.81s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-csfs-v3-3-dpo-l-86358-v24
Waiting for inference service chaiml-csfs-v3-3-dpo-l-86358-v24 to be ready
Failed to get response for submission chaiml-mistral-24b-2048_54327_v6: ('http://chaiml-mistral-24b-2048-54327-v6-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048-_2678_v3: ('http://chaiml-mistral-24b-2048-2678-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-mistral-24b-2048_15988_v1: ('http://chaiml-mistral-24b-2048-15988-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Inference service chaiml-csfs-v3-3-dpo-l-86358-v24 ready after 191.87887907028198s
Pipeline stage VLLMDeployer completed in 193.25s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.8880529403686523s
Received healthy response to inference request in 1.4656856060028076s
Received healthy response to inference request in 2.2079861164093018s
Received healthy response to inference request in 1.662477731704712s
Received healthy response to inference request in 1.5891883373260498s
Received healthy response to inference request in 1.5282773971557617s
Received healthy response to inference request in 1.4179067611694336s
Received healthy response to inference request in 1.4105422496795654s
Received healthy response to inference request in 1.4129083156585693s
Received healthy response to inference request in 1.3435604572296143s
Received healthy response to inference request in 2.1713345050811768s
Received healthy response to inference request in 1.6501429080963135s
Received healthy response to inference request in 1.4412896633148193s
Received healthy response to inference request in 1.4035766124725342s
Received healthy response to inference request in 1.6062877178192139s
Received healthy response to inference request in 1.3629388809204102s
Received healthy response to inference request in 1.3878185749053955s
Received healthy response to inference request in 1.415313959121704s
Received healthy response to inference request in 1.6976399421691895s
Received healthy response to inference request in 1.6694047451019287s
Received healthy response to inference request in 1.4838061332702637s
Received healthy response to inference request in 1.4408915042877197s
Received healthy response to inference request in 1.4520885944366455s
Received healthy response to inference request in 1.4637861251831055s
Received healthy response to inference request in 1.6398463249206543s
Received healthy response to inference request in 1.4356393814086914s
Received healthy response to inference request in 1.5613505840301514s
Received healthy response to inference request in 1.4850893020629883s
Received healthy response to inference request in 1.328766107559204s
Received healthy response to inference request in 2.1018178462982178s
30 requests
0 failed requests
5th percentile: 1.3522807478904724
10th percentile: 1.385330605506897
20th percentile: 1.4124351024627686
30th percentile: 1.430319595336914
40th percentile: 1.4477690219879151
50th percentile: 1.4747458696365356
60th percentile: 1.5415066719055175
70th percentile: 1.616355299949646
80th percentile: 1.6638631343841552
90th percentile: 1.909429430961609
95th percentile: 2.140052008628845
99th percentile: 2.1973571491241457
mean time: 1.570847177505493
Pipeline stage StressChecker completed in 50.61s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.66s
Shutdown handler de-registered
chaiml-csfs-v3-3-dpo-l_86358_v24 status is now deployed due to DeploymentManager action
chaiml-csfs-v3-3-dpo-l_86358_v24 status is now inactive due to auto deactivation removed underperforming models