Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-star-trek-open-r-10141-v3-uploader
Waiting for job on chaiml-star-trek-open-r-10141-v3-uploader to finish
chaiml-star-trek-open-r-10141-v3-uploader: Using quantization_mode: fp8
chaiml-star-trek-open-r-10141-v3-uploader: Checking if ChaiML/Star-Trek-Open-RP-After-the-Burn260303204351_sft-FP8 already exists in ChaiML
chaiml-star-trek-open-r-10141-v3-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-star-trek-open-r-10141-v3-uploader: Downloading snapshot of ChaiML/Star-Trek-Open-RP-After-the-Burn260303204351_sft-FP8...
chaiml-star-trek-open-r-10141-v3-uploader: Downloaded in 14.651s
chaiml-star-trek-open-r-10141-v3-uploader: Processed model ChaiML/Star-Trek-Open-RP-After-the-Burn260303204351_sft in 18.128s
chaiml-star-trek-open-r-10141-v3-uploader: creating bucket guanaco-vllm-models
chaiml-star-trek-open-r-10141-v3-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-star-trek-open-r-10141-v3-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-star-trek-open-r-10141-v3-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-star-trek-open-r-10141-v3-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-star-trek-open-r-10141-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-star-trek-open-r-10141-v3-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-star-trek-open-r-10141-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-star-trek-open-r-10141-v3-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-star-trek-open-r-10141-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-star-trek-open-r-10141-v3-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-star-trek-open-r-10141-v3-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-star-trek-open-r-10141-v3-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-star-trek-open-r-10141-v3-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-star-trek-open-r-10141-v3-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-star-trek-open-r-10141-v3-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-star-trek-open-r-10141-v3-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-star-trek-open-r-10141-v3-uploader: cp /dev/shm/model_output/model-00006-of-00006.safetensors s3://guanaco-vllm-models/chaiml-star-trek-open-r-10141-v3/default/model-00006-of-00006.safetensors
chaiml-star-trek-open-r-10141-v3-uploader: cp /dev/shm/model_output/model-00005-of-00006.safetensors s3://guanaco-vllm-models/chaiml-star-trek-open-r-10141-v3/default/model-00005-of-00006.safetensors
chaiml-star-trek-open-r-10141-v3-uploader: cp /dev/shm/model_output/model-00002-of-00006.safetensors s3://guanaco-vllm-models/chaiml-star-trek-open-r-10141-v3/default/model-00002-of-00006.safetensors
chaiml-star-trek-open-r-10141-v3-uploader: cp /dev/shm/model_output/model-00001-of-00006.safetensors s3://guanaco-vllm-models/chaiml-star-trek-open-r-10141-v3/default/model-00001-of-00006.safetensors
chaiml-star-trek-open-r-10141-v3-uploader: cp /dev/shm/model_output/model-00004-of-00006.safetensors s3://guanaco-vllm-models/chaiml-star-trek-open-r-10141-v3/default/model-00004-of-00006.safetensors
chaiml-star-trek-open-r-10141-v3-uploader: cp /dev/shm/model_output/model-00003-of-00006.safetensors s3://guanaco-vllm-models/chaiml-star-trek-open-r-10141-v3/default/model-00003-of-00006.safetensors
Job chaiml-star-trek-open-r-10141-v3-uploader completed after 102.47s with status: succeeded
Stopping job with name chaiml-star-trek-open-r-10141-v3-uploader
Pipeline stage VLLMUploader completed in 102.91s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.29s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-star-trek-open-r-10141-v3
Waiting for inference service chaiml-star-trek-open-r-10141-v3 to be ready
Inference service chaiml-star-trek-open-r-10141-v3 ready after 160.7144079208374s
Pipeline stage VLLMDeployer completed in 161.24s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.3746747970581055s
Received healthy response to inference request in 3.287095546722412s
Received healthy response to inference request in 2.391479015350342s
Received healthy response to inference request in 3.2684035301208496s
Received healthy response to inference request in 2.8273866176605225s
Received healthy response to inference request in 2.7483649253845215s
Received healthy response to inference request in 2.8803319931030273s
Received healthy response to inference request in 3.040081024169922s
Received healthy response to inference request in 2.973235845565796s
Received healthy response to inference request in 4.917659759521484s
Received healthy response to inference request in 2.7131597995758057s
Received healthy response to inference request in 2.87929105758667s
Received healthy response to inference request in 2.831308603286743s
Received healthy response to inference request in 2.9610891342163086s
Received healthy response to inference request in 3.0288498401641846s
Received healthy response to inference request in 3.294755697250366s
Received healthy response to inference request in 3.1100594997406006s
Received healthy response to inference request in 2.0433011054992676s
Received healthy response to inference request in 1.486976146697998s
Received healthy response to inference request in 1.4234018325805664s
Received healthy response to inference request in 2.8582324981689453s
Received healthy response to inference request in 2.866135597229004s
Received healthy response to inference request in 2.038456439971924s
Received healthy response to inference request in 1.8305504322052002s
Received healthy response to inference request in 2.5215096473693848s
Received healthy response to inference request in 2.8787107467651367s
Received healthy response to inference request in 2.053999423980713s
Received healthy response to inference request in 3.000231981277466s
Received healthy response to inference request in 2.92716121673584s
Received healthy response to inference request in 2.1105258464813232s
30 requests
0 failed requests
5th percentile: 1.641584575176239
10th percentile: 2.0176658391952516
20th percentile: 2.099220561981201
30th percentile: 2.655664753913879
40th percentile: 2.829739809036255
50th percentile: 2.8724231719970703
60th percentile: 2.8990636825561524
70th percentile: 2.981334686279297
80th percentile: 3.054076719284058
90th percentile: 3.2878615617752076
95th percentile: 3.3387112021446224
99th percentile: 4.470194120407106
mean time: 2.752213986714681
Pipeline stage StressChecker completed in 85.00s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
chaiml-star-trek-open-r_10141_v3 status is now deployed due to DeploymentManager action
chaiml-star-trek-open-r_10141_v3 status is now inactive due to auto deactivation removed underperforming models
chaiml-star-trek-open-r_10141_v3 status is now torndown due to DeploymentManager action