Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-prm-kimi-v1-300k-92220-v9-uploader
Waiting for job on chaiml-prm-kimi-v1-300k-92220-v9-uploader to finish
chaiml-prm-kimi-v1-300k-92220-v9-uploader: Using quantization_mode: none
chaiml-prm-kimi-v1-300k-92220-v9-uploader: Downloading snapshot of ChaiML/prm_kimi_v1_300k_default8b-cosine-lr1e6g32...
chaiml-prm-kimi-v1-300k-92220-v9-uploader: Downloaded in 6.851s
chaiml-prm-kimi-v1-300k-92220-v9-uploader: Processed model ChaiML/prm_kimi_v1_300k_default8b-cosine-lr1e6g32 in 12.431s
chaiml-prm-kimi-v1-300k-92220-v9-uploader: creating bucket guanaco-vllm-models
chaiml-prm-kimi-v1-300k-92220-v9-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-prm-kimi-v1-300k-92220-v9-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-prm-kimi-v1-300k-92220-v9-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-prm-kimi-v1-300k-92220-v9-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-prm-kimi-v1-300k-92220-v9-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-prm-kimi-v1-300k-92220-v9-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-prm-kimi-v1-300k-92220-v9-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-prm-kimi-v1-300k-92220-v9-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-prm-kimi-v1-300k-92220-v9-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-prm-kimi-v1-300k-92220-v9-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-prm-kimi-v1-300k-92220-v9-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-prm-kimi-v1-300k-92220-v9-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-prm-kimi-v1-300k-92220-v9-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-prm-kimi-v1-300k-92220-v9-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-prm-kimi-v1-300k-92220-v9-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-prm-kimi-v1-300k-92220-v9-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-prm-kimi-v1-300k-92220-v9-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-prm-kimi-v1-300k-92220-v9-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-prm-kimi-v1-300k-92220-v9/default
chaiml-prm-kimi-v1-300k-92220-v9-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-prm-kimi-v1-300k-92220-v9/default/special_tokens_map.json
chaiml-prm-kimi-v1-300k-92220-v9-uploader: cp /dev/shm/model_output/README.md s3://guanaco-vllm-models/chaiml-prm-kimi-v1-300k-92220-v9/default/README.md
chaiml-prm-kimi-v1-300k-92220-v9-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-prm-kimi-v1-300k-92220-v9/default/config.json
chaiml-prm-kimi-v1-300k-92220-v9-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-prm-kimi-v1-300k-92220-v9/default/model.safetensors.index.json
chaiml-prm-kimi-v1-300k-92220-v9-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-prm-kimi-v1-300k-92220-v9/default/tokenizer_config.json
chaiml-prm-kimi-v1-300k-92220-v9-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-prm-kimi-v1-300k-92220-v9/default/.gitattributes
chaiml-prm-kimi-v1-300k-92220-v9-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-prm-kimi-v1-300k-92220-v9/default/tokenizer.json
chaiml-prm-kimi-v1-300k-92220-v9-uploader: cp /dev/shm/model_output/model-00004-of-00004.safetensors s3://guanaco-vllm-models/chaiml-prm-kimi-v1-300k-92220-v9/default/model-00004-of-00004.safetensors
chaiml-prm-kimi-v1-300k-92220-v9-uploader: cp /dev/shm/model_output/model-00003-of-00004.safetensors s3://guanaco-vllm-models/chaiml-prm-kimi-v1-300k-92220-v9/default/model-00003-of-00004.safetensors
chaiml-prm-kimi-v1-300k-92220-v9-uploader: cp /dev/shm/model_output/model-00002-of-00004.safetensors s3://guanaco-vllm-models/chaiml-prm-kimi-v1-300k-92220-v9/default/model-00002-of-00004.safetensors
chaiml-prm-kimi-v1-300k-92220-v9-uploader: cp /dev/shm/model_output/model-00001-of-00004.safetensors s3://guanaco-vllm-models/chaiml-prm-kimi-v1-300k-92220-v9/default/model-00001-of-00004.safetensors
Job chaiml-prm-kimi-v1-300k-92220-v9-uploader completed after 42.23s with status: succeeded
Stopping job with name chaiml-prm-kimi-v1-300k-92220-v9-uploader
Pipeline stage VLLMUploader completed in 42.74s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-prm-kimi-v1-300k-92220-v9
Waiting for inference service chaiml-prm-kimi-v1-300k-92220-v9 to be ready
Inference service chaiml-prm-kimi-v1-300k-92220-v9 ready after 161.96089959144592s
Pipeline stage VLLMDeployer completed in 162.59s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.544796705245972s
Received healthy response to inference request in 3.4686334133148193s
Received healthy response to inference request in 4.1050803661346436s
Received healthy response to inference request in 3.578127384185791s
Received healthy response to inference request in 3.4821970462799072s
5 requests
0 failed requests
5th percentile: 3.471346139907837
10th percentile: 3.4740588665008545
20th percentile: 3.4794843196868896
30th percentile: 3.501383113861084
40th percentile: 3.5397552490234374
50th percentile: 3.578127384185791
60th percentile: 3.788908576965332
70th percentile: 3.999689769744873
80th percentile: 4.193023633956909
90th percentile: 4.36891016960144
95th percentile: 4.456853437423706
99th percentile: 4.5272080516815185
mean time: 3.8357669830322267
Pipeline stage StressChecker completed in 20.63s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.61s
Shutdown handler de-registered
chaiml-prm-kimi-v1-300k_92220_v9 status is now deployed due to DeploymentManager action
chaiml-prm-kimi-v1-300k_92220_v9 status is now inactive due to auto deactivation: removed underperforming models