Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-kaniwara-japan-fut-433-v7-uploader
Waiting for job on chaiml-kaniwara-japan-fut-433-v7-uploader to finish
chaiml-kaniwara-japan-fut-433-v7-uploader: Using quantization_mode: fp8
2026-03-06T23:55:49.456968+00:00 monitor updated for chaiml-kaniwara-japan-fut_433_v7
chaiml-kaniwara-japan-fut-433-v7-uploader: Checking if ChaiML/Kaniwara-Japan-Future-RPG260223150837_sft-FP8 already exists in ChaiML
chaiml-kaniwara-japan-fut-433-v7-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-kaniwara-japan-fut-433-v7-uploader: Downloading snapshot of ChaiML/Kaniwara-Japan-Future-RPG260223150837_sft-FP8...
chaiml-kaniwara-japan-fut-433-v7-uploader: Downloaded in 10.889s
chaiml-kaniwara-japan-fut-433-v7-uploader: Processed model ChaiML/Kaniwara-Japan-Future-RPG260223150837_sft in 14.398s
chaiml-kaniwara-japan-fut-433-v7-uploader: creating bucket guanaco-vllm-models
chaiml-kaniwara-japan-fut-433-v7-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-kaniwara-japan-fut-433-v7-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-kaniwara-japan-fut-433-v7-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-kaniwara-japan-fut-433-v7-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-kaniwara-japan-fut-433-v7-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-kaniwara-japan-fut-433-v7-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-kaniwara-japan-fut-433-v7-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-kaniwara-japan-fut-433-v7-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-kaniwara-japan-fut-433-v7-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-kaniwara-japan-fut-433-v7-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-kaniwara-japan-fut-433-v7-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-kaniwara-japan-fut-433-v7-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-kaniwara-japan-fut-433-v7-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-kaniwara-japan-fut-433-v7-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-kaniwara-japan-fut-433-v7-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-kaniwara-japan-fut-433-v7-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-kaniwara-japan-fut-433-v7-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-kaniwara-japan-fut-433-v7-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-kaniwara-japan-fut-433-v7/default
chaiml-kaniwara-japan-fut-433-v7-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-kaniwara-japan-fut-433-v7/default/recipe.yaml
chaiml-kaniwara-japan-fut-433-v7-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-kaniwara-japan-fut-433-v7/default/.gitattributes
chaiml-kaniwara-japan-fut-433-v7-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-kaniwara-japan-fut-433-v7/default/config.json
chaiml-kaniwara-japan-fut-433-v7-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-kaniwara-japan-fut-433-v7/default/generation_config.json
chaiml-kaniwara-japan-fut-433-v7-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-kaniwara-japan-fut-433-v7/default/special_tokens_map.json
chaiml-kaniwara-japan-fut-433-v7-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-kaniwara-japan-fut-433-v7/default/model.safetensors.index.json
chaiml-kaniwara-japan-fut-433-v7-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-kaniwara-japan-fut-433-v7/default/tokenizer_config.json
chaiml-kaniwara-japan-fut-433-v7-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-kaniwara-japan-fut-433-v7/default/tokenizer.json
chaiml-kaniwara-japan-fut-433-v7-uploader: cp /dev/shm/model_output/model-00006-of-00006.safetensors s3://guanaco-vllm-models/chaiml-kaniwara-japan-fut-433-v7/default/model-00006-of-00006.safetensors
chaiml-kaniwara-japan-fut-433-v7-uploader: cp /dev/shm/model_output/model-00005-of-00006.safetensors s3://guanaco-vllm-models/chaiml-kaniwara-japan-fut-433-v7/default/model-00005-of-00006.safetensors
chaiml-kaniwara-japan-fut-433-v7-uploader: cp /dev/shm/model_output/model-00002-of-00006.safetensors s3://guanaco-vllm-models/chaiml-kaniwara-japan-fut-433-v7/default/model-00002-of-00006.safetensors
chaiml-kaniwara-japan-fut-433-v7-uploader: cp /dev/shm/model_output/model-00001-of-00006.safetensors s3://guanaco-vllm-models/chaiml-kaniwara-japan-fut-433-v7/default/model-00001-of-00006.safetensors
chaiml-kaniwara-japan-fut-433-v7-uploader: cp /dev/shm/model_output/model-00004-of-00006.safetensors s3://guanaco-vllm-models/chaiml-kaniwara-japan-fut-433-v7/default/model-00004-of-00006.safetensors
Job chaiml-kaniwara-japan-fut-433-v7-uploader completed after 99.42s with status: succeeded
Stopping job with name chaiml-kaniwara-japan-fut-433-v7-uploader
Pipeline stage VLLMUploader completed in 100.79s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 1.25s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-kaniwara-japan-fut-433-v7
Waiting for inference service chaiml-kaniwara-japan-fut-433-v7 to be ready
2026-03-06T23:56:49.651377+00:00 monitor updated for chaiml-kaniwara-japan-fut_433_v7
2026-03-06T23:57:49.851138+00:00 monitor updated for chaiml-kaniwara-japan-fut_433_v7
2026-03-06T23:58:50.044910+00:00 monitor updated for chaiml-kaniwara-japan-fut_433_v7
Inference service chaiml-kaniwara-japan-fut-433-v7 ready after 162.2196114063263s
Pipeline stage VLLMDeployer completed in 163.26s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 14.992543458938599s
Received healthy response to inference request in 3.3701040744781494s
Received healthy response to inference request in 2.870979070663452s
Received healthy response to inference request in 3.0172119140625s
Received healthy response to inference request in 2.952712059020996s
Received healthy response to inference request in 2.8294994831085205s
2026-03-06T23:59:50.279228+00:00 monitor updated for chaiml-kaniwara-japan-fut_433_v7
Received healthy response to inference request in 3.815223217010498s
Received healthy response to inference request in 3.011265993118286s
Received healthy response to inference request in 3.169055461883545s
Received healthy response to inference request in 3.032264471054077s
Received healthy response to inference request in 2.9394423961639404s
Received healthy response to inference request in 3.050506114959717s
Received healthy response to inference request in 3.003505229949951s
Received healthy response to inference request in 2.866248607635498s
Received healthy response to inference request in 2.8386805057525635s
Received healthy response to inference request in 3.2642922401428223s
Received healthy response to inference request in 2.922076463699341s
Received healthy response to inference request in 2.922126293182373s
Received healthy response to inference request in 3.6191301345825195s
Received healthy response to inference request in 3.12699294090271s
Received healthy response to inference request in 3.520318031311035s
2026-03-07T00:00:50.495462+00:00 monitor updated for chaiml-kaniwara-japan-fut_433_v7
Received healthy response to inference request in 3.156979560852051s
Received healthy response to inference request in 3.0006890296936035s
Received healthy response to inference request in 2.983616590499878s
Received healthy response to inference request in 3.08056902885437s
Received healthy response to inference request in 3.063634157180786s
Received healthy response to inference request in 3.3533568382263184s
Received healthy response to inference request in 2.931096076965332s
Received healthy response to inference request in 2.9901773929595947s
Received healthy response to inference request in 3.395210027694702s
30 requests
0 failed requests
5th percentile: 2.851086151599884
10th percentile: 2.870506024360657
20th percentile: 2.92930212020874
30th percentile: 2.9743452310562133
40th percentile: 3.002378749847412
50th percentile: 3.0247381925582886
60th percentile: 3.0704081058502197
70th percentile: 3.160602331161499
80th percentile: 3.3567062854766845
90th percentile: 3.5301992416381838
95th percentile: 3.726981329917907
99th percentile: 11.75112058877946
mean time: 3.502983562151591
Pipeline stage StressChecker completed in 123.56s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.31s
Shutdown handler de-registered
chaiml-kaniwara-japan-fut_433_v7 status is now deployed due to DeploymentManager action
chaiml-kaniwara-japan-fut_433_v7 status is now inactive due to system request
chaiml-kaniwara-japan-fut_433_v7 status is now torndown due to DeploymentManager action