Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-02f4-69d4-linea-30131-v13-uploader
Waiting for job on chaiml-02f4-69d4-linea-30131-v13-uploader to finish
chaiml-02f4-69d4-linea-30131-v13-uploader: Using quantization_mode: fp8
chaiml-02f4-69d4-linea-30131-v13-uploader: Repo ChaiML/02f4-69d4-linear-w01-FP8 already ends in FP8. Skipping...
chaiml-02f4-69d4-linea-30131-v13-uploader: Checking if ChaiML/02f4-69d4-linear-w01-FP8 already exists in ChaiML
chaiml-02f4-69d4-linea-30131-v13-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-02f4-69d4-linea-30131-v13-uploader: Downloading snapshot of ChaiML/02f4-69d4-linear-w01-FP8...
chaiml-02f4-69d4-linea-30131-v13-uploader: Downloaded in 8.804s
chaiml-02f4-69d4-linea-30131-v13-uploader: Processed model ChaiML/02f4-69d4-linear-w01-FP8 in 12.454s
2026-03-25T22:18:47.798467+00:00 monitor updated for chaiml-02f4-69d4-linea_30131_v13
chaiml-02f4-69d4-linea-30131-v13-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-02f4-69d4-linea-30131-v13-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-02f4-69d4-linea-30131-v13-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-02f4-69d4-linea-30131-v13-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-02f4-69d4-linea-30131-v13-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-02f4-69d4-linea-30131-v13-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-02f4-69d4-linea-30131-v13/default
chaiml-02f4-69d4-linea-30131-v13-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-02f4-69d4-linea-30131-v13/default/model.safetensors.index.json
chaiml-02f4-69d4-linea-30131-v13-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-02f4-69d4-linea-30131-v13/default/special_tokens_map.json
chaiml-02f4-69d4-linea-30131-v13-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-02f4-69d4-linea-30131-v13/default/.gitattributes
chaiml-02f4-69d4-linea-30131-v13-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-02f4-69d4-linea-30131-v13/default/config.json
chaiml-02f4-69d4-linea-30131-v13-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-02f4-69d4-linea-30131-v13/default/recipe.yaml
chaiml-02f4-69d4-linea-30131-v13-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-02f4-69d4-linea-30131-v13/default/tokenizer_config.json
chaiml-02f4-69d4-linea-30131-v13-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-02f4-69d4-linea-30131-v13/default/generation_config.json
chaiml-02f4-69d4-linea-30131-v13-uploader: cp /dev/shm/model_output/README.md s3://guanaco-vllm-models/chaiml-02f4-69d4-linea-30131-v13/default/README.md
chaiml-02f4-69d4-linea-30131-v13-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-02f4-69d4-linea-30131-v13/default/tokenizer.json
chaiml-02f4-69d4-linea-30131-v13-uploader: cp /dev/shm/model_output/model-00006-of-00006.safetensors s3://guanaco-vllm-models/chaiml-02f4-69d4-linea-30131-v13/default/model-00006-of-00006.safetensors
chaiml-02f4-69d4-linea-30131-v13-uploader: cp /dev/shm/model_output/model-00005-of-00006.safetensors s3://guanaco-vllm-models/chaiml-02f4-69d4-linea-30131-v13/default/model-00005-of-00006.safetensors
chaiml-02f4-69d4-linea-30131-v13-uploader: cp /dev/shm/model_output/model-00001-of-00006.safetensors s3://guanaco-vllm-models/chaiml-02f4-69d4-linea-30131-v13/default/model-00001-of-00006.safetensors
chaiml-02f4-69d4-linea-30131-v13-uploader: cp /dev/shm/model_output/model-00002-of-00006.safetensors s3://guanaco-vllm-models/chaiml-02f4-69d4-linea-30131-v13/default/model-00002-of-00006.safetensors
chaiml-02f4-69d4-linea-30131-v13-uploader: cp /dev/shm/model_output/model-00004-of-00006.safetensors s3://guanaco-vllm-models/chaiml-02f4-69d4-linea-30131-v13/default/model-00004-of-00006.safetensors
chaiml-02f4-69d4-linea-30131-v13-uploader: cp /dev/shm/model_output/model-00003-of-00006.safetensors s3://guanaco-vllm-models/chaiml-02f4-69d4-linea-30131-v13/default/model-00003-of-00006.safetensors
Job chaiml-02f4-69d4-linea-30131-v13-uploader completed after 72.82s with status: succeeded
Stopping job with name chaiml-02f4-69d4-linea-30131-v13-uploader
Pipeline stage VLLMUploader completed in 73.54s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 2.90s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-02f4-69d4-linea-30131-v13
Waiting for inference service chaiml-02f4-69d4-linea-30131-v13 to be ready
2026-03-25T22:19:47.893720+00:00 monitor updated for chaiml-02f4-69d4-linea_30131_v13
Failed to get request counts for guanaco-submitter. Falling back to default
2026-03-25T22:20:47.993033+00:00 monitor updated for chaiml-02f4-69d4-linea_30131_v13
Inference service chaiml-02f4-69d4-linea-30131-v13 ready after 160.48605513572693s
Pipeline stage VLLMDeployer completed in 161.02s
run pipeline stage %s
Running pipeline stage StressChecker
2026-03-25T22:21:48.094779+00:00 monitor updated for chaiml-02f4-69d4-linea_30131_v13
Received healthy response to inference request in 7.961947679519653s
Received healthy response to inference request in 7.875259876251221s
Received healthy response to inference request in 7.872296571731567s
Received healthy response to inference request in 7.943522930145264s
Received healthy response to inference request in 3.049631357192993s
Received healthy response to inference request in 3.0612616539001465s
Received healthy response to inference request in 2.9459388256073s
Received healthy response to inference request in 2.9988555908203125s
Received healthy response to inference request in 2.9669010639190674s
Received healthy response to inference request in 8.031169176101685s
Received healthy response to inference request in 2.9761252403259277s
Received healthy response to inference request in 3.129450798034668s
2026-03-25T22:22:48.205321+00:00 monitor updated for chaiml-02f4-69d4-linea_30131_v13
Received healthy response to inference request in 2.867159605026245s
Received healthy response to inference request in 2.84409499168396s
Received healthy response to inference request in 2.8738129138946533s
Received healthy response to inference request in 3.1140482425689697s
Received healthy response to inference request in 2.9541218280792236s
Received healthy response to inference request in 3.0273513793945312s
Received healthy response to inference request in 2.878977060317993s
Received healthy response to inference request in 2.968745231628418s
Received healthy response to inference request in 2.8563544750213623s
Received healthy response to inference request in 2.9441537857055664s
Received healthy response to inference request in 2.948608636856079s
Received healthy response to inference request in 2.895895481109619s
Received healthy response to inference request in 2.8867790699005127s
Received healthy response to inference request in 2.946988344192505s
Received healthy response to inference request in 2.8870959281921387s
Received healthy response to inference request in 3.4826819896698s
Received healthy response to inference request in 3.5768883228302s
Received healthy response to inference request in 3.150329351425171s
30 requests
0 failed requests
5th percentile: 2.8612167835235596
10th percentile: 2.8731475830078126
20th percentile: 2.8870325565338133
30th percentile: 2.94540331363678
40th percentile: 2.951916551589966
50th percentile: 2.972435235977173
60th percentile: 3.036263370513916
70th percentile: 3.1186690092086793
80th percentile: 3.5015232563018803
90th percentile: 7.882086181640625
95th percentile: 7.953656542301178
99th percentile: 8.011094942092896
mean time: 3.8305482467015586
Pipeline stage StressChecker completed in 117.51s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.64s
Shutdown handler de-registered
chaiml-02f4-69d4-linea_30131_v13 status is now deployed due to DeploymentManager action
chaiml-02f4-69d4-linea_30131_v13 status is now inactive due to system request
chaiml-02f4-69d4-linea_30131_v13 status is now torndown due to DeploymentManager action