Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader
Waiting for job on chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader to finish
2026-03-25T12:28:49.880326+00:00 monitor updated for chaiml-ssnew-v5-dpo-lr_19068_v15
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: Using quantization_mode: fp8
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: Repo ChaiML/csfs-v3-3-dpo-lr5e6b01-lora-FP8 already ends in FP8. Skipping...
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: Checking if ChaiML/csfs-v3-3-dpo-lr5e6b01-lora-FP8 already exists in ChaiML
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: Model already exists. Downloading to /dev/shm/model_output...
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: Downloading snapshot of ChaiML/csfs-v3-3-dpo-lr5e6b01-lora-FP8...
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: Downloaded in 10.686s
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: Processed model ChaiML/csfs-v3-3-dpo-lr5e6b01-lora-FP8 in 14.308s
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: creating bucket guanaco-vllm-models
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-lr5-937-v10/default
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-lr5-937-v10/default/model.safetensors.index.json
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: cp /dev/shm/model_output/README.md s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-lr5-937-v10/default/README.md
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-lr5-937-v10/default/.gitattributes
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-lr5-937-v10/default/special_tokens_map.json
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-lr5-937-v10/default/config.json
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-lr5-937-v10/default/tokenizer_config.json
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: cp /dev/shm/model_output/generation_config.json s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-lr5-937-v10/default/generation_config.json
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: cp /dev/shm/model_output/recipe.yaml s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-lr5-937-v10/default/recipe.yaml
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-lr5-937-v10/default/tokenizer.json
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: cp /dev/shm/model_output/model-00006-of-00006.safetensors s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-lr5-937-v10/default/model-00006-of-00006.safetensors
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: cp /dev/shm/model_output/model-00005-of-00006.safetensors s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-lr5-937-v10/default/model-00005-of-00006.safetensors
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: cp /dev/shm/model_output/model-00003-of-00006.safetensors s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-lr5-937-v10/default/model-00003-of-00006.safetensors
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: cp /dev/shm/model_output/model-00002-of-00006.safetensors s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-lr5-937-v10/default/model-00002-of-00006.safetensors
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: cp /dev/shm/model_output/model-00004-of-00006.safetensors s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-lr5-937-v10/default/model-00004-of-00006.safetensors
chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader: cp /dev/shm/model_output/model-00001-of-00006.safetensors s3://guanaco-vllm-models/chaiml-csfs-v3-3-dpo-lr5-937-v10/default/model-00001-of-00006.safetensors
Job chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader completed after 44.94s with status: succeeded
Stopping job with name chaiml-csfs-v3-3-dpo-lr5-937-v10-uploader
Pipeline stage VLLMUploader completed in 45.63s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.45s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-csfs-v3-3-dpo-lr5-937-v10
Waiting for inference service chaiml-csfs-v3-3-dpo-lr5-937-v10 to be ready
2026-03-25T12:29:34.408331+00:00 monitor updated for chaiml-csfs-v3-3-dpo-lr5_937_v10
Inference service chaiml-ssnew-v5-dpo-lr-19068-v15 ready after 160.3820993900299s
Pipeline stage VLLMDeployer completed in 161.03s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.398024559020996s
Received healthy response to inference request in 4.5714874267578125s
2026-03-25T12:29:50.301336+00:00 monitor updated for chaiml-ssnew-v5-dpo-lr_19068_v15
Received healthy response to inference request in 4.192166566848755s
Received healthy response to inference request in 2.6582190990448s
Received healthy response to inference request in 2.703934907913208s
Received healthy response to inference request in 2.6619629859924316s
Received healthy response to inference request in 2.724956750869751s
Received healthy response to inference request in 2.737095594406128s
Received healthy response to inference request in 2.795311450958252s
Received healthy response to inference request in 4.2797462940216064s
Received healthy response to inference request in 2.7763922214508057s
Received healthy response to inference request in 2.6876420974731445s
Received healthy response to inference request in 2.6869559288024902s
Received healthy response to inference request in 2.669828414916992s
Received healthy response to inference request in 2.7208542823791504s
Received healthy response to inference request in 2.6431548595428467s
Received healthy response to inference request in 2.723360300064087s
2026-03-25T12:30:34.559458+00:00 monitor updated for chaiml-csfs-v3-3-dpo-lr5_937_v10
Received healthy response to inference request in 2.6743898391723633s
Received healthy response to inference request in 2.6841237545013428s
Received healthy response to inference request in 2.6775670051574707s
Received healthy response to inference request in 2.752131700515747s
Received healthy response to inference request in 2.684474229812622s
Received healthy response to inference request in 2.677527904510498s
2026-03-25T12:30:50.468886+00:00 monitor updated for chaiml-ssnew-v5-dpo-lr_19068_v15
Received healthy response to inference request in 4.31263542175293s
Received healthy response to inference request in 2.77150297164917s
Received healthy response to inference request in 2.6898088455200195s
Received healthy response to inference request in 2.6948516368865967s
Received healthy response to inference request in 2.757887840270996s
Received healthy response to inference request in 2.876107931137085s
Received healthy response to inference request in 2.7837047576904297s
30 requests
0 failed requests
5th percentile: 2.659903848171234
10th percentile: 2.669041872024536
20th percentile: 2.677559185028076
30th percentile: 2.68621141910553
40th percentile: 2.6928345203399657
50th percentile: 2.7221072912216187
60th percentile: 2.7431100368499757
70th percentile: 2.7729697465896606
80th percentile: 2.811470746994019
90th percentile: 4.283035206794739
95th percentile: 4.359599447250366
99th percentile: 4.521183195114136
mean time: 2.988926919301351
Pipeline stage StressChecker completed in 95.25s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.43s
Shutdown handler de-registered
chaiml-ssnew-v5-dpo-lr_19068_v15 status is now deployed due to DeploymentManager action
2026-03-25T12:31:34.729792+00:00 monitor updated for chaiml-csfs-v3-3-dpo-lr5_937_v10
Inference service chaiml-csfs-v3-3-dpo-lr5-937-v10 ready after 160.4884431362152s
Pipeline stage VLLMDeployer completed in 161.29s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.483574151992798s
Received healthy response to inference request in 2.7250635623931885s
Received healthy response to inference request in 2.7049195766448975s
Received healthy response to inference request in 4.1872851848602295s
Received healthy response to inference request in 4.256525278091431s
Received healthy response to inference request in 4.236422300338745s
Received healthy response to inference request in 2.7878952026367188s
Received healthy response to inference request in 2.8013508319854736s
Received healthy response to inference request in 2.6153132915496826s
2026-03-25T12:32:34.845580+00:00 monitor updated for chaiml-csfs-v3-3-dpo-lr5_937_v10
Received healthy response to inference request in 2.866539239883423s
Received healthy response to inference request in 2.92407488822937s
Received healthy response to inference request in 2.8564047813415527s
Received healthy response to inference request in 2.6689088344573975s
Received healthy response to inference request in 2.667104721069336s
Received healthy response to inference request in 2.703732490539551s
Received healthy response to inference request in 4.183555841445923s
Received healthy response to inference request in 2.6569206714630127s
Received healthy response to inference request in 2.6898114681243896s
Received healthy response to inference request in 2.64263653755188s
Received healthy response to inference request in 2.6619458198547363s
Received healthy response to inference request in 2.6830246448516846s
Received healthy response to inference request in 2.8628861904144287s
Received healthy response to inference request in 2.6942105293273926s
Received healthy response to inference request in 2.661766767501831s
Received healthy response to inference request in 2.700192928314209s
Received healthy response to inference request in 2.6636099815368652s
Received healthy response to inference request in 2.6733293533325195s
Received healthy response to inference request in 2.9556539058685303s
Received healthy response to inference request in 2.852534532546997s
Received healthy response to inference request in 2.658160448074341s
30 requests
0 failed requests
5th percentile: 2.6490643978118897
10th percentile: 2.658036470413208
20th percentile: 2.6632771492004395
30th percentile: 2.6720031976699827
40th percentile: 2.6924509048461913
50th percentile: 2.704326033592224
60th percentile: 2.793277454376221
70th percentile: 2.8583492040634155
2026-03-25T12:33:34.949082+00:00 monitor updated for chaiml-csfs-v3-3-dpo-lr5_937_v10
80th percentile: 2.9303906917572022
90th percentile: 4.192198896408081
95th percentile: 4.247478938102722
99th percentile: 4.417729978561401
mean time: 2.9908451318740843
Pipeline stage StressChecker completed in 92.86s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.70s
Shutdown handler de-registered
chaiml-csfs-v3-3-dpo-lr5_937_v10 status is now deployed due to DeploymentManager action
chaiml-csfs-v3-3-dpo-lr5_937_v10 status is now inactive due to auto deactivation removed underperforming models
chaiml-csfs-v3-3-dpo-lr5_937_v10 status is now torndown due to DeploymentManager action