Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.13s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-kimid-v9-noname-29495-v1
Waiting for inference service chaiml-kimid-v9-noname-29495-v1 to be ready
Inference service chaiml-kimid-v9-noname-29495-v1 ready after 520.5991570949554s
Pipeline stage VLLMDeployer completed in 523.54s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.7560279369354248s
Received healthy response to inference request in 1.7844769954681396s
Received healthy response to inference request in 1.765657901763916s
Received healthy response to inference request in 1.8552892208099365s
Received healthy response to inference request in 1.8979125022888184s
Received healthy response to inference request in 1.6971831321716309s
Received healthy response to inference request in 1.8205175399780273s
Received healthy response to inference request in 2.1092476844787598s
Received healthy response to inference request in 1.6782503128051758s
Received healthy response to inference request in 1.839719533920288s
Received healthy response to inference request in 1.9258337020874023s
Received healthy response to inference request in 1.8904995918273926s
Received healthy response to inference request in 1.8916375637054443s
Received healthy response to inference request in 1.670309066772461s
Received healthy response to inference request in 1.7916889190673828s
Received healthy response to inference request in 1.6788220405578613s
Received healthy response to inference request in 1.7620594501495361s
Received healthy response to inference request in 1.6732361316680908s
Received healthy response to inference request in 2.191304922103882s
Received healthy response to inference request in 1.6788735389709473s
Received healthy response to inference request in 1.9533092975616455s
Received healthy response to inference request in 1.6836192607879639s
Received healthy response to inference request in 2.3483707904815674s
Received healthy response to inference request in 1.6966698169708252s
Received healthy response to inference request in 1.7294988632202148s
Received healthy response to inference request in 1.6620402336120605s
Received healthy response to inference request in 1.6767630577087402s
Received healthy response to inference request in 1.7058589458465576s
Received healthy response to inference request in 1.6584117412567139s
Received healthy response to inference request in 2.393186569213867s
30 requests
0 failed requests
5th percentile: 1.6657612085342408
10th percentile: 1.6729434251785278
20th percentile: 1.6787076950073243
30th percentile: 1.6927546501159667
40th percentile: 1.720042896270752
50th percentile: 1.763858675956726
60th percentile: 1.8032203674316405
70th percentile: 1.8658523321151732
80th percentile: 1.9034967422485352
90th percentile: 2.117453408241272
95th percentile: 2.2776911497116084
99th percentile: 2.3801899933815003
mean time: 1.8288758754730225
Pipeline stage StressChecker completed in 57.63s
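Note on the StressChecker summary above: the log does not show how the percentiles are computed. The reported figures are consistent with linear interpolation between order statistics (rank = (n - 1) * q / 100), so the sketch below assumes that method. The `percentile` helper and the `latencies` list are illustrative names, not part of the pipeline; the values are copied from the 30 "Received healthy response" lines.

```python
import math

# Latencies in seconds, copied from the 30 logged inference requests.
latencies = [
    1.7560279369354248, 1.7844769954681396, 1.765657901763916,
    1.8552892208099365, 1.8979125022888184, 1.6971831321716309,
    1.8205175399780273, 2.1092476844787598, 1.6782503128051758,
    1.839719533920288, 1.9258337020874023, 1.8904995918273926,
    1.8916375637054443, 1.670309066772461, 1.7916889190673828,
    1.6788220405578613, 1.7620594501495361, 1.6732361316680908,
    2.191304922103882, 1.6788735389709473, 1.9533092975616455,
    1.6836192607879639, 2.3483707904815674, 1.6966698169708252,
    1.7294988632202148, 1.6620402336120605, 1.6767630577087402,
    1.7058589458465576, 1.6584117412567139, 2.393186569213867,
]


def percentile(values, q):
    """Hypothetical helper: q-th percentile via linear interpolation.

    Assumption: rank = (n - 1) * q / 100, interpolating between the two
    nearest order statistics. The real StressChecker code is not shown
    in the log and may differ.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = math.fsum(latencies) / len(latencies)

if __name__ == "__main__":
    print(f"{len(latencies)} requests")
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {mean_time}")
```

Under this assumption the sketch reproduces the logged 5th, 50th, 90th and 99th percentiles and the mean to within floating-point rounding.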
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.01s
Shutdown handler de-registered
chaiml-kimid-v9-noname-_29495_v1 status is now deployed due to DeploymentManager action
chaiml-kimid-v9-noname-_29495_v1 status is now inactive due to auto deactivation removed underperforming models
chaiml-kimid-v9-noname-_29495_v1 status is now torndown due to DeploymentManager action