Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.92s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-kimid-v8a-kimidv-94399-v1
Waiting for inference service chaiml-kimid-v8a-kimidv-94399-v1 to be ready
Inference service chaiml-kimid-v8a-kimidv-94399-v1 ready after 503.3451223373413s
Pipeline stage VLLMDeployer completed in 504.03s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.0839622020721436s
Received healthy response to inference request in 2.763946533203125s
Received healthy response to inference request in 2.0128366947174072s
Received healthy response to inference request in 1.9109060764312744s
Received healthy response to inference request in 1.9397625923156738s
Received healthy response to inference request in 1.9808144569396973s
Received healthy response to inference request in 1.9459030628204346s
Received healthy response to inference request in 2.3027517795562744s
Received healthy response to inference request in 1.8468163013458252s
Received healthy response to inference request in 2.0207712650299072s
Received healthy response to inference request in 2.032493829727173s
Received healthy response to inference request in 1.933868646621704s
Received healthy response to inference request in 1.8495585918426514s
Received healthy response to inference request in 2.0120773315429688s
Received healthy response to inference request in 2.347219944000244s
Received healthy response to inference request in 1.9213144779205322s
Received healthy response to inference request in 1.8980543613433838s
Received healthy response to inference request in 2.126603841781616s
Received healthy response to inference request in 2.542625904083252s
Received healthy response to inference request in 1.945577621459961s
Received healthy response to inference request in 2.381009101867676s
Received healthy response to inference request in 2.0176591873168945s
Received healthy response to inference request in 1.8913609981536865s
Received healthy response to inference request in 2.044229030609131s
Received healthy response to inference request in 1.8675274848937988s
Received healthy response to inference request in 2.136500835418701s
Received healthy response to inference request in 2.026556968688965s
Received healthy response to inference request in 2.3043062686920166s
Received healthy response to inference request in 1.917536735534668s
Received healthy response to inference request in 2.095937728881836s
30 requests
0 failed requests
5th percentile: 1.8576445937156678
10th percentile: 1.8889776468276978
20th percentile: 1.9162106037139892
30th percentile: 1.937994408607483
40th percentile: 1.9668498992919923
50th percentile: 2.015247941017151
60th percentile: 2.028931713104248
70th percentile: 2.087554860115051
80th percentile: 2.1697510242462164
90th percentile: 2.3505988597869876
95th percentile: 2.4698983430862422
99th percentile: 2.699763550758362
mean time: 2.0700163284937543
Pipeline stage StressChecker completed in 64.77s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.83s
Shutdown handler de-registered
chaiml-kimid-v8a-kimidv_94399_v1 status is now deployed due to DeploymentManager action
chaiml-kimid-v8a-kimidv_94399_v1 status is now inactive due to auto deactivation removed underperforming models
chaiml-kimid-v8a-kimidv_94399_v1 status is now torndown due to DeploymentManager action