Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.14s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-q80b-a3b-chai-11-41637-v3
Waiting for inference service chaiml-q80b-a3b-chai-11-41637-v3 to be ready
Inference service chaiml-q80b-a3b-chai-11-41637-v3 ready after 432.74529337882996s
Pipeline stage VLLMDeployer completed in 433.27s
run pipeline stage %s
Running pipeline stage StressChecker
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
Received healthy response to inference request in 2.500054359436035s
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
Received healthy response to inference request in 11.123239994049072s
Received healthy response to inference request in 12.284926176071167s
Received healthy response to inference request in 2.1581642627716064s
Received healthy response to inference request in 1.1504509449005127s
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
Received healthy response to inference request in 1.5555531978607178s
Received healthy response to inference request in 1.1404693126678467s
Received healthy response to inference request in 1.8598852157592773s
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
Received healthy response to inference request in 1.2156176567077637s
Received healthy response to inference request in 1.3630216121673584s
Received healthy response to inference request in 1.2197802066802979s
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
Received healthy response to inference request in 1.6964478492736816s
Received healthy response to inference request in 1.9105310440063477s
Received healthy response to inference request in 1.2635149955749512s
Received healthy response to inference request in 1.437412977218628s
Received healthy response to inference request in 2.308697462081909s
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
Received healthy response to inference request in 1.8951680660247803s
30 requests
13 failed requests
5th percentile: 1.1797759652137756
10th percentile: 1.2193639516830443
20th percentile: 1.422534704208374
30th percentile: 1.8108540058135985
40th percentile: 2.0591109752655035
50th percentile: 6.811647176742554
60th percentile: 12.169560813903809
70th percentile: 12.196109557151795
80th percentile: 12.308017110824585
90th percentile: 12.420736503601074
95th percentile: 12.430408906936645
99th percentile: 12.603531377315521
mean time: 6.939507420857748
%s, retrying in %s seconds...
Received healthy response to inference request in 1.0981552600860596s
Received healthy response to inference request in 1.1255691051483154s
Received healthy response to inference request in 1.5463249683380127s
Received healthy response to inference request in 1.3891563415527344s
Received healthy response to inference request in 1.1611571311950684s
Received healthy response to inference request in 1.4292376041412354s
Received healthy response to inference request in 1.5796751976013184s
Received healthy response to inference request in 1.5102086067199707s
Received healthy response to inference request in 1.1483678817749023s
Received healthy response to inference request in 2.3868846893310547s
Received healthy response to inference request in 1.4594459533691406s
Received healthy response to inference request in 1.8065087795257568s
Received healthy response to inference request in 1.3264977931976318s
Received healthy response to inference request in 1.5198123455047607s
Received healthy response to inference request in 1.4958605766296387s
Received healthy response to inference request in 1.7576444149017334s
Received healthy response to inference request in 1.3498382568359375s
Received healthy response to inference request in 1.0890765190124512s
Received healthy response to inference request in 1.4227252006530762s
Received healthy response to inference request in 1.683356523513794s
Received healthy response to inference request in 1.6936068534851074s
Received healthy response to inference request in 1.3910510540008545s
Received healthy response to inference request in 1.2981762886047363s
Received healthy response to inference request in 1.4574253559112549s
Received healthy response to inference request in 1.4438707828521729s
Received healthy response to inference request in 1.5320439338684082s
Received healthy response to inference request in 1.1226470470428467s
Received healthy response to inference request in 1.8661320209503174s
Received healthy response to inference request in 1.3424084186553955s
Received healthy response to inference request in 1.8489997386932373s
30 requests
0 failed requests
5th percentile: 1.1091765642166138
10th percentile: 1.1252768993377686
20th percentile: 1.2707724571228027
30th percentile: 1.3476093053817748
40th percentile: 1.4100555419921876
50th percentile: 1.4506480693817139
60th percentile: 1.5015997886657715
70th percentile: 1.5363282442092896
80th percentile: 1.6854065895080568
90th percentile: 1.8107578754425049
95th percentile: 1.8584224939346312
99th percentile: 2.235866415500641
mean time: 1.4760621547698975
Pipeline stage StressChecker completed in 257.64s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.77s
Shutdown handler de-registered
chaiml-q80b-a3b-chai-11_41637_v3 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
clean up pipeline due to error=DeploymentChecksError("('http://chaiml-q80b-a3b-chai-11-41637-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/completions', '')")
Shutdown handler de-registered
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
clean up pipeline due to error=DeploymentChecksError("('http://chaiml-q80b-a3b-chai-11-41637-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/completions', '')")
Shutdown handler de-registered
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
clean up pipeline due to error=DeploymentChecksError("('http://chaiml-q80b-a3b-chai-11-41637-v3-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com/v1/completions', '')")
Shutdown handler de-registered