Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-q80b-a3b-chai-11-41637-v1
Waiting for inference service chaiml-q80b-a3b-chai-11-41637-v1 to be ready
Failed to get response for submission chaiml-prm-csfsv3-300k-_41257_v1: HTTPConnectionPool(host='chaiml-prm-csfsv3-300k-41257-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-prm-icld-v1-pair_12474_v1: HTTPConnectionPool(host='chaiml-prm-icld-v1-pair-12474-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Inference service chaiml-q80b-a3b-chai-11-41637-v1 ready after 442.3945960998535s
Pipeline stage VLLMDeployer completed in 443.00s
run pipeline stage %s
Running pipeline stage StressChecker
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
Received healthy response to inference request in 1.3493719100952148s
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
Received healthy response to inference request in 2.7135212421417236s
Received healthy response to inference request in 1.4720478057861328s
Received healthy response to inference request in 2.0853967666625977s
Received healthy response to inference request in 1.2705979347229004s
Received healthy response to inference request in 1.9001991748809814s
Received healthy response to inference request in 1.591965913772583s
Received healthy response to inference request in 1.1326022148132324s
Received healthy response to inference request in 2.364941358566284s
Received healthy response to inference request in 1.2467811107635498s
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
Received healthy response to inference request in 1.2474555969238281s
{"detail":"HTTPConnectionPool(host='chaiml-q80b-a3b-chai-11-41637-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)"}
Received unhealthy response to inference request!
Received healthy response to inference request in 1.598325490951538s
Failed to get response for submission chaiml-prm-csfsv3-300k-_98290_v1: HTTPConnectionPool(host='chaiml-prm-csfsv3-300k-98290-v1-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Received healthy response to inference request in 1.480966329574585s
Received healthy response to inference request in 1.0314855575561523s
Received healthy response to inference request in 1.1882896423339844s
Received healthy response to inference request in 1.2924644947052002s
Received healthy response to inference request in 1.1670520305633545s
Received healthy response to inference request in 1.5182843208312988s
30 requests
12 failed requests
5th percentile: 1.1481046319007873
10th percentile: 1.1861658811569213
20th percentile: 1.265969467163086
30th percentile: 1.4352450370788574
40th percentile: 1.5624932765960695
50th percentile: 1.9927979707717896
60th percentile: 6.498831415176378
70th percentile: 12.187367224693299
80th percentile: 12.317961025238038
90th percentile: 12.380172967910767
95th percentile: 12.397788381576538
99th percentile: 12.458671641349792
mean time: 5.84378121693929
%s, retrying in %s seconds...
Received healthy response to inference request in 1.2151551246643066s
Received healthy response to inference request in 1.37996506690979s
Received healthy response to inference request in 2.004617214202881s
Received healthy response to inference request in 1.101499319076538s
Received healthy response to inference request in 1.3117098808288574s
Received healthy response to inference request in 1.0815067291259766s
Received healthy response to inference request in 1.6416208744049072s
Received healthy response to inference request in 1.1398968696594238s
Received healthy response to inference request in 1.4570245742797852s
Received healthy response to inference request in 1.3592991828918457s
Received healthy response to inference request in 1.8028299808502197s
Received healthy response to inference request in 1.1516625881195068s
Received healthy response to inference request in 1.1984658241271973s
Received healthy response to inference request in 1.5229213237762451s
Received healthy response to inference request in 1.6047728061676025s
Received healthy response to inference request in 1.372196912765503s
Received healthy response to inference request in 1.2167317867279053s
Received healthy response to inference request in 1.172273874282837s
Received healthy response to inference request in 1.1343741416931152s
Received healthy response to inference request in 1.226027488708496s
Received healthy response to inference request in 1.1234190464019775s
Received healthy response to inference request in 1.632549524307251s
Received healthy response to inference request in 1.2461204528808594s
Received healthy response to inference request in 1.2024855613708496s
Received healthy response to inference request in 1.4609763622283936s
Received healthy response to inference request in 1.669114112854004s
Received healthy response to inference request in 1.3998112678527832s
Received healthy response to inference request in 1.500206708908081s
Received healthy response to inference request in 1.3171591758728027s
Received healthy response to inference request in 1.2310218811035156s
30 requests
0 failed requests
5th percentile: 1.1113631963729858
10th percentile: 1.1332786321640014
20th percentile: 1.168151617050171
30th percentile: 1.2113542556762695
40th percentile: 1.2290241241455078
50th percentile: 1.31443452835083
60th percentile: 1.3753041744232177
70th percentile: 1.4582101106643677
80th percentile: 1.5392916202545168
90th percentile: 1.644370198249817
95th percentile: 1.7426578402519222
99th percentile: 1.9460989165306093
mean time: 1.3625805219014486
Pipeline stage StressChecker completed in 221.35s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.71s
Shutdown handler de-registered
chaiml-q80b-a3b-chai-11_41637_v1 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 1747.49s
Shutdown handler de-registered
chaiml-q80b-a3b-chai-11_41637_v1 status is now inactive due to auto deactivation removed underperforming models
chaiml-q80b-a3b-chai-11_41637_v1 status is now torndown due to DeploymentManager action