Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-cai-real-v3b-v6-mkmlizer
Waiting for job on chaiml-cai-real-v3b-v6-mkmlizer to finish
chaiml-cai-real-v3b-v6-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-cai-real-v3b-v6-mkmlizer: ║ ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ Version: 0.30.2 ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ https://mk1.ai ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ belonging to: ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ Chai Research Corp. ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
chaiml-cai-real-v3b-v6-mkmlizer: ║ ║
chaiml-cai-real-v3b-v6-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Failed to get response for submission chaiml-02f4-69d4-linear-w01_v8: ('http://guanaco-model-mesh-load-balancer.model-mesh.k2.chaiverse.com/models/chaiml-02f4-69d4-linear-w01_v8/predict', '{"detail":"1 validation error for RuntimeResponse\\npredictions\\n Field required [type=missing, input_value={\'detail\': \\"503, message=...linear-w01_v8/predict\'\\"}, input_type=dict]\\n For further information visit https://errors.pydantic.dev/2.11/v/missing"}')
chaiml-cai-real-v3b-v6-mkmlizer: Downloaded to shared memory in 51.598s
chaiml-cai-real-v3b-v6-mkmlizer: Checking if ChaiML/cai-real-v3b already exists in ChaiML
chaiml-cai-real-v3b-v6-mkmlizer: quantizing model to /dev/shm/model_cache, profile:q4, folder:/tmp/tmpu10al8i1, device:0
chaiml-cai-real-v3b-v6-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-cai-real-v3b-v6-mkmlizer: quantized model in 284.442s
chaiml-cai-real-v3b-v6-mkmlizer: Processed model ChaiML/cai-real-v3b in 336.041s
chaiml-cai-real-v3b-v6-mkmlizer: creating bucket guanaco-mkml-models
chaiml-cai-real-v3b-v6-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-cai-real-v3b-v6-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-cai-real-v3b-v6/nvidia
chaiml-cai-real-v3b-v6-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-cai-real-v3b-v6/nvidia/config.json
chaiml-cai-real-v3b-v6-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-cai-real-v3b-v6/nvidia/special_tokens_map.json
chaiml-cai-real-v3b-v6-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-cai-real-v3b-v6/nvidia/tokenizer_config.json
chaiml-cai-real-v3b-v6-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-cai-real-v3b-v6/nvidia/tokenizer.json
chaiml-cai-real-v3b-v6-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/chaiml-cai-real-v3b-v6/nvidia/flywheel_model.1.safetensors
chaiml-cai-real-v3b-v6-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-cai-real-v3b-v6/nvidia/flywheel_model.0.safetensors
Job chaiml-cai-real-v3b-v6-mkmlizer completed after 410.78s with status: succeeded
Stopping job with name chaiml-cai-real-v3b-v6-mkmlizer
Pipeline stage MKMLizer completed in 411.96s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.14s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-cai-real-v3b-v6
Waiting for inference service chaiml-cai-real-v3b-v6 to be ready
Inference service chaiml-cai-real-v3b-v6 ready after 170.89418196678162s
Pipeline stage MKMLDeployer completed in 171.30s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.7698922157287598s
Received healthy response to inference request in 3.9639456272125244s
Received healthy response to inference request in 3.8620402812957764s
Received healthy response to inference request in 3.27726674079895s
Received healthy response to inference request in 3.347932815551758s
5 requests
0 failed requests
5th percentile: 3.2913999557495117
10th percentile: 3.3055331707000732
20th percentile: 3.3337996006011963
30th percentile: 3.432324695587158
40th percentile: 3.601108455657959
50th percentile: 3.7698922157287598
60th percentile: 3.8067514419555666
70th percentile: 3.843610668182373
80th percentile: 3.882421350479126
90th percentile: 3.923183488845825
95th percentile: 3.9435645580291747
99th percentile: 3.9598694133758543
mean time: 3.644215536117554
%s, retrying in %s seconds...
Received healthy response to inference request in 3.2729673385620117s
Received healthy response to inference request in 3.4713523387908936s
Received healthy response to inference request in 3.492730140686035s
Received healthy response to inference request in 3.4745373725891113s
Received healthy response to inference request in 3.4256210327148438s
5 requests
0 failed requests
5th percentile: 3.303498077392578
10th percentile: 3.3340288162231446
20th percentile: 3.3950902938842775
30th percentile: 3.4347672939300535
40th percentile: 3.4530598163604735
50th percentile: 3.4713523387908936
60th percentile: 3.4726263523101806
70th percentile: 3.4739003658294676
80th percentile: 3.478175926208496
90th percentile: 3.4854530334472655
95th percentile: 3.4890915870666506
99th percentile: 3.492002429962158
mean time: 3.427441644668579
Pipeline stage StressChecker completed in 38.69s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.67s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.61s
Shutdown handler de-registered
chaiml-cai-real-v3b_v6 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Skipping teardown as no inference service was successfully deployed
Pipeline stage MKMLProfilerDeleter completed in 0.08s
run pipeline stage %s
Running pipeline stage MKMLProfilerTemplater
Pipeline stage MKMLProfilerTemplater completed in 0.08s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeployer
Creating inference service chaiml-cai-real-v3b-v6-profiler
Waiting for inference service chaiml-cai-real-v3b-v6-profiler to be ready
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3402.56s
Shutdown handler de-registered