Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name nitral-ai-captain-bmo-12b-v75-mkmlizer
Waiting for job on nitral-ai-captain-bmo-12b-v75-mkmlizer to finish
nitral-ai-captain-bmo-12b-v75-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ Version: 0.29.3 ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ https://mk1.ai ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ The license key for the current software has been verified as ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ belonging to: ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ Chai Research Corp. ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ║ ║
nitral-ai-captain-bmo-12b-v75-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
nitral-ai-captain-bmo-12b-v75-mkmlizer: Downloaded to shared memory in 25.657s
nitral-ai-captain-bmo-12b-v75-mkmlizer: Checking if Nitral-AI/Captain_BMO-12B already exists in ChaiML
nitral-ai-captain-bmo-12b-v75-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpbrbmwli0, device:0
nitral-ai-captain-bmo-12b-v75-mkmlizer: Saving flywheel model at /dev/shm/model_cache
nitral-ai-captain-bmo-12b-v75-mkmlizer: quantized model in 30.190s
nitral-ai-captain-bmo-12b-v75-mkmlizer: Processed model Nitral-AI/Captain_BMO-12B in 55.932s
nitral-ai-captain-bmo-12b-v75-mkmlizer: creating bucket guanaco-mkml-models
nitral-ai-captain-bmo-12b-v75-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
nitral-ai-captain-bmo-12b-v75-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/nitral-ai-captain-bmo-12b-v75
nitral-ai-captain-bmo-12b-v75-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/nitral-ai-captain-bmo-12b-v75/config.json
nitral-ai-captain-bmo-12b-v75-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/nitral-ai-captain-bmo-12b-v75/special_tokens_map.json
nitral-ai-captain-bmo-12b-v75-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/nitral-ai-captain-bmo-12b-v75/tokenizer_config.json
nitral-ai-captain-bmo-12b-v75-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/nitral-ai-captain-bmo-12b-v75/tokenizer.json
nitral-ai-captain-bmo-12b-v75-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/nitral-ai-captain-bmo-12b-v75/flywheel_model.0.safetensors
nitral-ai-captain-bmo-12b-v75-mkmlizer:
Loading 0: 99%|█████████▉| 361/363 [00:15<00:00, 43.56it/s]
Job nitral-ai-captain-bmo-12b-v75-mkmlizer completed after 84.52s with status: succeeded
Stopping job with name nitral-ai-captain-bmo-12b-v75-mkmlizer
Pipeline stage MKMLizer completed in 85.07s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service nitral-ai-captain-bmo-12b-v75
Waiting for inference service nitral-ai-captain-bmo-12b-v75 to be ready
Inference service nitral-ai-captain-bmo-12b-v75 ready after 191.2250759601593s
Pipeline stage MKMLDeployer completed in 191.96s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.783792495727539s
Received healthy response to inference request in 2.0728466510772705s
Received healthy response to inference request in 1.5951900482177734s
Received healthy response to inference request in 1.7632970809936523s
Received healthy response to inference request in 1.8447070121765137s
5 requests
0 failed requests
5th percentile: 1.6288114547729493
10th percentile: 1.662432861328125
20th percentile: 1.7296756744384765
30th percentile: 1.7795790672302245
40th percentile: 1.812143039703369
50th percentile: 1.8447070121765137
60th percentile: 1.9359628677368164
70th percentile: 2.027218723297119
80th percentile: 2.2150358200073246
90th percentile: 2.499414157867432
95th percentile: 2.641603326797485
99th percentile: 2.7553546619415283
mean time: 2.01196665763855
Pipeline stage StressChecker completed in 11.41s
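Note on the StressChecker figures above: the reported percentiles and mean are consistent with linear interpolation between the sorted samples, which is the default method of numpy.percentile. The log does not say how StressChecker actually computes them, so the sketch below is an assumption for illustration only; the function name `percentile` and the variable `latencies` are hypothetical. It uses the five response times logged in this stage.

```python
import math


def percentile(samples, q):
    """Percentile by linear interpolation between the two nearest ranks.

    Assumption: this mirrors the default 'linear' method of numpy.percentile;
    the log does not confirm which method StressChecker uses.
    """
    ordered = sorted(samples)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


# The five healthy response times logged by the StressChecker stage, in seconds.
latencies = [
    2.783792495727539,
    2.0728466510772705,
    1.5951900482177734,
    1.7632970809936523,
    1.8447070121765137,
]

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples, every percentile above the 75th is interpolated between the two slowest requests, so the 95th and 99th percentile values mostly reflect the single 2.78 s response.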
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.67s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.62s
Shutdown handler de-registered
nitral-ai-captain-bmo-12b_v75 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
clean up pipeline due to error=DeploymentChecksError('None: None')
Shutdown handler de-registered
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
clean up pipeline due to error=DeploymentChecksError('None: None')
Shutdown handler de-registered
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
clean up pipeline due to error=DeploymentChecksError('None: None')
Shutdown handler de-registered
nitral-ai-captain-bmo-12b_v75 status is now inactive due to auto deactivation removed underperforming models
nitral-ai-captain-bmo-12b_v75 status is now torndown due to DeploymentManager action