Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name nitral-ai-captain-bmo-12b-v60-mkmlizer
Waiting for job on nitral-ai-captain-bmo-12b-v60-mkmlizer to finish
nitral-ai-captain-bmo-12b-v60-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ Version: 0.27.1+vampire_v3 ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ https://mk1.ai ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ The license key for the current software has been verified as ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ belonging to: ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ Chai Research Corp. ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ║ ║
nitral-ai-captain-bmo-12b-v60-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Failed to get response for submission chaiml-cyndonia24b-cpos_38112_v2: HTTPConnectionPool(host='chaiml-cyndonia24b-cpos-38112-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-cyndonia24b-cpos_38112_v2: HTTPConnectionPool(host='chaiml-cyndonia24b-cpos-38112-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
nitral-ai-captain-bmo-12b-v60-mkmlizer: Downloaded to shared memory in 33.396s
nitral-ai-captain-bmo-12b-v60-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp_fa64cf7, device:0
nitral-ai-captain-bmo-12b-v60-mkmlizer: Saving flywheel model at /dev/shm/model_cache
nitral-ai-captain-bmo-12b-v60-mkmlizer: quantized model in 32.601s
nitral-ai-captain-bmo-12b-v60-mkmlizer: Processed model Nitral-AI/Captain_BMO-12B in 65.997s
nitral-ai-captain-bmo-12b-v60-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
nitral-ai-captain-bmo-12b-v60-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/nitral-ai-captain-bmo-12b-v60
nitral-ai-captain-bmo-12b-v60-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/nitral-ai-captain-bmo-12b-v60/config.json
nitral-ai-captain-bmo-12b-v60-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/nitral-ai-captain-bmo-12b-v60/special_tokens_map.json
nitral-ai-captain-bmo-12b-v60-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/nitral-ai-captain-bmo-12b-v60/tokenizer_config.json
nitral-ai-captain-bmo-12b-v60-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/nitral-ai-captain-bmo-12b-v60/tokenizer.json
nitral-ai-captain-bmo-12b-v60-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/nitral-ai-captain-bmo-12b-v60/flywheel_model.0.safetensors
nitral-ai-captain-bmo-12b-v60-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%| | 2/363 [00:06<19:05, 3.17s/it]
Loading 0: 2%|▏ | 6/363 [00:06<05:05, 1.17it/s]
Loading 0: 4%|▎ | 13/363 [00:06<01:48, 3.22it/s]
Loading 0: 5%|▍ | 17/363 [00:06<01:14, 4.67it/s]
Loading 0: 6%|▌ | 22/363 [00:06<00:48, 7.06it/s]
Loading 0: 7%|▋ | 26/363 [00:07<00:36, 9.26it/s]
Loading 0: 9%|▊ | 31/363 [00:07<00:25, 12.95it/s]
Loading 0: 10%|▉ | 35/363 [00:07<00:21, 15.60it/s]
Loading 0: 11%|█ | 40/363 [00:07<00:20, 15.44it/s]
Loading 0: 12%|█▏ | 44/363 [00:07<00:17, 17.95it/s]
Loading 0: 13%|█▎ | 49/363 [00:07<00:14, 22.20it/s]
Loading 0: 15%|█▍ | 53/363 [00:07<00:12, 24.15it/s]
Loading 0: 16%|█▌ | 58/363 [00:08<00:10, 28.98it/s]
Loading 0: 17%|█▋ | 62/363 [00:08<00:10, 29.42it/s]
Loading 0: 18%|█▊ | 67/363 [00:08<00:09, 32.80it/s]
Loading 0: 20%|█▉ | 71/363 [00:08<00:08, 32.95it/s]
Loading 0: 21%|██ | 76/363 [00:08<00:07, 35.92it/s]
Loading 0: 22%|██▏ | 80/363 [00:08<00:08, 34.63it/s]
Loading 0: 23%|██▎ | 85/363 [00:08<00:07, 37.20it/s]
Loading 0: 25%|██▍ | 89/363 [00:08<00:07, 34.70it/s]
Loading 0: 26%|██▌ | 94/363 [00:09<00:07, 38.01it/s]
Loading 0: 27%|██▋ | 98/363 [00:09<00:07, 36.16it/s]
Loading 0: 28%|██▊ | 103/363 [00:09<00:06, 37.73it/s]
Loading 0: 29%|██▉ | 107/363 [00:09<00:07, 34.70it/s]
Loading 0: 31%|███ | 112/363 [00:09<00:06, 37.06it/s]
Loading 0: 32%|███▏ | 116/363 [00:09<00:07, 34.82it/s]
Loading 0: 33%|███▎ | 121/363 [00:09<00:09, 24.73it/s]
Loading 0: 34%|███▍ | 124/363 [00:10<00:09, 24.73it/s]
Loading 0: 36%|███▌ | 130/363 [00:10<00:07, 30.89it/s]
Loading 0: 37%|███▋ | 134/363 [00:10<00:07, 30.56it/s]
Loading 0: 38%|███▊ | 139/363 [00:10<00:06, 33.15it/s]
Loading 0: 39%|███▉ | 143/363 [00:10<00:06, 32.55it/s]
Loading 0: 41%|████ | 148/363 [00:10<00:06, 35.00it/s]
Loading 0: 42%|████▏ | 152/363 [00:10<00:06, 33.15it/s]
Loading 0: 43%|████▎ | 157/363 [00:10<00:05, 35.08it/s]
Loading 0: 44%|████▍ | 161/363 [00:11<00:05, 34.04it/s]
Loading 0: 46%|████▌ | 166/363 [00:11<00:05, 35.21it/s]
Loading 0: 47%|████▋ | 170/363 [00:11<00:05, 32.56it/s]
Loading 0: 48%|████▊ | 175/363 [00:11<00:05, 34.85it/s]
Loading 0: 49%|████▉ | 179/363 [00:11<00:05, 33.62it/s]
Loading 0: 51%|█████ | 184/363 [00:11<00:04, 35.84it/s]
Loading 0: 52%|█████▏ | 188/363 [00:11<00:05, 34.00it/s]
Loading 0: 53%|█████▎ | 193/363 [00:12<00:04, 36.21it/s]
Loading 0: 54%|█████▍ | 197/363 [00:12<00:04, 34.85it/s]
Loading 0: 56%|█████▌ | 202/363 [00:12<00:06, 25.48it/s]
Loading 0: 56%|█████▋ | 205/363 [00:12<00:06, 24.63it/s]
Loading 0: 58%|█████▊ | 209/363 [00:12<00:05, 27.26it/s]
Loading 0: 59%|█████▊ | 213/363 [00:12<00:05, 26.28it/s]
Loading 0: 60%|██████ | 218/363 [00:12<00:04, 30.58it/s]
Loading 0: 61%|██████ | 222/363 [00:13<00:04, 28.49it/s]
Loading 0: 63%|██████▎ | 229/363 [00:13<00:03, 35.70it/s]
Loading 0: 64%|██████▍ | 233/363 [00:13<00:03, 33.88it/s]
Loading 0: 65%|██████▌ | 237/363 [00:13<00:03, 34.98it/s]
Loading 0: 66%|██████▋ | 241/363 [00:13<00:03, 30.92it/s]
Loading 0: 67%|██████▋ | 245/363 [00:13<00:03, 32.53it/s]
Loading 0: 69%|██████▊ | 249/363 [00:13<00:03, 29.61it/s]
Loading 0: 70%|██████▉ | 254/363 [00:14<00:03, 33.98it/s]
Loading 0: 71%|███████ | 258/363 [00:14<00:03, 31.14it/s]
Loading 0: 73%|███████▎ | 265/363 [00:14<00:02, 38.35it/s]
Loading 0: 74%|███████▍ | 270/363 [00:14<00:02, 38.04it/s]
Loading 0: 76%|███████▌ | 275/363 [00:14<00:02, 38.90it/s]
Loading 0: 77%|███████▋ | 279/363 [00:14<00:02, 38.11it/s]
Loading 0: 78%|███████▊ | 283/363 [00:15<00:03, 24.70it/s]
Loading 0: 79%|███████▉ | 287/363 [00:15<00:02, 26.30it/s]
Loading 0: 80%|████████ | 292/363 [00:15<00:02, 29.91it/s]
Loading 0: 82%|████████▏ | 296/363 [00:15<00:02, 29.95it/s]
Loading 0: 83%|████████▎ | 301/363 [00:15<00:01, 33.00it/s]
Loading 0: 84%|████████▍ | 305/363 [00:15<00:01, 31.74it/s]
Loading 0: 85%|████████▌ | 310/363 [00:15<00:01, 34.41it/s]
Loading 0: 87%|████████▋ | 314/363 [00:15<00:01, 33.90it/s]
Loading 0: 88%|████████▊ | 319/363 [00:15<00:01, 37.65it/s]
Loading 0: 89%|████████▉ | 323/363 [00:16<00:01, 37.40it/s]
Loading 0: 90%|█████████ | 328/363 [00:16<00:00, 40.36it/s]
Loading 0: 92%|█████████▏| 333/363 [00:16<00:00, 40.49it/s]
Loading 0: 93%|█████████▎| 338/363 [00:16<00:00, 40.89it/s]
Loading 0: 94%|█████████▍| 343/363 [00:16<00:00, 42.07it/s]
Loading 0: 96%|█████████▌| 348/363 [00:16<00:00, 34.00it/s]
Loading 0: 98%|█████████▊| 355/363 [00:16<00:00, 40.61it/s]
Loading 0: 99%|█████████▉| 360/363 [00:17<00:00, 39.00it/s]
Job nitral-ai-captain-bmo-12b-v60-mkmlizer completed after 94.48s with status: succeeded
Stopping job with name nitral-ai-captain-bmo-12b-v60-mkmlizer
Pipeline stage MKMLizer completed in 94.98s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service nitral-ai-captain-bmo-12b-v60
Waiting for inference service nitral-ai-captain-bmo-12b-v60 to be ready
Failed to get response for submission nitral-ai-captain-bmo-12b_v59: HTTPConnectionPool(host='nitral-ai-captain-bmo-12b-v59-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Inference service nitral-ai-captain-bmo-12b-v60 ready after 110.45921993255615s
Pipeline stage MKMLDeployer completed in 111.34s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.3023436069488525s
Received healthy response to inference request in 1.6535663604736328s
Received healthy response to inference request in 1.7286527156829834s
Received healthy response to inference request in 1.917849063873291s
5 requests
1 failed requests
5th percentile: 1.668583631515503
10th percentile: 1.683600902557373
20th percentile: 1.7136354446411133
30th percentile: 1.766491985321045
40th percentile: 1.842170524597168
50th percentile: 1.917849063873291
60th percentile: 2.071646881103516
70th percentile: 2.22544469833374
80th percentile: 5.869265508651736
90th percentile: 13.003109312057497
95th percentile: 16.570031213760373
99th percentile: 19.42356873512268
mean time: 5.5478729724884035
%s, retrying in %s seconds...
Received healthy response to inference request in 1.9067509174346924s
Received healthy response to inference request in 1.5932207107543945s
Received healthy response to inference request in 1.940047264099121s
Received healthy response to inference request in 1.6785688400268555s
Received healthy response to inference request in 1.5405521392822266s
5 requests
0 failed requests
5th percentile: 1.55108585357666
10th percentile: 1.5616195678710938
20th percentile: 1.582686996459961
30th percentile: 1.6102903366088868
40th percentile: 1.644429588317871
50th percentile: 1.6785688400268555
60th percentile: 1.7698416709899902
70th percentile: 1.861114501953125
80th percentile: 1.9134101867675781
90th percentile: 1.9267287254333496
95th percentile: 1.9333879947662354
99th percentile: 1.938715410232544
mean time: 1.731827974319458
Pipeline stage StressChecker completed in 38.71s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.67s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.69s
Shutdown handler de-registered
nitral-ai-captain-bmo-12b_v60 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4270.87s
Shutdown handler de-registered
nitral-ai-captain-bmo-12b_v60 status is now inactive due to auto deactivation removed underperforming models
nitral-ai-captain-bmo-12b_v60 status is now torndown due to DeploymentManager action