Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name latitudegames-muse-12b-v1-mkmlizer
Waiting for job on latitudegames-muse-12b-v1-mkmlizer to finish
latitudegames-muse-12b-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
latitudegames-muse-12b-v1-mkmlizer: ║ ║
latitudegames-muse-12b-v1-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
latitudegames-muse-12b-v1-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
latitudegames-muse-12b-v1-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
latitudegames-muse-12b-v1-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
latitudegames-muse-12b-v1-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
latitudegames-muse-12b-v1-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
latitudegames-muse-12b-v1-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
latitudegames-muse-12b-v1-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
latitudegames-muse-12b-v1-mkmlizer: ║ ║
latitudegames-muse-12b-v1-mkmlizer: ║ Version: 0.29.15 ║
latitudegames-muse-12b-v1-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
latitudegames-muse-12b-v1-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
latitudegames-muse-12b-v1-mkmlizer: ║ https://mk1.ai ║
latitudegames-muse-12b-v1-mkmlizer: ║ ║
latitudegames-muse-12b-v1-mkmlizer: ║ The license key for the current software has been verified as ║
latitudegames-muse-12b-v1-mkmlizer: ║ belonging to: ║
latitudegames-muse-12b-v1-mkmlizer: ║ ║
latitudegames-muse-12b-v1-mkmlizer: ║ Chai Research Corp. ║
latitudegames-muse-12b-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
latitudegames-muse-12b-v1-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
latitudegames-muse-12b-v1-mkmlizer: ║ ║
latitudegames-muse-12b-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
latitudegames-muse-12b-v1-mkmlizer: Downloaded to shared memory in 34.744s
latitudegames-muse-12b-v1-mkmlizer: Checking if LatitudeGames/Muse-12B already exists in ChaiML
latitudegames-muse-12b-v1-mkmlizer: Creating repo ChaiML/Muse-12B and uploading /tmp/tmp7tex98o9 to it
latitudegames-muse-12b-v1-mkmlizer:
100%|██████████| 7/7 [00:30<00:00, 4.30s/it]
latitudegames-muse-12b-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp7tex98o9, device:0
latitudegames-muse-12b-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
latitudegames-muse-12b-v1-mkmlizer: quantized model in 30.901s
latitudegames-muse-12b-v1-mkmlizer: Processed model LatitudeGames/Muse-12B in 121.419s
latitudegames-muse-12b-v1-mkmlizer: creating bucket guanaco-mkml-models
latitudegames-muse-12b-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
latitudegames-muse-12b-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/latitudegames-muse-12b-v1/nvidia
latitudegames-muse-12b-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/latitudegames-muse-12b-v1/nvidia/special_tokens_map.json
latitudegames-muse-12b-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/latitudegames-muse-12b-v1/nvidia/config.json
latitudegames-muse-12b-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/latitudegames-muse-12b-v1/nvidia/tokenizer_config.json
latitudegames-muse-12b-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/latitudegames-muse-12b-v1/nvidia/tokenizer.json
latitudegames-muse-12b-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/latitudegames-muse-12b-v1/nvidia/flywheel_model.0.safetensors
latitudegames-muse-12b-v1-mkmlizer:
Loading 0: 99%|█████████▉| 360/363 [00:09<00:00, 42.55it/s]
Job latitudegames-muse-12b-v1-mkmlizer completed after 146.74s with status: succeeded
Stopping job with name latitudegames-muse-12b-v1-mkmlizer
Pipeline stage MKMLizer completed in 147.27s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.17s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service latitudegames-muse-12b-v1
Waiting for inference service latitudegames-muse-12b-v1 to be ready
Inference service latitudegames-muse-12b-v1 ready after 210.84398436546326s
Pipeline stage MKMLDeployer completed in 211.59s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.6145663261413574s
Received healthy response to inference request in 2.0240726470947266s
Received healthy response to inference request in 1.7902591228485107s
Received healthy response to inference request in 1.948925495147705s
Received healthy response to inference request in 2.105834722518921s
5 requests
0 failed requests
5th percentile: 1.8219923973083496
10th percentile: 1.8537256717681885
20th percentile: 1.9171922206878662
30th percentile: 1.9639549255371094
40th percentile: 1.994013786315918
50th percentile: 2.0240726470947266
60th percentile: 2.0567774772644043
70th percentile: 2.089482307434082
80th percentile: 2.207581043243408
90th percentile: 2.411073684692383
95th percentile: 2.51282000541687
99th percentile: 2.59421706199646
mean time: 2.096731662750244
Pipeline stage StressChecker completed in 11.93s
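The StressChecker summary above can be cross-checked by hand. The sketch below recomputes the percentiles and mean from the five logged response times, assuming the checker uses linear interpolation between closest ranks (the behaviour of NumPy's default percentile method). That is an inference from the numbers, not something the log states. The `percentile` helper and the `latencies` list are illustrative names and are not part of the pipeline.

```python
from statistics import mean

# Latencies in seconds, copied from the five "healthy response" lines above.
latencies = [
    2.6145663261413574,
    2.0240726470947266,
    1.7902591228485107,
    1.948925495147705,
    2.105834722518921,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) of values.

    Uses linear interpolation between the two closest ranks.
    This is an assumed method, chosen because it matches the logged figures.
    """
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0   # fractional index into the sorted list
    lower = int(pos)                        # rank just below the position
    upper = min(lower + 1, len(ordered) - 1)
    frac = pos - lower                      # how far between the two ranks
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q):.6f}")
print(f"mean time: {mean(latencies):.6f}")
```

With only five samples, every percentile other than the 0th, 25th, 50th, 75th and 100th is an interpolated value rather than an observed latency, so the 95th and 99th percentile figures mostly reflect the single slowest request (2.61 s).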
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.63s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.66s
Shutdown handler de-registered
latitudegames-muse-12b_v1 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 5637.01s
Shutdown handler de-registered
latitudegames-muse-12b_v1 status is now inactive due to auto deactivation removed underperforming models
latitudegames-muse-12b_v1 status is now torndown due to DeploymentManager action