Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name alexcuadron-chai-sft-12b-v5-mkmlizer
Waiting for job on alexcuadron-chai-sft-12b-v5-mkmlizer to finish
alexcuadron-chai-sft-12b-v5-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
alexcuadron-chai-sft-12b-v5-mkmlizer: ║ _____ __ __ ║
alexcuadron-chai-sft-12b-v5-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
alexcuadron-chai-sft-12b-v5-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
alexcuadron-chai-sft-12b-v5-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
alexcuadron-chai-sft-12b-v5-mkmlizer: ║ /___/ ║
alexcuadron-chai-sft-12b-v5-mkmlizer: ║ ║
alexcuadron-chai-sft-12b-v5-mkmlizer: ║ Version: 0.12.8 ║
alexcuadron-chai-sft-12b-v5-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
Failed to get response for submission jellywibble-enzo-mafia-enemy_v12: HTTPConnectionPool(host='jellywibble-enzo-mafia-enemy-v12-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
alexcuadron-chai-sft-12b-v5-mkmlizer: ║ https://mk1.ai ║
alexcuadron-chai-sft-12b-v5-mkmlizer: ║ ║
alexcuadron-chai-sft-12b-v5-mkmlizer: ║ The license key for the current software has been verified as ║
alexcuadron-chai-sft-12b-v5-mkmlizer: ║ belonging to: ║
alexcuadron-chai-sft-12b-v5-mkmlizer: ║ ║
alexcuadron-chai-sft-12b-v5-mkmlizer: ║ Chai Research Corp. ║
alexcuadron-chai-sft-12b-v5-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
alexcuadron-chai-sft-12b-v5-mkmlizer: ║ Expiration: 2025-04-15 23:59:59 ║
alexcuadron-chai-sft-12b-v5-mkmlizer: ║ ║
alexcuadron-chai-sft-12b-v5-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
alexcuadron-chai-sft-12b-v5-mkmlizer: Downloaded to shared memory in 46.158s
alexcuadron-chai-sft-12b-v5-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpmvwn6gkk, device:0
alexcuadron-chai-sft-12b-v5-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Failed to get response for submission jellywibble-enzo-mafia-enemy_v12: HTTPConnectionPool(host='jellywibble-enzo-mafia-enemy-v12-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
alexcuadron-chai-sft-12b-v5-mkmlizer: quantized model in 36.224s
alexcuadron-chai-sft-12b-v5-mkmlizer: Processed model AlexCuadron/chai-sft-12b in 82.383s
alexcuadron-chai-sft-12b-v5-mkmlizer: creating bucket guanaco-mkml-models
alexcuadron-chai-sft-12b-v5-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
alexcuadron-chai-sft-12b-v5-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/alexcuadron-chai-sft-12b-v5
alexcuadron-chai-sft-12b-v5-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/alexcuadron-chai-sft-12b-v5/tokenizer_config.json
alexcuadron-chai-sft-12b-v5-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/alexcuadron-chai-sft-12b-v5/tokenizer.json
alexcuadron-chai-sft-12b-v5-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/alexcuadron-chai-sft-12b-v5/flywheel_model.0.safetensors
alexcuadron-chai-sft-12b-v5-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 82%|████████▏ | 299/363 [00:07<00:01, 44.79it/s]
Loading 0: 84%|████████▎ | 304/363 [00:13<00:23, 2.52it/s]
Loading 0: 99%|█████████▉| 361/363 [00:15<00:00, 32.99it/s]
Job alexcuadron-chai-sft-12b-v5-mkmlizer completed after 114.31s with status: succeeded
Stopping job with name alexcuadron-chai-sft-12b-v5-mkmlizer
Pipeline stage MKMLizer completed in 114.79s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service alexcuadron-chai-sft-12b-v5
Waiting for inference service alexcuadron-chai-sft-12b-v5 to be ready
Inference service alexcuadron-chai-sft-12b-v5 ready after 90.30964207649231s
Pipeline stage MKMLDeployer completed in 90.88s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2625558376312256s
Received healthy response to inference request in 1.6355719566345215s
Received healthy response to inference request in 1.5316870212554932s
Received healthy response to inference request in 1.488600730895996s
Received healthy response to inference request in 1.3008201122283936s
5 requests
0 failed requests
5th percentile: 1.338376235961914
10th percentile: 1.3759323596954345
20th percentile: 1.4510446071624756
30th percentile: 1.4972179889678956
40th percentile: 1.5144525051116944
50th percentile: 1.5316870212554932
60th percentile: 1.5732409954071045
70th percentile: 1.6147949695587158
Failed to get response for submission jellywibble-enzo-mafia-enemy_v12: HTTPConnectionPool(host='jellywibble-enzo-mafia-enemy-v12-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
80th percentile: 1.7609687328338623
90th percentile: 2.011762285232544
95th percentile: 2.1371590614318845
99th percentile: 2.2374764823913575
mean time: 1.6438471317291259
Pipeline stage StressChecker completed in 9.55s
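
Note on the StressChecker figures above: the reported percentiles are consistent with linear interpolation between the sorted latencies of the 5 healthy responses. This is the default behaviour of common percentile routines, but the log does not say which one the pipeline uses. The sketch below reproduces the arithmetic under that assumption. The helper name `percentile` is hypothetical and is not taken from the pipeline's code.

```python
import math


def percentile(values, q):
    """Return the q-th percentile (q in [0, 100]) using linear interpolation.

    Assumption: the pipeline interpolates linearly between the two nearest
    ranks; this is inferred from the logged numbers, not from its source.
    """
    ordered = sorted(values)
    if not ordered:
        raise ValueError("values must not be empty")
    # Fractional position of the requested percentile in the sorted list.
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


# Latencies (seconds) copied from the five "Received healthy response" lines.
latencies = [
    2.2625558376312256,
    1.6355719566345215,
    1.5316870212554932,
    1.488600730895996,
    1.3008201122283936,
]

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only 5 samples, every percentile above the 75th is an interpolation between the two slowest requests, so the 95th and 99th percentile values mostly reflect the single 2.26s response.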
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.71s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.69s
Shutdown handler de-registered
alexcuadron-chai-sft-12b_v5 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4546.78s
Shutdown handler de-registered
alexcuadron-chai-sft-12b_v5 status is now inactive due to auto deactivation removed underperforming models
alexcuadron-chai-sft-12b_v5 status is now torndown due to DeploymentManager action