Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name alexcuadron-chai-v2-sft-14944-v4-mkmlizer
Waiting for job on alexcuadron-chai-v2-sft-14944-v4-mkmlizer to finish
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: ║ _____ __ __ ║
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: ║ /___/ ║
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: ║ ║
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: ║ Version: 0.12.8 ║
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: ║ https://mk1.ai ║
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: ║ ║
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: ║ The license key for the current software has been verified as ║
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: ║ belonging to: ║
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: ║ ║
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: ║ Chai Research Corp. ║
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: ║ Expiration: 2025-04-15 23:59:59 ║
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: ║ ║
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: Downloaded to shared memory in 33.037s
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpd5mm_pdb, device:0
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: Saving flywheel model at /dev/shm/model_cache
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: quantized model in 37.486s
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: Processed model AlexCuadron/chai-v2-sft-12k-12b in 70.524s
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: creating bucket guanaco-mkml-models
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/alexcuadron-chai-v2-sft-14944-v4
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/alexcuadron-chai-v2-sft-14944-v4/tokenizer.json
alexcuadron-chai-v2-sft-14944-v4-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/alexcuadron-chai-v2-sft-14944-v4/flywheel_model.0.safetensors
alexcuadron-chai-v2-sft-14944-v4-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%|▏ | 5/363 [00:00<00:12, 29.59it/s]
Loading 0: 3%|▎ | 12/363 [00:00<00:07, 46.56it/s]
Loading 0: 5%|▍ | 18/363 [00:00<00:07, 46.03it/s]
Loading 0: 6%|▋ | 23/363 [00:00<00:09, 34.82it/s]
Loading 0: 8%|▊ | 30/363 [00:00<00:07, 43.35it/s]
Loading 0: 10%|▉ | 35/363 [00:00<00:07, 42.17it/s]
Loading 0: 11%|█ | 40/363 [00:00<00:07, 42.03it/s]
Loading 0: 12%|█▏ | 45/363 [00:01<00:07, 42.87it/s]
Loading 0: 14%|█▍ | 50/363 [00:01<00:09, 34.53it/s]
Loading 0: 15%|█▌ | 56/363 [00:01<00:07, 40.11it/s]
Loading 0: 17%|█▋ | 61/363 [00:01<00:10, 29.98it/s]
Loading 0: 18%|█▊ | 65/363 [00:01<00:10, 29.74it/s]
Loading 0: 20%|█▉ | 71/363 [00:01<00:08, 35.77it/s]
Loading 0: 21%|██ | 76/363 [00:02<00:08, 35.75it/s]
Loading 0: 22%|██▏ | 81/363 [00:02<00:07, 37.32it/s]
Loading 0: 24%|██▎ | 86/363 [00:02<00:07, 38.83it/s]
Loading 0: 25%|██▌ | 91/363 [00:02<00:08, 32.17it/s]
Loading 0: 27%|██▋ | 98/363 [00:02<00:06, 39.05it/s]
Loading 0: 28%|██▊ | 103/363 [00:02<00:06, 38.65it/s]
Loading 0: 30%|██▉ | 108/363 [00:02<00:06, 40.27it/s]
Loading 0: 31%|███ | 113/363 [00:03<00:07, 33.56it/s]
Loading 0: 33%|███▎ | 118/363 [00:03<00:07, 32.98it/s]
Loading 0: 34%|███▍ | 125/363 [00:03<00:06, 39.60it/s]
Loading 0: 36%|███▌ | 130/363 [00:03<00:05, 38.91it/s]
Loading 0: 37%|███▋ | 135/363 [00:03<00:06, 37.12it/s]
Loading 0: 38%|███▊ | 139/363 [00:03<00:06, 36.17it/s]
Loading 0: 39%|███▉ | 143/363 [00:04<00:08, 25.83it/s]
Loading 0: 40%|████ | 147/363 [00:04<00:07, 27.61it/s]
Loading 0: 42%|████▏ | 151/363 [00:04<00:07, 29.49it/s]
Loading 0: 43%|████▎ | 156/363 [00:04<00:06, 33.78it/s]
Loading 0: 44%|████▍ | 160/363 [00:04<00:06, 33.19it/s]
Loading 0: 45%|████▌ | 165/363 [00:04<00:05, 36.15it/s]
Loading 0: 47%|████▋ | 169/363 [00:04<00:05, 34.12it/s]
Loading 0: 48%|████▊ | 174/363 [00:04<00:05, 34.85it/s]
Loading 0: 49%|████▉ | 178/363 [00:05<00:05, 33.97it/s]
Loading 0: 50%|█████ | 183/363 [00:05<00:04, 37.39it/s]
Loading 0: 52%|█████▏ | 187/363 [00:05<00:04, 36.16it/s]
Loading 0: 53%|█████▎ | 192/363 [00:05<00:04, 39.05it/s]
Loading 0: 54%|█████▍ | 196/363 [00:05<00:04, 36.61it/s]
Loading 0: 55%|█████▌ | 201/363 [00:05<00:04, 38.74it/s]
Loading 0: 56%|█████▋ | 205/363 [00:05<00:04, 36.84it/s]
Loading 0: 58%|█████▊ | 210/363 [00:05<00:03, 39.33it/s]
Loading 0: 59%|█████▉ | 214/363 [00:05<00:04, 37.18it/s]
Loading 0: 60%|██████ | 218/363 [00:06<00:04, 35.93it/s]
Loading 0: 61%|██████▏ | 223/363 [00:06<00:05, 27.38it/s]
Loading 0: 63%|██████▎ | 227/363 [00:06<00:04, 28.82it/s]
Loading 0: 64%|██████▎ | 231/363 [00:06<00:04, 27.88it/s]
Loading 0: 65%|██████▌ | 237/363 [00:06<00:03, 33.38it/s]
Loading 0: 66%|██████▋ | 241/363 [00:06<00:03, 33.19it/s]
Loading 0: 68%|██████▊ | 246/363 [00:06<00:03, 35.73it/s]
Loading 0: 69%|██████▉ | 250/363 [00:07<00:03, 34.44it/s]
Loading 0: 70%|███████ | 255/363 [00:07<00:02, 37.19it/s]
Loading 0: 71%|███████▏ | 259/363 [00:07<00:02, 36.00it/s]
Loading 0: 73%|███████▎ | 264/363 [00:07<00:02, 38.53it/s]
Loading 0: 74%|███████▍ | 268/363 [00:07<00:02, 35.86it/s]
Loading 0: 75%|███████▌ | 273/363 [00:07<00:02, 38.45it/s]
Loading 0: 76%|███████▋ | 277/363 [00:07<00:02, 36.98it/s]
Loading 0: 78%|███████▊ | 282/363 [00:07<00:02, 39.03it/s]
Loading 0: 79%|███████▉ | 286/363 [00:08<00:02, 37.08it/s]
Loading 0: 80%|████████ | 291/363 [00:08<00:01, 39.75it/s]
Loading 0: 82%|████████▏ | 296/363 [00:08<00:01, 39.44it/s]
Loading 0: 83%|████████▎ | 300/363 [00:08<00:01, 39.42it/s]
Loading 0: 84%|████████▎ | 304/363 [00:15<00:28, 2.04it/s]
Loading 0: 85%|████████▍ | 307/363 [00:15<00:21, 2.58it/s]
Loading 0: 86%|████████▌ | 312/363 [00:15<00:13, 3.84it/s]
Loading 0: 88%|████████▊ | 320/363 [00:15<00:06, 6.62it/s]
Loading 0: 90%|████████▉ | 325/363 [00:15<00:04, 8.77it/s]
Loading 0: 91%|█████████ | 330/363 [00:15<00:03, 10.83it/s]
Loading 0: 93%|█████████▎| 338/363 [00:15<00:01, 16.22it/s]
Loading 0: 94%|█████████▍| 343/363 [00:16<00:01, 19.57it/s]
Loading 0: 96%|█████████▌| 348/363 [00:16<00:00, 20.81it/s]
Loading 0: 98%|█████████▊| 356/363 [00:16<00:00, 28.25it/s]
Loading 0: 99%|█████████▉| 361/363 [00:16<00:00, 31.31it/s]
Job alexcuadron-chai-v2-sft-14944-v4-mkmlizer completed after 94.04s with status: succeeded
Stopping job with name alexcuadron-chai-v2-sft-14944-v4-mkmlizer
Pipeline stage MKMLizer completed in 94.50s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.14s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service alexcuadron-chai-v2-sft-14944-v4
Waiting for inference service alexcuadron-chai-v2-sft-14944-v4 to be ready
Inference service alexcuadron-chai-v2-sft-14944-v4 ready after 90.358389377594s
Pipeline stage MKMLDeployer completed in 90.86s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.421678304672241s
Received healthy response to inference request in 1.5432143211364746s
Received healthy response to inference request in 1.7698826789855957s
Received healthy response to inference request in 1.7025773525238037s
Received healthy response to inference request in 1.6663131713867188s
5 requests
0 failed requests
5th percentile: 1.5678340911865234
10th percentile: 1.5924538612365722
20th percentile: 1.64169340133667
30th percentile: 1.6735660076141357
40th percentile: 1.6880716800689697
50th percentile: 1.7025773525238037
60th percentile: 1.7294994831085204
70th percentile: 1.7564216136932373
80th percentile: 1.900241804122925
90th percentile: 2.160960054397583
95th percentile: 2.291319179534912
99th percentile: 2.3956064796447754
mean time: 1.8207331657409669
Pipeline stage StressChecker completed in 10.34s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.65s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.70s
Shutdown handler de-registered
alexcuadron-chai-v2-sft_14944_v4 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4670.43s
Shutdown handler de-registered
alexcuadron-chai-v2-sft_14944_v4 status is now inactive due to auto deactivation removed underperforming models
alexcuadron-chai-v2-sft_14944_v4 status is now torndown due to DeploymentManager action