Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-mistral31-ff4kex-37571-v3-mkmlizer
Waiting for job on chaiml-mistral31-ff4kex-37571-v3-mkmlizer to finish
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: ║ _____ __ __ ║
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: ║ /___/ ║
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: ║ ║
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: ║ Version: 0.12.8 ║
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: ║ https://mk1.ai ║
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: ║ ║
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: ║ belonging to: ║
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: ║ ║
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: ║ Chai Research Corp. ║
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: ║ Expiration: 2025-04-15 23:59:59 ║
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: ║ ║
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: Downloaded to shared memory in 63.867s
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp08snk6s1, device:0
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Failed to get response for submission jellywibble-felix-black_21853_v1: HTTPConnectionPool(host='jellywibble-felix-black-21853-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: quantized model in 52.892s
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: Processed model ChaiML/mistral31-ff4kexp1-simpo-v1-data31preffull in 116.759s
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: creating bucket guanaco-mkml-models
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-mistral31-ff4kex-37571-v3
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-mistral31-ff4kex-37571-v3/config.json
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-mistral31-ff4kex-37571-v3/special_tokens_map.json
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-mistral31-ff4kex-37571-v3/tokenizer_config.json
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-mistral31-ff4kex-37571-v3/tokenizer.json
chaiml-mistral31-ff4kex-37571-v3-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-mistral31-ff4kex-37571-v3/flywheel_model.0.safetensors
chaiml-mistral31-ff4kex-37571-v3-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 55%|█████▌ | 200/363 [00:21<00:06, 24.20it/s]
Loading 0: 55%|█████▌ | 201/363 [00:21<02:59, 1.11s/it]
Loading 0: 94%|█████████▍| 343/363 [00:33<00:10, 1.91it/s]
Loading 0: 99%|█████████▉| 359/363 [00:34<00:00, 5.76it/s]
Job chaiml-mistral31-ff4kex-37571-v3-mkmlizer completed after 156.35s with status: succeeded
Stopping job with name chaiml-mistral31-ff4kex-37571-v3-mkmlizer
Pipeline stage MKMLizer completed in 156.78s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.14s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-mistral31-ff4kex-37571-v3
Waiting for inference service chaiml-mistral31-ff4kex-37571-v3 to be ready
Inference service chaiml-mistral31-ff4kex-37571-v3 ready after 120.48378705978394s
Pipeline stage MKMLDeployer completed in 120.90s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 3.0217113494873047s
Received healthy response to inference request in 2.8691465854644775s
Received healthy response to inference request in 2.483856201171875s
Received healthy response to inference request in 2.459623336791992s
5 requests
1 failed requests
5th percentile: 2.4644699096679688
10th percentile: 2.4693164825439453
20th percentile: 2.4790096282958984
30th percentile: 2.5609142780303955
40th percentile: 2.7150304317474365
50th percentile: 2.8691465854644775
60th percentile: 2.930172491073608
70th percentile: 2.9911983966827393
80th percentile: 6.443994617462161
90th percentile: 13.288561153411866
95th percentile: 16.710844421386717
99th percentile: 19.4486710357666
mean time: 6.193493032455445
%s, retrying in %s seconds...
Received healthy response to inference request in 2.5618462562561035s
Received healthy response to inference request in 2.924821615219116s
Received healthy response to inference request in 2.458974599838257s
Received healthy response to inference request in 2.582920551300049s
Received healthy response to inference request in 2.5645627975463867s
5 requests
0 failed requests
5th percentile: 2.479548931121826
10th percentile: 2.5001232624053955
20th percentile: 2.541271924972534
30th percentile: 2.5623895645141603
40th percentile: 2.5634761810302735
50th percentile: 2.5645627975463867
60th percentile: 2.5719058990478514
70th percentile: 2.5792490005493165
80th percentile: 2.6513007640838624
90th percentile: 2.788061189651489
95th percentile: 2.8564414024353026
99th percentile: 2.9111455726623534
mean time: 2.6186251640319824
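Note on the statistics above: the percentile and mean lines are consistent with linear interpolation between sorted samples (the default convention of NumPy's percentile function). The sketch below is an assumption about how the StressChecker summary could be reproduced, not the pipeline's actual code; the helper name `percentile` and the variable `latencies` are hypothetical. The input values are the five healthy response times logged in the second run.

```python
import math


def percentile(samples, q):
    """Return the q-th percentile (0-100) using linear interpolation.

    Assumption: this mirrors the convention the logged values appear to
    follow; it is not taken from the pipeline source.
    """
    ordered = sorted(samples)
    if not ordered:
        raise ValueError("percentile() needs at least one sample")
    # Fractional index into the sorted samples.
    pos = (len(ordered) - 1) * q / 100.0
    lo = math.floor(pos)
    hi = min(lo + 1, len(ordered) - 1)
    # Interpolate between the two neighbouring samples.
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


# Healthy response times (seconds) from the second stress-check run.
latencies = [
    2.5618462562561035,
    2.924821615219116,
    2.458974599838257,
    2.582920551300049,
    2.5645627975463867,
]

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

Under this assumption the 5th, 50th and 99th percentiles and the mean agree with the logged values up to floating-point rounding. The first run's much larger tail (80th percentile and above) follows from the single request that hit the 20-second read timeout.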
Pipeline stage StressChecker completed in 46.56s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.71s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.65s
Shutdown handler de-registered
chaiml-mistral31-ff4kex_37571_v3 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 5577.84s
Shutdown handler de-registered
chaiml-mistral31-ff4kex_37571_v3 status is now inactive due to auto deactivation removed underperforming models
chaiml-mistral31-ff4kex_37571_v3 status is now torndown due to DeploymentManager action