Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name rica40325-dpo-1008-5162s-v3-mkmlizer
Waiting for job on rica40325-dpo-1008-5162s-v3-mkmlizer to finish
rica40325-dpo-1008-5162s-v3-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
rica40325-dpo-1008-5162s-v3-mkmlizer: ║ _____ __ __ ║
rica40325-dpo-1008-5162s-v3-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
rica40325-dpo-1008-5162s-v3-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
rica40325-dpo-1008-5162s-v3-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
rica40325-dpo-1008-5162s-v3-mkmlizer: ║ /___/ ║
rica40325-dpo-1008-5162s-v3-mkmlizer: ║ ║
rica40325-dpo-1008-5162s-v3-mkmlizer: ║ Version: 0.11.12 ║
rica40325-dpo-1008-5162s-v3-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
rica40325-dpo-1008-5162s-v3-mkmlizer: ║ https://mk1.ai ║
rica40325-dpo-1008-5162s-v3-mkmlizer: ║ ║
rica40325-dpo-1008-5162s-v3-mkmlizer: ║ The license key for the current software has been verified as ║
rica40325-dpo-1008-5162s-v3-mkmlizer: ║ belonging to: ║
rica40325-dpo-1008-5162s-v3-mkmlizer: ║ ║
rica40325-dpo-1008-5162s-v3-mkmlizer: ║ Chai Research Corp. ║
rica40325-dpo-1008-5162s-v3-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
rica40325-dpo-1008-5162s-v3-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
rica40325-dpo-1008-5162s-v3-mkmlizer: ║ ║
rica40325-dpo-1008-5162s-v3-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
rica40325-dpo-1008-5162s-v3-mkmlizer: Downloaded to shared memory in 110.178s
rica40325-dpo-1008-5162s-v3-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmparygvct8, device:0
rica40325-dpo-1008-5162s-v3-mkmlizer: Saving flywheel model at /dev/shm/model_cache
rica40325-dpo-1008-5162s-v3-mkmlizer: quantized model in 43.355s
rica40325-dpo-1008-5162s-v3-mkmlizer: Processed model rica40325/dpo_1008_5162s in 153.533s
rica40325-dpo-1008-5162s-v3-mkmlizer: creating bucket guanaco-mkml-models
rica40325-dpo-1008-5162s-v3-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
rica40325-dpo-1008-5162s-v3-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/rica40325-dpo-1008-5162s-v3
rica40325-dpo-1008-5162s-v3-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/rica40325-dpo-1008-5162s-v3/config.json
rica40325-dpo-1008-5162s-v3-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/rica40325-dpo-1008-5162s-v3/special_tokens_map.json
rica40325-dpo-1008-5162s-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/rica40325-dpo-1008-5162s-v3/tokenizer_config.json
rica40325-dpo-1008-5162s-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/rica40325-dpo-1008-5162s-v3/tokenizer.json
rica40325-dpo-1008-5162s-v3-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/rica40325-dpo-1008-5162s-v3/flywheel_model.0.safetensors
rica40325-dpo-1008-5162s-v3-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 96%|█████████▌| 347/363 [00:20<00:08, 1.99it/s]
Loading 0: 96%|█████████▋| 350/363 [00:21<00:05, 2.50it/s]
Loading 0: 97%|█████████▋| 353/363 [00:21<00:03, 3.18it/s]
Loading 0: 98%|█████████▊| 357/363 [00:21<00:01, 4.33it/s]
Loading 0: 100%|█████████▉| 362/363 [00:21<00:00, 6.44it/s]
Job rica40325-dpo-1008-5162s-v3-mkmlizer completed after 185.29s with status: succeeded
Stopping job with name rica40325-dpo-1008-5162s-v3-mkmlizer
Pipeline stage MKMLizer completed in 185.77s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service rica40325-dpo-1008-5162s-v3
Waiting for inference service rica40325-dpo-1008-5162s-v3 to be ready
Inference service rica40325-dpo-1008-5162s-v3 ready after 130.49925661087036s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Pipeline stage MKMLDeployer completed in 138.87s
run pipeline stage %s
Running pipeline stage StressChecker
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 9.21093487739563s
Received healthy response to inference request in 7.874724626541138s
Received healthy response to inference request in 9.541566610336304s
Received healthy response to inference request in 2.0450875759124756s
Received healthy response to inference request in 2.2728219032287598s
5 requests
0 failed requests
5th percentile: 2.0906344413757325
10th percentile: 2.1361813068389894
20th percentile: 2.227275037765503
30th percentile: 3.393202447891235
40th percentile: 5.633963537216188
50th percentile: 7.874724626541138
60th percentile: 8.409208726882934
70th percentile: 8.943692827224732
80th percentile: 9.277061223983765
90th percentile: 9.409313917160034
95th percentile: 9.475440263748169
99th percentile: 9.528341341018677
mean time: 6.189027118682861
%s, retrying in %s seconds...
Received healthy response to inference request in 1.6735591888427734s
Received healthy response to inference request in 1.7700884342193604s
Received healthy response to inference request in 1.969529151916504s
Received healthy response to inference request in 1.9925432205200195s
Received healthy response to inference request in 1.71803617477417s
5 requests
0 failed requests
5th percentile: 1.6824545860290527
10th percentile: 1.691349983215332
20th percentile: 1.7091407775878906
30th percentile: 1.728446626663208
40th percentile: 1.7492675304412841
50th percentile: 1.7700884342193604
60th percentile: 1.8498647212982178
70th percentile: 1.9296410083770752
80th percentile: 1.974131965637207
90th percentile: 1.9833375930786132
95th percentile: 1.9879404067993165
99th percentile: 1.9916226577758789
mean time: 1.8247512340545655
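Note: the percentile and mean figures in the summary above are consistent with linear interpolation between the sorted response times. The sketch below recomputes them from the five latencies reported in the second batch. The helper name `percentile` and the interpolation rule are assumptions made for illustration; the log does not say how StressChecker actually computes these values.

```python
import math


def percentile(samples, q):
    """Linear-interpolation percentile (q in [0, 100]) of a non-empty list.

    Hypothetical helper: assumes the rank is (n - 1) * q / 100 and that
    values between two ranks are interpolated linearly.
    """
    ordered = sorted(samples)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


# Response times (seconds) copied from the second batch of the log.
latencies = [
    1.6735591888427734,
    1.7700884342193604,
    1.969529151916504,
    1.9925432205200195,
    1.71803617477417,
]

quantiles = (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99)
summary = {q: percentile(latencies, q) for q in quantiles}
mean_time = sum(latencies) / len(latencies)

print(f"{len(latencies)} requests")
for q, value in summary.items():
    print(f"{q}th percentile: {value}")
print(f"mean time: {mean_time}")
```

With only five samples per batch, every percentile is an interpolation between at most two neighbouring measurements, so the upper percentiles are essentially the slowest request.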
Pipeline stage StressChecker completed in 42.90s
Shutdown handler de-registered
rica40325-dpo-1008-5162s_v3 status is now deployed due to DeploymentManager action
rica40325-dpo-1008-5162s_v3 status is now inactive due to auto deactivation removed underperforming models
rica40325-dpo-1008-5162s_v3 status is now torndown due to DeploymentManager action