Running pipeline stage MKMLizer
Starting job with name shuttleai-shuttle-2-5-mini-v1-mkmlizer
Waiting for job on shuttleai-shuttle-2-5-mini-v1-mkmlizer to finish
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1-mini_v1: ('http://shuttleai-shuttle-2-5-1-mini-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
shuttleai-shuttle-2-5-mini-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
shuttleai-shuttle-2-5-mini-v1-mkmlizer: ║ _____ __ __ ║
shuttleai-shuttle-2-5-mini-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
shuttleai-shuttle-2-5-mini-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
shuttleai-shuttle-2-5-mini-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
shuttleai-shuttle-2-5-mini-v1-mkmlizer: ║ /___/ ║
shuttleai-shuttle-2-5-mini-v1-mkmlizer: ║ ║
shuttleai-shuttle-2-5-mini-v1-mkmlizer: ║ Version: 0.9.7 ║
shuttleai-shuttle-2-5-mini-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
shuttleai-shuttle-2-5-mini-v1-mkmlizer: ║ https://mk1.ai ║
shuttleai-shuttle-2-5-mini-v1-mkmlizer: ║ ║
shuttleai-shuttle-2-5-mini-v1-mkmlizer: ║ The license key for the current software has been verified as ║
shuttleai-shuttle-2-5-mini-v1-mkmlizer: ║ belonging to: ║
shuttleai-shuttle-2-5-mini-v1-mkmlizer: ║ ║
shuttleai-shuttle-2-5-mini-v1-mkmlizer: ║ Chai Research Corp. ║
shuttleai-shuttle-2-5-mini-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
shuttleai-shuttle-2-5-mini-v1-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
shuttleai-shuttle-2-5-mini-v1-mkmlizer: ║ ║
shuttleai-shuttle-2-5-mini-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Failed to get response for submission shuttleai-shuttle-2-5-1-mini_v1: ('http://shuttleai-shuttle-2-5-1-mini-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1-mini_v1: ('http://shuttleai-shuttle-2-5-1-mini-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1-mini_v1: ('http://shuttleai-shuttle-2-5-1-mini-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'EOF\n')
Failed to get response for submission shuttleai-shuttle-2-5-1-mini_v1: ('http://shuttleai-shuttle-2-5-1-mini-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'EOF\n')
shuttleai-shuttle-2-5-mini-v1-mkmlizer: Downloaded to shared memory in 30.430s
shuttleai-shuttle-2-5-mini-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp9x37nb83, device:0
shuttleai-shuttle-2-5-mini-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Failed to get response for submission shuttleai-shuttle-2-5-1_5730_v10: ('http://shuttleai-shuttle-2-5-1-5730-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1-mini_v1: ('http://shuttleai-shuttle-2-5-1-mini-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'activator request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1_5730_v10: ('http://shuttleai-shuttle-2-5-1-5730-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
shuttleai-shuttle-2-5-mini-v1-mkmlizer: quantized model in 35.747s
shuttleai-shuttle-2-5-mini-v1-mkmlizer: Processed model shuttleai/shuttle-2.5-mini in 66.177s
shuttleai-shuttle-2-5-mini-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
shuttleai-shuttle-2-5-mini-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/shuttleai-shuttle-2-5-mini-v1
shuttleai-shuttle-2-5-mini-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/shuttleai-shuttle-2-5-mini-v1/config.json
shuttleai-shuttle-2-5-mini-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/shuttleai-shuttle-2-5-mini-v1/special_tokens_map.json
shuttleai-shuttle-2-5-mini-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/shuttleai-shuttle-2-5-mini-v1/tokenizer_config.json
shuttleai-shuttle-2-5-mini-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/shuttleai-shuttle-2-5-mini-v1/tokenizer.json
shuttleai-shuttle-2-5-mini-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/shuttleai-shuttle-2-5-mini-v1/flywheel_model.0.safetensors
shuttleai-shuttle-2-5-mini-v1-mkmlizer: loading reward model from ChaiML/gpt2_xl_pairwise_89m_step_347634
shuttleai-shuttle-2-5-mini-v1-mkmlizer:
Loading 0: 100%|█████████▉| 362/363 [00:15<00:00, 32.02it/s]
/opt/conda/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py:957: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
shuttleai-shuttle-2-5-mini-v1-mkmlizer: warnings.warn(
shuttleai-shuttle-2-5-mini-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py:785: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
shuttleai-shuttle-2-5-mini-v1-mkmlizer: warnings.warn(
shuttleai-shuttle-2-5-mini-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:469: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
shuttleai-shuttle-2-5-mini-v1-mkmlizer: warnings.warn(
shuttleai-shuttle-2-5-mini-v1-mkmlizer:
Downloading shards: 100%|██████████| 2/2 [00:08<00:00, 4.16s/it]
shuttleai-shuttle-2-5-mini-v1-mkmlizer:
Loading checkpoint shards: 100%|██████████| 2/2 [00:00<00:00, 3.59it/s]
shuttleai-shuttle-2-5-mini-v1-mkmlizer: Saving model to /tmp/reward_cache/reward.tensors
shuttleai-shuttle-2-5-mini-v1-mkmlizer: Saving duration: 1.381s
shuttleai-shuttle-2-5-mini-v1-mkmlizer: Processed model ChaiML/gpt2_xl_pairwise_89m_step_347634 in 13.266s
shuttleai-shuttle-2-5-mini-v1-mkmlizer: creating bucket guanaco-reward-models
shuttleai-shuttle-2-5-mini-v1-mkmlizer: Bucket 's3://guanaco-reward-models/' created
shuttleai-shuttle-2-5-mini-v1-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/shuttleai-shuttle-2-5-mini-v1_reward
shuttleai-shuttle-2-5-mini-v1-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/shuttleai-shuttle-2-5-mini-v1_reward/config.json
shuttleai-shuttle-2-5-mini-v1-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/shuttleai-shuttle-2-5-mini-v1_reward/special_tokens_map.json
shuttleai-shuttle-2-5-mini-v1-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/shuttleai-shuttle-2-5-mini-v1_reward/tokenizer_config.json
shuttleai-shuttle-2-5-mini-v1-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/shuttleai-shuttle-2-5-mini-v1_reward/merges.txt
shuttleai-shuttle-2-5-mini-v1-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/shuttleai-shuttle-2-5-mini-v1_reward/vocab.json
shuttleai-shuttle-2-5-mini-v1-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/shuttleai-shuttle-2-5-mini-v1_reward/tokenizer.json
shuttleai-shuttle-2-5-mini-v1-mkmlizer: cp /tmp/reward_cache/reward.tensors s3://guanaco-reward-models/shuttleai-shuttle-2-5-mini-v1_reward/reward.tensors
Job shuttleai-shuttle-2-5-mini-v1-mkmlizer completed after 115.34s with status: succeeded
Stopping job with name shuttleai-shuttle-2-5-mini-v1-mkmlizer
Pipeline stage MKMLizer completed in 116.36s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.11s
Running pipeline stage ISVCDeployer
Creating inference service shuttleai-shuttle-2-5-mini-v1
Waiting for inference service shuttleai-shuttle-2-5-mini-v1 to be ready
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1_5730_v10: ('http://shuttleai-shuttle-2-5-1-5730-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1-mini_v1: ('http://shuttleai-shuttle-2-5-1-mini-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Inference service shuttleai-shuttle-2-5-mini-v1 ready after 140.88140726089478s
Pipeline stage ISVCDeployer completed in 142.53s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.641197443008423s
Received healthy response to inference request in 1.7066493034362793s
Received healthy response to inference request in 1.7075841426849365s
Received healthy response to inference request in 1.7128212451934814s
Received healthy response to inference request in 1.6985902786254883s
5 requests
0 failed requests
5th percentile: 1.7002020835876466
10th percentile: 1.7018138885498046
20th percentile: 1.705037498474121
30th percentile: 1.7068362712860108
40th percentile: 1.7072102069854735
50th percentile: 1.7075841426849365
60th percentile: 1.7096789836883546
70th percentile: 1.7117738246917724
80th percentile: 1.8984964847564698
90th percentile: 2.2698469638824466
95th percentile: 2.4555222034454345
99th percentile: 2.6040623950958253
mean time: 1.8933684825897217
Pipeline stage StressChecker completed in 10.27s
shuttleai-shuttle-2-5-mini_v1 status is now deployed due to DeploymentManager action
shuttleai-shuttle-2-5-mini_v1 status is now inactive due to auto deactivation removed underperforming models
admin requested tearing down of shuttleai-shuttle-2-5-mini_v1
Running pipeline stage ISVCDeleter
Checking if service shuttleai-shuttle-2-5-mini-v1 is running
Tearing down inference service shuttleai-shuttle-2-5-mini-v1
Service shuttleai-shuttle-2-5-mini-v1 has been torndown
Pipeline stage ISVCDeleter completed in 5.41s
Running pipeline stage MKMLModelDeleter
Cleaning model data from S3
Cleaning model data from model cache
Deleting key shuttleai-shuttle-2-5-mini-v1/config.json from bucket guanaco-mkml-models
Deleting key shuttleai-shuttle-2-5-mini-v1/flywheel_model.0.safetensors from bucket guanaco-mkml-models
Deleting key shuttleai-shuttle-2-5-mini-v1/special_tokens_map.json from bucket guanaco-mkml-models
Deleting key shuttleai-shuttle-2-5-mini-v1/tokenizer.json from bucket guanaco-mkml-models
Deleting key shuttleai-shuttle-2-5-mini-v1/tokenizer_config.json from bucket guanaco-mkml-models
Cleaning model data from model cache
Deleting key shuttleai-shuttle-2-5-mini-v1_reward/config.json from bucket guanaco-reward-models
Deleting key shuttleai-shuttle-2-5-mini-v1_reward/merges.txt from bucket guanaco-reward-models
Deleting key shuttleai-shuttle-2-5-mini-v1_reward/reward.tensors from bucket guanaco-reward-models
Deleting key shuttleai-shuttle-2-5-mini-v1_reward/special_tokens_map.json from bucket guanaco-reward-models
Deleting key shuttleai-shuttle-2-5-mini-v1_reward/tokenizer.json from bucket guanaco-reward-models
Deleting key shuttleai-shuttle-2-5-mini-v1_reward/tokenizer_config.json from bucket guanaco-reward-models
Deleting key shuttleai-shuttle-2-5-mini-v1_reward/vocab.json from bucket guanaco-reward-models
Pipeline stage MKMLModelDeleter completed in 6.44s
shuttleai-shuttle-2-5-mini_v1 status is now torndown due to DeploymentManager action