Running pipeline stage MKMLizer
Starting job with name zonemercy-graft-cogent-v-7573-v1-mkmlizer
Waiting for job on zonemercy-graft-cogent-v-7573-v1-mkmlizer to finish
Stopping job with name zonemercy-graft-cogent-v-7573-v1-mkmlizer
%s, retrying in %s seconds...
Starting job with name zonemercy-graft-cogent-v-7573-v1-mkmlizer
Waiting for job on zonemercy-graft-cogent-v-7573-v1-mkmlizer to finish
zonemercy-graft-cogent-v-7573-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
zonemercy-graft-cogent-v-7573-v1-mkmlizer: ║ _____ __ __ ║
zonemercy-graft-cogent-v-7573-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
zonemercy-graft-cogent-v-7573-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
zonemercy-graft-cogent-v-7573-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
zonemercy-graft-cogent-v-7573-v1-mkmlizer: ║ /___/ ║
zonemercy-graft-cogent-v-7573-v1-mkmlizer: ║ ║
zonemercy-graft-cogent-v-7573-v1-mkmlizer: ║ Version: 0.9.9 ║
zonemercy-graft-cogent-v-7573-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
zonemercy-graft-cogent-v-7573-v1-mkmlizer: ║ https://mk1.ai ║
zonemercy-graft-cogent-v-7573-v1-mkmlizer: ║ ║
zonemercy-graft-cogent-v-7573-v1-mkmlizer: ║ The license key for the current software has been verified as ║
zonemercy-graft-cogent-v-7573-v1-mkmlizer: ║ belonging to: ║
zonemercy-graft-cogent-v-7573-v1-mkmlizer: ║ ║
zonemercy-graft-cogent-v-7573-v1-mkmlizer: ║ Chai Research Corp. ║
zonemercy-graft-cogent-v-7573-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
zonemercy-graft-cogent-v-7573-v1-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
zonemercy-graft-cogent-v-7573-v1-mkmlizer: ║ ║
zonemercy-graft-cogent-v-7573-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
zonemercy-graft-cogent-v-7573-v1-mkmlizer: Downloaded to shared memory in 105.535s
zonemercy-graft-cogent-v-7573-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp2ggr6_z_, device:0
zonemercy-graft-cogent-v-7573-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
zonemercy-graft-cogent-v-7573-v1-mkmlizer: quantized model in 41.727s
zonemercy-graft-cogent-v-7573-v1-mkmlizer: Processed model zonemercy/Graft-Cogent-v1-Acute-Nemo-v0-5e6ep1 in 147.262s
zonemercy-graft-cogent-v-7573-v1-mkmlizer: creating bucket guanaco-mkml-models
zonemercy-graft-cogent-v-7573-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
zonemercy-graft-cogent-v-7573-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/zonemercy-graft-cogent-v-7573-v1
zonemercy-graft-cogent-v-7573-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/zonemercy-graft-cogent-v-7573-v1/config.json
zonemercy-graft-cogent-v-7573-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/zonemercy-graft-cogent-v-7573-v1/special_tokens_map.json
zonemercy-graft-cogent-v-7573-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/zonemercy-graft-cogent-v-7573-v1/tokenizer_config.json
zonemercy-graft-cogent-v-7573-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/zonemercy-graft-cogent-v-7573-v1/tokenizer.json
zonemercy-graft-cogent-v-7573-v1-mkmlizer: loading reward model from ChaiML/gpt2_xl_pairwise_89m_step_347634
zonemercy-graft-cogent-v-7573-v1-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%|▏ | 5/363 [00:00<00:15, 22.87it/s]
Loading 0: 3%|▎ | 10/363 [00:00<00:12, 27.95it/s]
Loading 0: 4%|▍ | 14/363 [00:00<00:14, 24.60it/s]
Loading 0: 5%|▌ | 19/363 [00:00<00:11, 31.14it/s]
Loading 0: 6%|▋ | 23/363 [00:00<00:14, 24.08it/s]
Loading 0: 7%|▋ | 26/363 [00:01<00:16, 20.85it/s]
Loading 0: 9%|▊ | 31/363 [00:01<00:12, 26.47it/s]
Loading 0: 10%|▉ | 35/363 [00:01<00:12, 27.28it/s]
Loading 0: 11%|█ | 39/363 [00:01<00:11, 28.58it/s]
Loading 0: 12%|█▏ | 43/363 [00:01<00:11, 27.67it/s]
Loading 0: 13%|█▎ | 48/363 [00:01<00:10, 30.42it/s]
Loading 0: 14%|█▍ | 52/363 [00:01<00:10, 28.88it/s]
Loading 0: 15%|█▌ | 56/363 [00:02<00:10, 28.24it/s]
Loading 0: 17%|█▋ | 60/363 [00:02<00:10, 29.26it/s]
Loading 0: 17%|█▋ | 63/363 [00:02<00:13, 21.60it/s]
Loading 0: 18%|█▊ | 66/363 [00:02<00:13, 22.27it/s]
Loading 0: 20%|█▉ | 71/363 [00:02<00:11, 26.19it/s]
Loading 0: 20%|██ | 74/363 [00:02<00:11, 26.06it/s]
Loading 0: 21%|██ | 77/363 [00:03<00:13, 21.63it/s]
Loading 0: 23%|██▎ | 84/363 [00:03<00:09, 29.13it/s]
Loading 0: 24%|██▍ | 88/363 [00:03<00:09, 28.05it/s]
Loading 0: 26%|██▌ | 93/363 [00:03<00:08, 30.15it/s]
Loading 0: 27%|██▋ | 97/363 [00:03<00:09, 28.52it/s]
Loading 0: 28%|██▊ | 101/363 [00:03<00:11, 23.49it/s]
Loading 0: 29%|██▊ | 104/363 [00:04<00:12, 20.54it/s]
Loading 0: 31%|███ | 111/363 [00:04<00:09, 27.28it/s]
Loading 0: 31%|███▏ | 114/363 [00:04<00:09, 25.46it/s]
Loading 0: 33%|███▎ | 120/363 [00:04<00:08, 29.80it/s]
Loading 0: 34%|███▍ | 124/363 [00:04<00:08, 28.50it/s]
Loading 0: 36%|███▌ | 129/363 [00:04<00:07, 30.77it/s]
Loading 0: 37%|███▋ | 133/363 [00:04<00:07, 29.12it/s]
Loading 0: 38%|███▊ | 137/363 [00:05<00:07, 29.25it/s]
Loading 0: 39%|███▉ | 142/363 [00:05<00:08, 25.36it/s]
Loading 0: 40%|███▉ | 145/363 [00:05<00:09, 23.78it/s]
Loading 0: 41%|████ | 149/363 [00:05<00:09, 22.65it/s]
Loading 0: 43%|████▎ | 156/363 [00:05<00:07, 28.17it/s]
Loading 0: 44%|████▍ | 159/363 [00:06<00:07, 26.06it/s]
Loading 0: 45%|████▌ | 165/363 [00:06<00:06, 30.36it/s]
Loading 0: 47%|████▋ | 169/363 [00:06<00:06, 28.89it/s]
Loading 0: 48%|████▊ | 174/363 [00:06<00:06, 30.90it/s]
Loading 0: 49%|████▉ | 178/363 [00:06<00:06, 28.81it/s]
Loading 0: 50%|████▉ | 181/363 [00:06<00:06, 28.10it/s]
Loading 0: 51%|█████ | 184/363 [00:07<00:08, 20.70it/s]
Loading 0: 52%|█████▏ | 187/363 [00:07<00:08, 21.71it/s]
Loading 0: 53%|█████▎ | 192/363 [00:07<00:06, 25.26it/s]
Loading 0: 54%|█████▎ | 195/363 [00:07<00:07, 23.58it/s]
Loading 0: 55%|█████▌ | 200/363 [00:07<00:05, 29.27it/s]
Loading 0: 56%|█████▌ | 204/363 [00:07<00:06, 25.57it/s]
Loading 0: 57%|█████▋ | 208/363 [00:07<00:05, 28.24it/s]
Loading 0: 58%|█████▊ | 212/363 [00:08<00:06, 24.66it/s]
Loading 0: 60%|█████▉ | 217/363 [00:08<00:04, 29.54it/s]
Loading 0: 61%|██████ | 222/363 [00:08<00:04, 30.26it/s]
Loading 0: 62%|██████▏ | 226/363 [00:08<00:06, 21.65it/s]
Loading 0: 63%|██████▎ | 230/363 [00:08<00:06, 21.10it/s]
Loading 0: 65%|██████▌ | 237/363 [00:08<00:04, 27.37it/s]
Loading 0: 66%|██████▋ | 241/363 [00:09<00:04, 26.67it/s]
Loading 0: 68%|██████▊ | 246/363 [00:09<00:04, 29.15it/s]
Loading 0: 69%|██████▉ | 250/363 [00:09<00:04, 28.10it/s]
Loading 0: 70%|███████ | 255/363 [00:09<00:03, 29.70it/s]
Loading 0: 71%|███████▏ | 259/363 [00:09<00:03, 27.88it/s]
Loading 0: 72%|███████▏ | 263/363 [00:09<00:04, 23.22it/s]
Loading 0: 73%|███████▎ | 266/363 [00:10<00:04, 20.13it/s]
Loading 0: 75%|███████▌ | 273/363 [00:10<00:03, 26.77it/s]
Loading 0: 76%|███████▌ | 276/363 [00:10<00:03, 24.88it/s]
Loading 0: 77%|███████▋ | 280/363 [00:10<00:02, 27.82it/s]
Loading 0: 78%|███████▊ | 284/363 [00:10<00:03, 25.12it/s]
Loading 0: 80%|████████ | 291/363 [00:10<00:02, 31.25it/s]
Loading 0: 81%|████████▏ | 295/363 [00:11<00:02, 28.95it/s]
Loading 0: 82%|████████▏ | 299/363 [00:11<00:02, 28.90it/s]
Loading 0: 84%|████████▎ | 304/363 [00:11<00:02, 24.99it/s]
Loading 0: 85%|████████▍ | 307/363 [00:11<00:02, 23.46it/s]
Loading 0: 86%|████████▌ | 311/363 [00:11<00:02, 22.31it/s]
Loading 0: 87%|████████▋ | 316/363 [00:11<00:01, 27.32it/s]
Loading 0: 88%|████████▊ | 320/363 [00:12<00:01, 24.27it/s]
Loading 0: 90%|████████▉ | 325/363 [00:12<00:01, 29.14it/s]
Loading 0: 91%|█████████ | 329/363 [00:12<00:01, 25.34it/s]
Loading 0: 92%|█████████▏| 334/363 [00:12<00:00, 30.22it/s]
Loading 0: 93%|█████████▎| 338/363 [00:12<00:00, 26.14it/s]
Loading 0: 95%|█████████▍| 344/363 [00:19<00:08, 2.20it/s]
Loading 0: 96%|█████████▌| 348/363 [00:19<00:05, 2.87it/s]
Loading 0: 97%|█████████▋| 353/363 [00:20<00:02, 4.05it/s]
Loading 0: 98%|█████████▊| 357/363 [00:20<00:01, 5.16it/s]
/opt/conda/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py:957: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
zonemercy-graft-cogent-v-7573-v1-mkmlizer: warnings.warn(
zonemercy-graft-cogent-v-7573-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py:785: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
zonemercy-graft-cogent-v-7573-v1-mkmlizer: warnings.warn(
zonemercy-graft-cogent-v-7573-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:469: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
zonemercy-graft-cogent-v-7573-v1-mkmlizer: warnings.warn(
zonemercy-graft-cogent-v-7573-v1-mkmlizer:
Downloading shards: 0%| | 0/2 [00:00<?, ?it/s]
Downloading shards: 50%|█████ | 1/2 [00:05<00:05, 5.46s/it]
Downloading shards: 100%|██████████| 2/2 [00:07<00:00, 3.74s/it]
Downloading shards: 100%|██████████| 2/2 [00:07<00:00, 4.00s/it]
zonemercy-graft-cogent-v-7573-v1-mkmlizer:
Loading checkpoint shards: 0%| | 0/2 [00:00<?, ?it/s]
Loading checkpoint shards: 50%|█████ | 1/2 [00:00<00:00, 2.21it/s]
Loading checkpoint shards: 100%|██████████| 2/2 [00:00<00:00, 3.69it/s]
Loading checkpoint shards: 100%|██████████| 2/2 [00:00<00:00, 3.35it/s]
zonemercy-graft-cogent-v-7573-v1-mkmlizer: Saving model to /tmp/reward_cache/reward.tensors
zonemercy-graft-cogent-v-7573-v1-mkmlizer: Saving duration: 1.351s
zonemercy-graft-cogent-v-7573-v1-mkmlizer: Processed model ChaiML/gpt2_xl_pairwise_89m_step_347634 in 13.157s
zonemercy-graft-cogent-v-7573-v1-mkmlizer: creating bucket guanaco-reward-models
zonemercy-graft-cogent-v-7573-v1-mkmlizer: Bucket 's3://guanaco-reward-models/' created
zonemercy-graft-cogent-v-7573-v1-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/zonemercy-graft-cogent-v-7573-v1_reward
zonemercy-graft-cogent-v-7573-v1-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/zonemercy-graft-cogent-v-7573-v1_reward/config.json
zonemercy-graft-cogent-v-7573-v1-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/zonemercy-graft-cogent-v-7573-v1_reward/special_tokens_map.json
zonemercy-graft-cogent-v-7573-v1-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/zonemercy-graft-cogent-v-7573-v1_reward/tokenizer_config.json
zonemercy-graft-cogent-v-7573-v1-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/zonemercy-graft-cogent-v-7573-v1_reward/merges.txt
zonemercy-graft-cogent-v-7573-v1-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/zonemercy-graft-cogent-v-7573-v1_reward/vocab.json
zonemercy-graft-cogent-v-7573-v1-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/zonemercy-graft-cogent-v-7573-v1_reward/tokenizer.json
zonemercy-graft-cogent-v-7573-v1-mkmlizer: cp /tmp/reward_cache/reward.tensors s3://guanaco-reward-models/zonemercy-graft-cogent-v-7573-v1_reward/reward.tensors
Job zonemercy-graft-cogent-v-7573-v1-mkmlizer completed after 200.41s with status: succeeded
Stopping job with name zonemercy-graft-cogent-v-7573-v1-mkmlizer
Pipeline stage MKMLizer completed in 201.51s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.11s
Running pipeline stage ISVCDeployer
Creating inference service zonemercy-graft-cogent-v-7573-v1
Waiting for inference service zonemercy-graft-cogent-v-7573-v1 to be ready
Failed to get response for submission zonemercy-burly-blue-cp2500_v5: ('http://zonemercy-burly-blue-cp2500-v5-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'read tcp 127.0.0.1:36528->127.0.0.1:8080: read: connection reset by peer\n')
Inference service zonemercy-graft-cogent-v-7573-v1 ready after 231.48332023620605s
Pipeline stage ISVCDeployer completed in 232.34s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.678600311279297s
Received healthy response to inference request in 1.7638444900512695s
Received healthy response to inference request in 1.775660514831543s
Received healthy response to inference request in 1.7625596523284912s
Received healthy response to inference request in 1.7827937602996826s
5 requests
0 failed requests
5th percentile: 1.7628166198730468
10th percentile: 1.7630735874176025
20th percentile: 1.763587522506714
30th percentile: 1.7662076950073242
40th percentile: 1.7709341049194336
50th percentile: 1.775660514831543
60th percentile: 1.7785138130187987
70th percentile: 1.7813671112060547
80th percentile: 1.9619550704956057
90th percentile: 2.320277690887451
95th percentile: 2.499439001083374
99th percentile: 2.642768049240112
mean time: 1.9526917457580566
Pipeline stage StressChecker completed in 16.60s
zonemercy-graft-cogent-v_7573_v1 status is now deployed due to DeploymentManager action
zonemercy-graft-cogent-v_7573_v1 status is now inactive due to auto deactivation removed underperforming models
zonemercy-graft-cogent-v_7573_v1 status is now torndown due to DeploymentManager action