Running pipeline stage MKMLizer
Starting job with name endevor-infinityrp-v1-7b-v2-mkmlizer
Waiting for job on endevor-infinityrp-v1-7b-v2-mkmlizer to finish
endevor-infinityrp-v1-7b-v2-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
endevor-infinityrp-v1-7b-v2-mkmlizer: ║ _____ __ __ ║
endevor-infinityrp-v1-7b-v2-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
endevor-infinityrp-v1-7b-v2-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
endevor-infinityrp-v1-7b-v2-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
endevor-infinityrp-v1-7b-v2-mkmlizer: ║ /___/ ║
endevor-infinityrp-v1-7b-v2-mkmlizer: ║ ║
endevor-infinityrp-v1-7b-v2-mkmlizer: ║ Version: 0.6.11 ║
endevor-infinityrp-v1-7b-v2-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
endevor-infinityrp-v1-7b-v2-mkmlizer: ║ ║
endevor-infinityrp-v1-7b-v2-mkmlizer: ║ The license key for the current software has been verified as ║
endevor-infinityrp-v1-7b-v2-mkmlizer: ║ belonging to: ║
endevor-infinityrp-v1-7b-v2-mkmlizer: ║ ║
endevor-infinityrp-v1-7b-v2-mkmlizer: ║ Chai Research Corp. ║
endevor-infinityrp-v1-7b-v2-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
endevor-infinityrp-v1-7b-v2-mkmlizer: ║ Expiration: 2024-04-15 23:59:59 ║
endevor-infinityrp-v1-7b-v2-mkmlizer: ║ ║
endevor-infinityrp-v1-7b-v2-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
endevor-infinityrp-v1-7b-v2-mkmlizer:
.gitattributes: 0%| | 0.00/1.52k [00:00<?, ?B/s]
.gitattributes: 100%|██████████| 1.52k/1.52k [00:00<00:00, 19.6MB/s]
endevor-infinityrp-v1-7b-v2-mkmlizer:
README.md: 0%| | 0.00/1.77k [00:00<?, ?B/s]
README.md: 100%|██████████| 1.77k/1.77k [00:00<00:00, 29.4MB/s]
endevor-infinityrp-v1-7b-v2-mkmlizer:
config.json: 0%| | 0.00/642 [00:00<?, ?B/s]
config.json: 100%|██████████| 642/642 [00:00<00:00, 10.6MB/s]
endevor-infinityrp-v1-7b-v2-mkmlizer:
model-00001-of-00002.safetensors: 0%| | 0.00/9.94G [00:00<?, ?B/s]
model-00001-of-00002.safetensors: 0%| | 10.5M/9.94G [00:00<03:17, 50.3MB/s]
model-00001-of-00002.safetensors: 0%| | 21.0M/9.94G [00:00<03:28, 47.6MB/s]
model-00001-of-00002.safetensors: 1%| | 115M/9.94G [00:00<00:34, 286MB/s]
model-00001-of-00002.safetensors: 2%|▏ | 157M/9.94G [00:00<00:38, 256MB/s]
model-00001-of-00002.safetensors: 2%|▏ | 241M/9.94G [00:00<00:28, 342MB/s]
model-00001-of-00002.safetensors: 3%|▎ | 336M/9.94G [00:01<00:21, 438MB/s]
model-00001-of-00002.safetensors: 4%|▍ | 388M/9.94G [00:01<00:21, 439MB/s]
model-00001-of-00002.safetensors: 4%|▍ | 440M/9.94G [00:01<00:22, 430MB/s]
model-00001-of-00002.safetensors: 5%|▌ | 524M/9.94G [00:01<00:18, 521MB/s]
model-00001-of-00002.safetensors: 6%|▌ | 587M/9.94G [00:01<00:18, 505MB/s]
model-00001-of-00002.safetensors: 7%|▋ | 671M/9.94G [00:01<00:15, 587MB/s]
model-00001-of-00002.safetensors: 8%|▊ | 776M/9.94G [00:01<00:14, 645MB/s]
model-00001-of-00002.safetensors: 9%|▉ | 870M/9.94G [00:01<00:12, 719MB/s]
model-00001-of-00002.safetensors: 10%|▉ | 954M/9.94G [00:01<00:12, 723MB/s]
model-00001-of-00002.safetensors: 10%|█ | 1.04G/9.94G [00:02<00:12, 724MB/s]
model-00001-of-00002.safetensors: 11%|█▏ | 1.12G/9.94G [00:02<00:12, 712MB/s]
model-00001-of-00002.safetensors: 12%|█▏ | 1.23G/9.94G [00:02<00:10, 797MB/s]
model-00001-of-00002.safetensors: 13%|█▎ | 1.31G/9.94G [00:02<00:13, 636MB/s]
model-00001-of-00002.safetensors: 14%|█▍ | 1.38G/9.94G [00:02<00:16, 515MB/s]
model-00001-of-00002.safetensors: 15%|█▍ | 1.47G/9.94G [00:02<00:15, 556MB/s]
model-00001-of-00002.safetensors: 17%|█▋ | 1.66G/9.94G [00:02<00:09, 844MB/s]
model-00001-of-00002.safetensors: 24%|██▍ | 2.42G/9.94G [00:03<00:03, 2.42GB/s]
model-00001-of-00002.safetensors: 27%|██▋ | 2.71G/9.94G [00:03<00:04, 1.72GB/s]
model-00001-of-00002.safetensors: 30%|██▉ | 2.94G/9.94G [00:03<00:05, 1.22GB/s]
model-00001-of-00002.safetensors: 31%|███▏ | 3.11G/9.94G [00:03<00:05, 1.29GB/s]
model-00001-of-00002.safetensors: 33%|███▎ | 3.29G/9.94G [00:04<00:06, 1.04GB/s]
model-00001-of-00002.safetensors: 35%|███▍ | 3.44G/9.94G [00:04<00:07, 908MB/s]
model-00001-of-00002.safetensors: 36%|███▌ | 3.57G/9.94G [00:04<00:06, 955MB/s]
model-00001-of-00002.safetensors: 37%|███▋ | 3.69G/9.94G [00:04<00:06, 920MB/s]
model-00001-of-00002.safetensors: 39%|███▉ | 3.92G/9.94G [00:04<00:05, 1.17GB/s]
model-00001-of-00002.safetensors: 41%|████ | 4.07G/9.94G [00:04<00:04, 1.18GB/s]
model-00001-of-00002.safetensors: 42%|████▏ | 4.20G/9.94G [00:05<00:05, 973MB/s]
model-00001-of-00002.safetensors: 44%|████▍ | 4.39G/9.94G [00:05<00:05, 1.11GB/s]
model-00001-of-00002.safetensors: 45%|████▌ | 4.52G/9.94G [00:05<00:05, 1.00GB/s]
model-00001-of-00002.safetensors: 47%|████▋ | 4.63G/9.94G [00:05<00:05, 968MB/s]
model-00001-of-00002.safetensors: 48%|████▊ | 4.79G/9.94G [00:05<00:04, 1.10GB/s]
model-00001-of-00002.safetensors: 50%|████▉ | 4.93G/9.94G [00:05<00:04, 1.07GB/s]
model-00001-of-00002.safetensors: 51%|█████ | 5.04G/9.94G [00:05<00:05, 949MB/s]
model-00001-of-00002.safetensors: 52%|█████▏ | 5.17G/9.94G [00:05<00:04, 1.01GB/s]
model-00001-of-00002.safetensors: 53%|█████▎ | 5.32G/9.94G [00:06<00:04, 1.12GB/s]
model-00001-of-00002.safetensors: 55%|█████▍ | 5.44G/9.94G [00:06<00:04, 928MB/s]
model-00001-of-00002.safetensors: 56%|█████▌ | 5.55G/9.94G [00:06<00:04, 948MB/s]
model-00001-of-00002.safetensors: 57%|█████▋ | 5.70G/9.94G [00:06<00:04, 1.02GB/s]
model-00001-of-00002.safetensors: 59%|█████▊ | 5.82G/9.94G [00:06<00:05, 820MB/s]
model-00001-of-00002.safetensors: 60%|██████ | 6.00G/9.94G [00:06<00:03, 1.01GB/s]
model-00001-of-00002.safetensors: 61%|██████▏ | 6.11G/9.94G [00:07<00:04, 833MB/s]
model-00001-of-00002.safetensors: 63%|██████▎ | 6.22G/9.94G [00:07<00:04, 878MB/s]
model-00001-of-00002.safetensors: 64%|██████▍ | 6.34G/9.94G [00:07<00:03, 956MB/s]
model-00001-of-00002.safetensors: 66%|██████▌ | 6.57G/9.94G [00:07<00:02, 1.14GB/s]
model-00001-of-00002.safetensors: 67%|██████▋ | 6.70G/9.94G [00:07<00:03, 938MB/s]
model-00001-of-00002.safetensors: 69%|██████▉ | 6.85G/9.94G [00:07<00:02, 1.04GB/s]
model-00001-of-00002.safetensors: 71%|███████▏ | 7.09G/9.94G [00:07<00:02, 1.36GB/s]
model-00001-of-00002.safetensors: 73%|███████▎ | 7.25G/9.94G [00:08<00:02, 1.03GB/s]
model-00001-of-00002.safetensors: 74%|███████▍ | 7.37G/9.94G [00:08<00:02, 997MB/s]
model-00001-of-00002.safetensors: 77%|███████▋ | 7.67G/9.94G [00:08<00:01, 1.24GB/s]
model-00001-of-00002.safetensors: 78%|███████▊ | 7.80G/9.94G [00:08<00:02, 1.04GB/s]
model-00001-of-00002.safetensors: 80%|████████ | 7.99G/9.94G [00:08<00:01, 1.21GB/s]
model-00001-of-00002.safetensors: 82%|████████▏ | 8.17G/9.94G [00:08<00:01, 1.23GB/s]
model-00001-of-00002.safetensors: 84%|████████▎ | 8.30G/9.94G [00:08<00:01, 1.07GB/s]
model-00001-of-00002.safetensors: 85%|████████▍ | 8.43G/9.94G [00:09<00:01, 953MB/s]
model-00001-of-00002.safetensors: 87%|████████▋ | 8.61G/9.94G [00:09<00:01, 1.12GB/s]
model-00001-of-00002.safetensors: 88%|████████▊ | 8.73G/9.94G [00:09<00:01, 941MB/s]
model-00001-of-00002.safetensors: 90%|████████▉ | 8.91G/9.94G [00:09<00:00, 1.10GB/s]
model-00001-of-00002.safetensors: 92%|█████████▏| 9.10G/9.94G [00:09<00:00, 1.28GB/s]
model-00001-of-00002.safetensors: 93%|█████████▎| 9.25G/9.94G [00:09<00:00, 1.31GB/s]
model-00001-of-00002.safetensors: 96%|█████████▌| 9.50G/9.94G [00:09<00:00, 1.62GB/s]
model-00001-of-00002.safetensors: 100%|█████████▉| 9.90G/9.94G [00:09<00:00, 2.16GB/s]
model-00001-of-00002.safetensors: 100%|█████████▉| 9.94G/9.94G [00:10<00:00, 972MB/s]
endevor-infinityrp-v1-7b-v2-mkmlizer:
model-00002-of-00002.safetensors: 0%| | 0.00/4.54G [00:00<?, ?B/s]
model-00002-of-00002.safetensors: 0%| | 10.5M/4.54G [00:00<02:11, 34.4MB/s]
model-00002-of-00002.safetensors: 1%|▏ | 62.9M/4.54G [00:00<00:26, 166MB/s]
model-00002-of-00002.safetensors: 3%|▎ | 115M/4.54G [00:00<00:18, 241MB/s]
model-00002-of-00002.safetensors: 4%|▍ | 178M/4.54G [00:00<00:13, 327MB/s]
model-00002-of-00002.safetensors: 5%|▌ | 231M/4.54G [00:00<00:12, 345MB/s]
model-00002-of-00002.safetensors: 6%|▌ | 283M/4.54G [00:00<00:11, 385MB/s]
model-00002-of-00002.safetensors: 7%|▋ | 336M/4.54G [00:01<00:10, 420MB/s]
model-00002-of-00002.safetensors: 9%|▊ | 388M/4.54G [00:01<00:09, 416MB/s]
model-00002-of-00002.safetensors: 10%|▉ | 451M/4.54G [00:01<00:09, 427MB/s]
model-00002-of-00002.safetensors: 11%|█ | 503M/4.54G [00:01<00:09, 442MB/s]
model-00002-of-00002.safetensors: 12%|█▏ | 566M/4.54G [00:01<00:08, 458MB/s]
model-00002-of-00002.safetensors: 14%|█▍ | 629M/4.54G [00:01<00:08, 451MB/s]
model-00002-of-00002.safetensors: 15%|█▌ | 692M/4.54G [00:01<00:08, 468MB/s]
model-00002-of-00002.safetensors: 17%|█▋ | 755M/4.54G [00:01<00:07, 496MB/s]
model-00002-of-00002.safetensors: 18%|█▊ | 807M/4.54G [00:02<00:07, 487MB/s]
model-00002-of-00002.safetensors: 19%|█▉ | 881M/4.54G [00:02<00:06, 534MB/s]
model-00002-of-00002.safetensors: 21%|██ | 965M/4.54G [00:02<00:06, 595MB/s]
model-00002-of-00002.safetensors: 23%|██▎ | 1.03G/4.54G [00:02<00:06, 584MB/s]
model-00002-of-00002.safetensors: 24%|██▍ | 1.09G/4.54G [00:02<00:08, 385MB/s]
model-00002-of-00002.safetensors: 32%|███▏ | 1.44G/4.54G [00:02<00:03, 1.00GB/s]
model-00002-of-00002.safetensors: 47%|████▋ | 2.15G/4.54G [00:02<00:01, 2.08GB/s]
model-00002-of-00002.safetensors: 52%|█████▏ | 2.38G/4.54G [00:03<00:01, 1.56GB/s]
model-00002-of-00002.safetensors: 57%|█████▋ | 2.57G/4.54G [00:03<00:01, 1.37GB/s]
model-00002-of-00002.safetensors: 60%|██████ | 2.74G/4.54G [00:03<00:01, 1.28GB/s]
model-00002-of-00002.safetensors: 64%|██████▎ | 2.89G/4.54G [00:03<00:01, 1.34GB/s]
model-00002-of-00002.safetensors: 67%|██████▋ | 3.04G/4.54G [00:03<00:01, 1.26GB/s]
model-00002-of-00002.safetensors: 70%|██████▉ | 3.18G/4.54G [00:03<00:01, 1.18GB/s]
model-00002-of-00002.safetensors: 73%|███████▎ | 3.30G/4.54G [00:04<00:01, 1.15GB/s]
model-00002-of-00002.safetensors: 76%|███████▌ | 3.43G/4.54G [00:04<00:00, 1.13GB/s]
model-00002-of-00002.safetensors: 78%|███████▊ | 3.54G/4.54G [00:04<00:00, 1.03GB/s]
model-00002-of-00002.safetensors: 80%|████████ | 3.65G/4.54G [00:04<00:01, 689MB/s]
model-00002-of-00002.safetensors: 82%|████████▏ | 3.73G/4.54G [00:04<00:01, 649MB/s]
model-00002-of-00002.safetensors: 84%|████████▍ | 3.82G/4.54G [00:04<00:01, 630MB/s]
model-00002-of-00002.safetensors: 86%|████████▌ | 3.90G/4.54G [00:05<00:00, 669MB/s]
model-00002-of-00002.safetensors: 100%|█████████▉| 4.54G/4.54G [00:05<00:00, 877MB/s]
endevor-infinityrp-v1-7b-v2-mkmlizer:
model.safetensors.index.json: 0%| | 0.00/22.8k [00:00<?, ?B/s]
model.safetensors.index.json: 100%|██████████| 22.8k/22.8k [00:00<00:00, 155MB/s]
endevor-infinityrp-v1-7b-v2-mkmlizer:
special_tokens_map.json: 0%| | 0.00/414 [00:00<?, ?B/s]
special_tokens_map.json: 100%|██████████| 414/414 [00:00<00:00, 5.66MB/s]
endevor-infinityrp-v1-7b-v2-mkmlizer:
tokenizer.json: 0%| | 0.00/1.80M [00:00<?, ?B/s]
tokenizer.json: 100%|██████████| 1.80M/1.80M [00:00<00:00, 69.0MB/s]
endevor-infinityrp-v1-7b-v2-mkmlizer:
tokenizer.model: 0%| | 0.00/493k [00:00<?, ?B/s]
tokenizer.model: 100%|██████████| 493k/493k [00:00<00:00, 10.1MB/s]
endevor-infinityrp-v1-7b-v2-mkmlizer:
tokenizer_config.json: 0%| | 0.00/967 [00:00<?, ?B/s]
tokenizer_config.json: 100%|██████████| 967/967 [00:00<00:00, 15.3MB/s]
endevor-infinityrp-v1-7b-v2-mkmlizer: Downloaded to shared memory in 17.251s
endevor-infinityrp-v1-7b-v2-mkmlizer: quantizing model to /dev/shm/model_cache
endevor-infinityrp-v1-7b-v2-mkmlizer: Saving mkml model at /dev/shm/model_cache
endevor-infinityrp-v1-7b-v2-mkmlizer: Reading /tmp/tmpwmmjf15p/model.safetensors.index.json
endevor-infinityrp-v1-7b-v2-mkmlizer:
Profiling: 0%| | 0/291 [00:00<?, ?it/s]
Profiling: 0%| | 1/291 [00:01<06:07, 1.27s/it]
Profiling: 5%|▍ | 14/291 [00:01<00:20, 13.70it/s]
Profiling: 11%|█ | 31/291 [00:01<00:07, 32.99it/s]
Profiling: 16%|█▋ | 48/291 [00:01<00:04, 52.86it/s]
Profiling: 21%|██ | 61/291 [00:01<00:03, 66.11it/s]
Profiling: 26%|██▌ | 76/291 [00:01<00:02, 82.41it/s]
Profiling: 32%|███▏ | 93/291 [00:01<00:01, 100.54it/s]
Profiling: 37%|███▋ | 108/291 [00:02<00:01, 111.11it/s]
Profiling: 42%|████▏ | 123/291 [00:02<00:01, 117.18it/s]
Profiling: 49%|████▉ | 143/291 [00:02<00:01, 134.41it/s]
Profiling: 55%|█████▍ | 160/291 [00:02<00:00, 142.55it/s]
Profiling: 60%|██████ | 176/291 [00:02<00:00, 145.19it/s]
Profiling: 66%|██████▌ | 192/291 [00:02<00:00, 144.60it/s]
Profiling: 71%|███████▏ | 208/291 [00:04<00:03, 25.53it/s]
Profiling: 76%|███████▋ | 222/291 [00:04<00:02, 32.46it/s]
Profiling: 82%|████████▏ | 239/291 [00:04<00:01, 43.52it/s]
Profiling: 88%|████████▊ | 256/291 [00:04<00:00, 56.30it/s]
Profiling: 93%|█████████▎| 270/291 [00:04<00:00, 66.93it/s]
Profiling: 98%|█████████▊| 285/291 [00:04<00:00, 78.26it/s]
Profiling: 100%|██████████| 291/291 [00:05<00:00, 56.45it/s]
endevor-infinityrp-v1-7b-v2-mkmlizer: quantized model in 15.944s
endevor-infinityrp-v1-7b-v2-mkmlizer: Processed model Endevor/InfinityRP-v1-7B in 34.087s
endevor-infinityrp-v1-7b-v2-mkmlizer: creating bucket guanaco-mkml-models
endevor-infinityrp-v1-7b-v2-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
endevor-infinityrp-v1-7b-v2-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/endevor-infinityrp-v1-7b-v2
endevor-infinityrp-v1-7b-v2-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/endevor-infinityrp-v1-7b-v2/special_tokens_map.json
endevor-infinityrp-v1-7b-v2-mkmlizer: cp /dev/shm/model_cache/tokenizer.model s3://guanaco-mkml-models/endevor-infinityrp-v1-7b-v2/tokenizer.model
endevor-infinityrp-v1-7b-v2-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/endevor-infinityrp-v1-7b-v2/config.json
endevor-infinityrp-v1-7b-v2-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/endevor-infinityrp-v1-7b-v2/tokenizer_config.json
endevor-infinityrp-v1-7b-v2-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/endevor-infinityrp-v1-7b-v2/tokenizer.json
endevor-infinityrp-v1-7b-v2-mkmlizer: cp /dev/shm/model_cache/mkml_model.tensors s3://guanaco-mkml-models/endevor-infinityrp-v1-7b-v2/mkml_model.tensors
endevor-infinityrp-v1-7b-v2-mkmlizer: loading reward model from rirv938/reward_gpt2_medium_preference_24m_e2
endevor-infinityrp-v1-7b-v2-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py:1067: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
endevor-infinityrp-v1-7b-v2-mkmlizer: warnings.warn(
endevor-infinityrp-v1-7b-v2-mkmlizer:
config.json: 0%| | 0.00/1.05k [00:00<?, ?B/s]
config.json: 100%|██████████| 1.05k/1.05k [00:00<00:00, 12.7MB/s]
endevor-infinityrp-v1-7b-v2-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py:690: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
endevor-infinityrp-v1-7b-v2-mkmlizer: warnings.warn(
endevor-infinityrp-v1-7b-v2-mkmlizer:
tokenizer_config.json: 0%| | 0.00/234 [00:00<?, ?B/s]
tokenizer_config.json: 100%|██████████| 234/234 [00:00<00:00, 2.67MB/s]
endevor-infinityrp-v1-7b-v2-mkmlizer:
vocab.json: 0%| | 0.00/1.04M [00:00<?, ?B/s]
vocab.json: 100%|██████████| 1.04M/1.04M [00:00<00:00, 9.58MB/s]
vocab.json: 100%|██████████| 1.04M/1.04M [00:00<00:00, 9.51MB/s]
endevor-infinityrp-v1-7b-v2-mkmlizer:
tokenizer.json: 0%| | 0.00/2.11M [00:00<?, ?B/s]
tokenizer.json: 100%|██████████| 2.11M/2.11M [00:00<00:00, 8.45MB/s]
tokenizer.json: 100%|██████████| 2.11M/2.11M [00:00<00:00, 8.42MB/s]
endevor-infinityrp-v1-7b-v2-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:472: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
endevor-infinityrp-v1-7b-v2-mkmlizer: warnings.warn(
endevor-infinityrp-v1-7b-v2-mkmlizer:
pytorch_model.bin: 0%| | 0.00/1.44G [00:00<?, ?B/s]
pytorch_model.bin: 1%|▏ | 21.0M/1.44G [00:00<00:07, 190MB/s]
pytorch_model.bin: 7%|▋ | 105M/1.44G [00:00<00:02, 488MB/s]
pytorch_model.bin: 12%|█▏ | 178M/1.44G [00:00<00:02, 537MB/s]
pytorch_model.bin: 17%|█▋ | 252M/1.44G [00:00<00:02, 588MB/s]
pytorch_model.bin: 24%|██▍ | 346M/1.44G [00:00<00:01, 702MB/s]
pytorch_model.bin: 29%|██▉ | 419M/1.44G [00:00<00:01, 658MB/s]
pytorch_model.bin: 38%|███▊ | 545M/1.44G [00:00<00:01, 811MB/s]
pytorch_model.bin: 49%|████▊ | 703M/1.44G [00:00<00:00, 1.01GB/s]
pytorch_model.bin: 63%|██████▎ | 912M/1.44G [00:01<00:00, 1.32GB/s]
pytorch_model.bin: 85%|████████▌ | 1.23G/1.44G [00:01<00:00, 1.88GB/s]
pytorch_model.bin: 100%|█████████▉| 1.44G/1.44G [00:01<00:00, 1.18GB/s]
endevor-infinityrp-v1-7b-v2-mkmlizer: Saving model to /tmp/reward_cache/reward.tensors
endevor-infinityrp-v1-7b-v2-mkmlizer: Saving duration: 0.270s
endevor-infinityrp-v1-7b-v2-mkmlizer: Processed model rirv938/reward_gpt2_medium_preference_24m_e2 in 5.109s
endevor-infinityrp-v1-7b-v2-mkmlizer: creating bucket guanaco-reward-models
endevor-infinityrp-v1-7b-v2-mkmlizer: Bucket 's3://guanaco-reward-models/' created
endevor-infinityrp-v1-7b-v2-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/endevor-infinityrp-v1-7b-v2_reward
endevor-infinityrp-v1-7b-v2-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/endevor-infinityrp-v1-7b-v2_reward/tokenizer_config.json
endevor-infinityrp-v1-7b-v2-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/endevor-infinityrp-v1-7b-v2_reward/config.json
endevor-infinityrp-v1-7b-v2-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/endevor-infinityrp-v1-7b-v2_reward/merges.txt
endevor-infinityrp-v1-7b-v2-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/endevor-infinityrp-v1-7b-v2_reward/special_tokens_map.json
endevor-infinityrp-v1-7b-v2-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/endevor-infinityrp-v1-7b-v2_reward/vocab.json
endevor-infinityrp-v1-7b-v2-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/endevor-infinityrp-v1-7b-v2_reward/tokenizer.json
endevor-infinityrp-v1-7b-v2-mkmlizer: cp /tmp/reward_cache/reward.tensors s3://guanaco-reward-models/endevor-infinityrp-v1-7b-v2_reward/reward.tensors
Job endevor-infinityrp-v1-7b-v2-mkmlizer completed after 64.3s with status: succeeded
Stopping job with name endevor-infinityrp-v1-7b-v2-mkmlizer
Pipeline stage MKMLizer completed in 68.39s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.14s
Running pipeline stage ISVCDeployer
Creating inference service endevor-infinityrp-v1-7b-v2
Waiting for inference service endevor-infinityrp-v1-7b-v2 to be ready
Inference service endevor-infinityrp-v1-7b-v2 ready after 40.243735790252686s
Pipeline stage ISVCDeployer completed in 47.88s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.8549907207489014s
Received healthy response to inference request in 1.2063124179840088s
Received healthy response to inference request in 1.217651605606079s
Received healthy response to inference request in 1.2510297298431396s
Received healthy response to inference request in 1.2724590301513672s
5 requests
0 failed requests
5th percentile: 1.2085802555084229
10th percentile: 1.210848093032837
20th percentile: 1.215383768081665
30th percentile: 1.2243272304534911
40th percentile: 1.2376784801483154
50th percentile: 1.2510297298431396
60th percentile: 1.2596014499664308
70th percentile: 1.2681731700897216
80th percentile: 1.388965368270874
90th percentile: 1.6219780445098877
95th percentile: 1.7384843826293945
99th percentile: 1.831689453125
mean time: 1.3604887008666993
Pipeline stage StressChecker completed in 7.76s
Running pipeline stage DaemonicModelEvalScorer
Pipeline stage DaemonicModelEvalScorer completed in 0.05s
Running pipeline stage DaemonicSafetyScorer
Running M-Eval for topic stay_in_character
Pipeline stage DaemonicSafetyScorer completed in 0.05s
M-Eval Dataset for topic stay_in_character is loaded
endevor-infinityrp-v1-7b_v2 status is now inactive due to auto deactivation removed underperforming models
admin requested tearing down of endevor-infinityrp-v1-7b_v2
Running pipeline stage ISVCDeleter
Checking if service endevor-infinityrp-v1-7b-v2 is running
Tearing down inference service endevor-infinityrp-v1-7b-v2
Toredown service endevor-infinityrp-v1-7b-v2
Pipeline stage ISVCDeleter completed in 5.43s
Running pipeline stage MKMLModelDeleter
Cleaning model data from S3
Cleaning model data from model cache
Deleting key endevor-infinityrp-v1-7b-v2/config.json from bucket guanaco-mkml-models
Deleting key endevor-infinityrp-v1-7b-v2/mkml_model.tensors from bucket guanaco-mkml-models
Deleting key endevor-infinityrp-v1-7b-v2/special_tokens_map.json from bucket guanaco-mkml-models
Deleting key endevor-infinityrp-v1-7b-v2/tokenizer.json from bucket guanaco-mkml-models
Deleting key endevor-infinityrp-v1-7b-v2/tokenizer.model from bucket guanaco-mkml-models
Deleting key endevor-infinityrp-v1-7b-v2/tokenizer_config.json from bucket guanaco-mkml-models
Cleaning model data from model cache
Deleting key endevor-infinityrp-v1-7b-v2_reward/config.json from bucket guanaco-reward-models
Deleting key endevor-infinityrp-v1-7b-v2_reward/merges.txt from bucket guanaco-reward-models
Deleting key endevor-infinityrp-v1-7b-v2_reward/reward.tensors from bucket guanaco-reward-models
Deleting key endevor-infinityrp-v1-7b-v2_reward/special_tokens_map.json from bucket guanaco-reward-models
Deleting key endevor-infinityrp-v1-7b-v2_reward/tokenizer.json from bucket guanaco-reward-models
Deleting key endevor-infinityrp-v1-7b-v2_reward/tokenizer_config.json from bucket guanaco-reward-models
Deleting key endevor-infinityrp-v1-7b-v2_reward/vocab.json from bucket guanaco-reward-models
Pipeline stage MKMLModelDeleter completed in 2.45s
endevor-infinityrp-v1-7b_v2 status is now torndown due to DeploymentManager action