submission_id: sanjiwatsuki-kunoichi-7b_v3
developer_uid: zonemercy
status: torndown
model_repo: SanjiWatsuki/Kunoichi-7B
reward_repo: rirv938/reward_gpt2_medium_preference_24m_e2
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>', '###'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
formatter: {'memory_template': "### Instruction:\n\nRole-play as {bot_name} based on the Persona: {memory}.\n\nAvoid repetition, don't loop. Develop the plot slowly, always stay in character. Describe all actions in full, elaborate, explicit, graphic, and vivid detail. Mention all relevant sensory perceptions.", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '### Response:\n\n{bot_name}: {message}\n\n', 'user_template': '### Input:\n\n{user_name}: {message}\n\n', 'response_template': '### Response:\n\n{bot_name}:', 'truncate_by_message': False}
timestamp: 2024-03-22T06:21:41+00:00
model_name: sanjiwatsuki-kunoichi-7b_v3
model_eval_status: success
model_group: SanjiWatsuki/Kunoichi-7B
num_battles: 112113
num_wins: 59413
celo_rating: 1178.88
propriety_score: 0.0
propriety_total_count: 0.0
submission_type: basic
model_architecture: None
model_num_parameters: 7241732096.0
best_of: 16
max_input_tokens: 512
max_output_tokens: 64
display_name: sanjiwatsuki-kunoichi-7b_v3
ineligible_reason: propriety_total_count < 800
language_model: SanjiWatsuki/Kunoichi-7B
model_size: 7B
reward_model: rirv938/reward_gpt2_medium_preference_24m_e2
us_pacific_date: 2024-03-21
win_ratio: 0.5299385441474227
preference_data_url: None
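The win_ratio field above is consistent with num_wins divided by num_battles. A minimal check, assuming that definition (the field names come from the record; the function name is only illustrative):

```python
def win_ratio(num_wins: int, num_battles: int) -> float:
    """Fraction of battles won (assumed definition of the win_ratio field)."""
    if num_battles <= 0:
        raise ValueError("num_battles must be positive")
    return num_wins / num_battles


# Values copied from the num_wins and num_battles fields of this record.
ratio = win_ratio(59413, 112113)
print(f"win_ratio = {ratio:.6f}")
```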
Resubmit model
Running pipeline stage MKMLizer
Starting job with name sanjiwatsuki-kunoichi-7b-v3-mkmlizer
Waiting for job on sanjiwatsuki-kunoichi-7b-v3-mkmlizer to finish
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: ║ _____ __ __ ║
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: ║ /___/ ║
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: ║ ║
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: ║ Version: 0.6.11 ║
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: ║ ║
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: ║ The license key for the current software has been verified as ║
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: ║ belonging to: ║
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: ║ ║
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: ║ Chai Research Corp. ║
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: ║ Expiration: 2024-04-15 23:59:59 ║
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: ║ ║
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: .gitattributes: 100%|██████████| 1.52k/1.52k [00:00<00:00, 17.6MB/s]
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: README.md: 100%|██████████| 4.71k/4.71k [00:00<00:00, 35.0MB/s]
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: assets/asset: 0.00B [00:00, ?B/s]
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: assets/kunoichi.png: 100%|██████████| 704k/704k [00:00<00:00, 3.38MB/s]
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: config.json: 100%|██████████| 674/674 [00:00<00:00, 7.16MB/s]
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: model-00001-of-00002.safetensors: 100%|█████████▉| 9.86G/9.86G [00:39<00:00, 252MB/s]
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: model-00002-of-00002.safetensors: 100%|█████████▉| 4.62G/4.62G [00:18<00:00, 249MB/s]
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: model.safetensors.index.json: 100%|██████████| 22.8k/22.8k [00:00<00:00, 144MB/s]
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: special_tokens_map.json: 100%|██████████| 414/414 [00:00<00:00, 4.88MB/s]
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: tokenizer.json: 100%|██████████| 1.80M/1.80M [00:00<00:00, 16.8MB/s]
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: tokenizer.model: 100%|██████████| 493k/493k [00:00<00:00, 2.33MB/s]
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: tokenizer_config.json: 100%|██████████| 967/967 [00:00<00:00, 12.7MB/s]
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: Downloaded to shared memory in 84.621s
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: quantizing model to /dev/shm/model_cache
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: Saving mkml model at /dev/shm/model_cache
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: Reading /tmp/tmpjfh30mhy/model.safetensors.index.json
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: Profiling: 100%|██████████| 291/291 [00:05<00:00, 56.73it/s]
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: quantized model in 16.854s
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: Processed model SanjiWatsuki/Kunoichi-7B in 102.433s
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: creating bucket guanaco-mkml-models
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/sanjiwatsuki-kunoichi-7b-v3
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/sanjiwatsuki-kunoichi-7b-v3/config.json
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/sanjiwatsuki-kunoichi-7b-v3/special_tokens_map.json
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/sanjiwatsuki-kunoichi-7b-v3/tokenizer_config.json
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer.model s3://guanaco-mkml-models/sanjiwatsuki-kunoichi-7b-v3/tokenizer.model
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/sanjiwatsuki-kunoichi-7b-v3/tokenizer.json
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: cp /dev/shm/model_cache/mkml_model.tensors s3://guanaco-mkml-models/sanjiwatsuki-kunoichi-7b-v3/mkml_model.tensors
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: loading reward model from rirv938/reward_gpt2_medium_preference_24m_e2
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py:1067: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: warnings.warn(
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: config.json: 100%|██████████| 1.05k/1.05k [00:00<00:00, 12.1MB/s]
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py:690: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: warnings.warn(
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: tokenizer_config.json: 100%|██████████| 234/234 [00:00<00:00, 2.69MB/s]
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: vocab.json: 100%|██████████| 1.04M/1.04M [00:00<00:00, 4.44MB/s]
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: tokenizer.json: 100%|██████████| 2.11M/2.11M [00:00<00:00, 2.65MB/s]
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:472: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: warnings.warn(
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: pytorch_model.bin: 100%|█████████▉| 1.44G/1.44G [00:16<00:00, 89.1MB/s]
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: Saving model to /tmp/reward_cache/reward.tensors
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: Saving duration: 0.289s
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: Processed model rirv938/reward_gpt2_medium_preference_24m_e2 in 27.821s
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: creating bucket guanaco-reward-models
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: Bucket 's3://guanaco-reward-models/' created
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/sanjiwatsuki-kunoichi-7b-v3_reward
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/sanjiwatsuki-kunoichi-7b-v3_reward/config.json
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/sanjiwatsuki-kunoichi-7b-v3_reward/special_tokens_map.json
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/sanjiwatsuki-kunoichi-7b-v3_reward/tokenizer_config.json
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/sanjiwatsuki-kunoichi-7b-v3_reward/merges.txt
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/sanjiwatsuki-kunoichi-7b-v3_reward/vocab.json
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/sanjiwatsuki-kunoichi-7b-v3_reward/tokenizer.json
sanjiwatsuki-kunoichi-7b-v3-mkmlizer: cp /tmp/reward_cache/reward.tensors s3://guanaco-reward-models/sanjiwatsuki-kunoichi-7b-v3_reward/reward.tensors
Job sanjiwatsuki-kunoichi-7b-v3-mkmlizer completed after 225.85s with status: succeeded
Stopping job with name sanjiwatsuki-kunoichi-7b-v3-mkmlizer
Pipeline stage MKMLizer completed in 228.97s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.14s
Running pipeline stage ISVCDeployer
Creating inference service sanjiwatsuki-kunoichi-7b-v3
Waiting for inference service sanjiwatsuki-kunoichi-7b-v3 to be ready
Inference service sanjiwatsuki-kunoichi-7b-v3 ready after 40.25096678733826s
Pipeline stage ISVCDeployer completed in 46.95s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2934205532073975s
Received healthy response to inference request in 1.189298152923584s
Received healthy response to inference request in 1.083580493927002s
Received healthy response to inference request in 1.1684939861297607s
Received healthy response to inference request in 1.1761794090270996s
5 requests
0 failed requests
5th percentile: 1.1005631923675536
10th percentile: 1.1175458908081055
20th percentile: 1.151511287689209
30th percentile: 1.1700310707092285
40th percentile: 1.173105239868164
50th percentile: 1.1761794090270996
60th percentile: 1.1814269065856933
70th percentile: 1.1866744041442872
80th percentile: 1.410122632980347
90th percentile: 1.851771593093872
95th percentile: 2.0725960731506348
99th percentile: 2.249255657196045
mean time: 1.3821945190429688
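The percentile figures above can be reproduced from the five logged response times by linear interpolation between sorted samples. The sketch below assumes that method (it is the default behaviour of numpy.percentile, though the log does not say which library StressChecker uses); the function name is illustrative.

```python
def percentile(values, q):
    """Return the q-th percentile (0-100) by linear interpolation
    between the two nearest order statistics."""
    if not values:
        raise ValueError("values must not be empty")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0   # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


# Response times in seconds, copied from the five StressChecker log lines.
latencies = [
    2.2934205532073975,
    1.189298152923584,
    1.083580493927002,
    1.1684939861297607,
    1.1761794090270996,
]

for q in (5, 50, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples, the upper percentiles are dominated by the single 2.29 s first request, so they say little about steady-state latency.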
Pipeline stage StressChecker completed in 7.84s
Running pipeline stage DaemonicModelEvalScorer
Pipeline stage DaemonicModelEvalScorer completed in 0.04s
Running pipeline stage DaemonicSafetyScorer
Running M-Eval for topic stay_in_character
Pipeline stage DaemonicSafetyScorer completed in 0.07s
M-Eval Dataset for topic stay_in_character is loaded
sanjiwatsuki-kunoichi-7b_v3 status is now inactive due to auto deactivation of underperforming models
admin requested tearing down of sanjiwatsuki-kunoichi-7b_v3
Running pipeline stage ISVCDeleter
Checking if service sanjiwatsuki-kunoichi-7b-v3 is running
Tearing down inference service sanjiwatsuki-kunoichi-7b-v3
Tore down service sanjiwatsuki-kunoichi-7b-v3
Pipeline stage ISVCDeleter completed in 3.74s
Running pipeline stage MKMLModelDeleter
Cleaning model data from S3
Cleaning model data from model cache
Deleting key sanjiwatsuki-kunoichi-7b-v3/config.json from bucket guanaco-mkml-models
Deleting key sanjiwatsuki-kunoichi-7b-v3/mkml_model.tensors from bucket guanaco-mkml-models
Deleting key sanjiwatsuki-kunoichi-7b-v3/special_tokens_map.json from bucket guanaco-mkml-models
Deleting key sanjiwatsuki-kunoichi-7b-v3/tokenizer.json from bucket guanaco-mkml-models
Deleting key sanjiwatsuki-kunoichi-7b-v3/tokenizer.model from bucket guanaco-mkml-models
Deleting key sanjiwatsuki-kunoichi-7b-v3/tokenizer_config.json from bucket guanaco-mkml-models
Cleaning model data from model cache
Deleting key sanjiwatsuki-kunoichi-7b-v3_reward/config.json from bucket guanaco-reward-models
Deleting key sanjiwatsuki-kunoichi-7b-v3_reward/merges.txt from bucket guanaco-reward-models
Deleting key sanjiwatsuki-kunoichi-7b-v3_reward/reward.tensors from bucket guanaco-reward-models
Deleting key sanjiwatsuki-kunoichi-7b-v3_reward/special_tokens_map.json from bucket guanaco-reward-models
Deleting key sanjiwatsuki-kunoichi-7b-v3_reward/tokenizer.json from bucket guanaco-reward-models
Deleting key sanjiwatsuki-kunoichi-7b-v3_reward/tokenizer_config.json from bucket guanaco-reward-models
Deleting key sanjiwatsuki-kunoichi-7b-v3_reward/vocab.json from bucket guanaco-reward-models
Pipeline stage MKMLModelDeleter completed in 1.91s
sanjiwatsuki-kunoichi-7b_v3 status is now torndown due to DeploymentManager action
