submission_id: khanhnto-khanhnto_v49
developer_uid: robert_irvine
status: inactive
model_repo: khanhnto/khanhnto
reward_repo: rirv938/gpt2_ties_merge_preference_plus_classic_e2_density_99
generation_params: {'temperature': 1.2, 'top_p': 0.7, 'top_k': 50, 'presence_penalty': 0.8, 'frequency_penalty': 0.2, 'stopping_words': ['<\\s>', '###', '\n'], 'max_input_tokens': 512, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "### Instruction:\n\n{bot_name}'s Persona: {memory}.\n\nPlay the role of {bot_name}. Engage in a chat with {user_name} while stay in character. Do not write dialogues and narration for {user_name}. {bot_name} should response with messages of medium length.", 'prompt_template': '{prompt}\n\n', 'bot_template': '### Response:\n\n{bot_name}: {message}\n\n', 'user_template': '### Input:\n\n{user_name}: {message}\n\n', 'response_template': '### Response:\n\n{bot_name}:'}
reward_formatter: {'memory_template': 'Memory: {memory}\n', 'prompt_template': '{prompt}\n', 'bot_template': 'Bot: {message}\n', 'user_template': 'User: {message}\n', 'response_template': 'Bot:'}
timestamp: 2024-02-16T00:57:50+00:00
model_name: khanhnto-khanhnto_v49
model_eval_status: success
safety_score: 0.98
entertaining: 6.86
stay_in_character: 8.58
user_preference: 7.34
double_thumbs_up: 3351
thumbs_up: 5272
thumbs_down: 2534
num_battles: 122858
num_wins: 59793
win_ratio: 0.48668381383385695
celo_rating: 1146.63
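A minimal sketch of how the `formatter` templates above could be assembled into a single prompt string. The template strings are copied verbatim from the `formatter` field; the function name `build_prompt`, the `turns` structure, and the concatenation order (memory, prompt, chat turns, response stub) are assumptions made for illustration, not the platform's confirmed behaviour.

```python
# Templates copied verbatim from the `formatter` field of this submission.
FORMATTER = {
    "memory_template": (
        "### Instruction:\n\n{bot_name}'s Persona: {memory}.\n\n"
        "Play the role of {bot_name}. Engage in a chat with {user_name} while stay in character. "
        "Do not write dialogues and narration for {user_name}. "
        "{bot_name} should response with messages of medium length."
    ),
    "prompt_template": "{prompt}\n\n",
    "bot_template": "### Response:\n\n{bot_name}: {message}\n\n",
    "user_template": "### Input:\n\n{user_name}: {message}\n\n",
    "response_template": "### Response:\n\n{bot_name}:",
}


def build_prompt(fmt, bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The ordering below is an assumption.
    """
    parts = [
        fmt["memory_template"].format(
            bot_name=bot_name, user_name=user_name, memory=memory
        ),
        fmt["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(fmt["bot_template"].format(bot_name=bot_name, message=message))
        else:
            parts.append(fmt["user_template"].format(user_name=user_name, message=message))
    # The response stub leaves the bot's next message open for the model to complete.
    parts.append(fmt["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    # Example values are invented for illustration only.
    print(build_prompt(FORMATTER, "Aria", "Sam", "A cheerful guide",
                       "Aria greets Sam.", [("user", "Hi!")]))
```

The prompt ends on the open `### Response:` stub, which is consistent with `'\n'` and `'###'` being listed among the `stopping_words`: generation would stop at the end of a single bot message.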
Running pipeline stage MKMLizer
Starting job with name khanhnto-khanhnto-v49-mkmlizer
Waiting for job on khanhnto-khanhnto-v49-mkmlizer to finish
khanhnto-khanhnto-v49-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
khanhnto-khanhnto-v49-mkmlizer: ║ _____ __ __ ║
khanhnto-khanhnto-v49-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
khanhnto-khanhnto-v49-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
khanhnto-khanhnto-v49-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
khanhnto-khanhnto-v49-mkmlizer: ║ /___/ ║
khanhnto-khanhnto-v49-mkmlizer: ║ ║
khanhnto-khanhnto-v49-mkmlizer: ║ Version: 0.6.11 ║
khanhnto-khanhnto-v49-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
khanhnto-khanhnto-v49-mkmlizer: ║ ║
khanhnto-khanhnto-v49-mkmlizer: ║ The license key for the current software has been verified as ║
khanhnto-khanhnto-v49-mkmlizer: ║ belonging to: ║
khanhnto-khanhnto-v49-mkmlizer: ║ ║
khanhnto-khanhnto-v49-mkmlizer: ║ Chai Research Corp. ║
khanhnto-khanhnto-v49-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
khanhnto-khanhnto-v49-mkmlizer: ║ Expiration: 2024-04-15 23:59:59 ║
khanhnto-khanhnto-v49-mkmlizer: ║ ║
khanhnto-khanhnto-v49-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
khanhnto-khanhnto-v49-mkmlizer: .gitattributes: 100%|██████████| 1.52k/1.52k [00:00<00:00, 14.0MB/s]
khanhnto-khanhnto-v49-mkmlizer: added_tokens.json: 100%|██████████| 21.0/21.0 [00:00<00:00, 171kB/s]
khanhnto-khanhnto-v49-mkmlizer: config.json: 100%|██████████| 702/702 [00:00<00:00, 5.68MB/s]
khanhnto-khanhnto-v49-mkmlizer: generation_config.json: 100%|██████████| 137/137 [00:00<00:00, 2.18MB/s]
khanhnto-khanhnto-v49-mkmlizer: model-00001-of-00006.safetensors: 100%|█████████▉| 4.98G/4.98G [00:05<00:00, 961MB/s]
khanhnto-khanhnto-v49-mkmlizer: model-00002-of-00006.safetensors: 100%|█████████▉| 4.97G/4.97G [00:07<00:00, 672MB/s]
khanhnto-khanhnto-v49-mkmlizer: model-00003-of-00006.safetensors: 100%|█████████▉| 4.97G/4.97G [00:05<00:00, 924MB/s]
khanhnto-khanhnto-v49-mkmlizer: model-00004-of-00006.safetensors: 100%|█████████▉| 4.93G/4.93G [00:05<00:00, 904MB/s]
khanhnto-khanhnto-v49-mkmlizer: model-00005-of-00006.safetensors: 100%|█████████▉| 4.93G/4.93G [00:04<00:00, 999MB/s]
khanhnto-khanhnto-v49-mkmlizer: model-00006-of-00006.safetensors: 100%|█████████▉| 1.25G/1.25G [00:03<00:00, 382MB/s]
khanhnto-khanhnto-v49-mkmlizer: tokenizer.model: 100%|██████████| 500k/500k [00:00<00:00, 5.50MB/s]
khanhnto-khanhnto-v49-mkmlizer: tokenizer_config.json: 100%|██████████| 1.02k/1.02k [00:00<00:00, 8.27MB/s]
khanhnto-khanhnto-v49-mkmlizer: Downloaded to shared memory in 35.631s
khanhnto-khanhnto-v49-mkmlizer: quantizing model to /dev/shm/model_cache
khanhnto-khanhnto-v49-mkmlizer: Saving mkml model at /dev/shm/model_cache
khanhnto-khanhnto-v49-mkmlizer: Reading /tmp/tmp2aohfc8a/model.safetensors.index.json
khanhnto-khanhnto-v49-mkmlizer: Profiling: 100%|██████████| 363/363 [00:10<00:00, 35.29it/s]
khanhnto-khanhnto-v49-mkmlizer: quantized model in 30.396s
khanhnto-khanhnto-v49-mkmlizer: Processed model khanhnto/khanhnto in 67.983s
khanhnto-khanhnto-v49-mkmlizer: creating bucket guanaco-mkml-models
khanhnto-khanhnto-v49-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
khanhnto-khanhnto-v49-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/khanhnto-khanhnto-v49
khanhnto-khanhnto-v49-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/khanhnto-khanhnto-v49/config.json
khanhnto-khanhnto-v49-mkmlizer: cp /dev/shm/model_cache/tokenizer.model s3://guanaco-mkml-models/khanhnto-khanhnto-v49/tokenizer.model
khanhnto-khanhnto-v49-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/khanhnto-khanhnto-v49/tokenizer_config.json
khanhnto-khanhnto-v49-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/khanhnto-khanhnto-v49/special_tokens_map.json
khanhnto-khanhnto-v49-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/khanhnto-khanhnto-v49/tokenizer.json
khanhnto-khanhnto-v49-mkmlizer: cp /dev/shm/model_cache/added_tokens.json s3://guanaco-mkml-models/khanhnto-khanhnto-v49/added_tokens.json
khanhnto-khanhnto-v49-mkmlizer: cp /dev/shm/model_cache/mkml_model.tensors s3://guanaco-mkml-models/khanhnto-khanhnto-v49/mkml_model.tensors
khanhnto-khanhnto-v49-mkmlizer: tokenizer_config.json: 100%|██████████| 445/445 [00:00<00:00, 3.09MB/s]
khanhnto-khanhnto-v49-mkmlizer: vocab.json: 100%|██████████| 798k/798k [00:00<00:00, 7.86MB/s]
khanhnto-khanhnto-v49-mkmlizer: merges.txt: 100%|██████████| 456k/456k [00:00<00:00, 38.2MB/s]
khanhnto-khanhnto-v49-mkmlizer: tokenizer.json: 100%|██████████| 2.11M/2.11M [00:00<00:00, 19.3MB/s]
khanhnto-khanhnto-v49-mkmlizer: special_tokens_map.json: 100%|██████████| 441/441 [00:00<00:00, 3.48MB/s]
khanhnto-khanhnto-v49-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:472: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
khanhnto-khanhnto-v49-mkmlizer: warnings.warn(
khanhnto-khanhnto-v49-mkmlizer: model.safetensors.index.json: 100%|██████████| 10.5k/10.5k [00:00<00:00, 87.1MB/s]
khanhnto-khanhnto-v49-mkmlizer: model-00001-of-00001.safetensors: 100%|█████████▉| 249M/249M [00:00<00:00, 277MB/s]
khanhnto-khanhnto-v49-mkmlizer: Downloading shards: 100%|██████████| 1/1 [00:01<00:00, 1.38s/it]
khanhnto-khanhnto-v49-mkmlizer: Saving model to /tmp/reward_cache/reward.tensors
khanhnto-khanhnto-v49-mkmlizer: Saving duration: 0.085s
khanhnto-khanhnto-v49-mkmlizer: Processed model rirv938/gpt2_ties_merge_preference_plus_classic_e2_density_99 in 3.302s
khanhnto-khanhnto-v49-mkmlizer: creating bucket guanaco-reward-models
khanhnto-khanhnto-v49-mkmlizer: Bucket 's3://guanaco-reward-models/' created
khanhnto-khanhnto-v49-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/khanhnto-khanhnto-v49_reward
khanhnto-khanhnto-v49-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/khanhnto-khanhnto-v49_reward/config.json
khanhnto-khanhnto-v49-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/khanhnto-khanhnto-v49_reward/tokenizer_config.json
khanhnto-khanhnto-v49-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/khanhnto-khanhnto-v49_reward/special_tokens_map.json
khanhnto-khanhnto-v49-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/khanhnto-khanhnto-v49_reward/vocab.json
khanhnto-khanhnto-v49-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/khanhnto-khanhnto-v49_reward/merges.txt
khanhnto-khanhnto-v49-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/khanhnto-khanhnto-v49_reward/tokenizer.json
khanhnto-khanhnto-v49-mkmlizer: cp /tmp/reward_cache/reward.tensors s3://guanaco-reward-models/khanhnto-khanhnto-v49_reward/reward.tensors
Job khanhnto-khanhnto-v49-mkmlizer completed after 95.97s with status: succeeded
Stopping job with name khanhnto-khanhnto-v49-mkmlizer
Pipeline stage MKMLizer completed in 101.45s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.12s
Running pipeline stage ISVCDeployer
Creating inference service khanhnto-khanhnto-v49
Waiting for inference service khanhnto-khanhnto-v49 to be ready
Inference service khanhnto-khanhnto-v49 ready after 50.328490257263184s
Pipeline stage ISVCDeployer completed in 58.59s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.5496671199798584s
Received healthy response to inference request in 1.2753117084503174s
Received healthy response to inference request in 1.099461317062378s
Received healthy response to inference request in 1.50203537940979s
Received healthy response to inference request in 1.5526762008666992s
5 requests
0 failed requests
5th percentile: 1.1346313953399658
10th percentile: 1.1698014736175537
20th percentile: 1.2401416301727295
30th percentile: 1.3206564426422118
40th percentile: 1.411345911026001
50th percentile: 1.50203537940979
60th percentile: 1.5222917079925538
70th percentile: 1.5425480365753175
80th percentile: 1.7520743846893312
90th percentile: 2.150870752334595
95th percentile: 2.350268936157226
99th percentile: 2.509787483215332
mean time: 1.5958303451538085
Pipeline stage StressChecker completed in 8.95s
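The percentile and mean figures reported by the StressChecker stage above can be reproduced from the five logged response times using linear interpolation between order statistics. The sketch below is an illustration under that assumption; the function name `percentile` is hypothetical, and the stage's actual implementation is not shown in this log.

```python
import math

# Response times (seconds) copied from the five "Received healthy response" lines.
LATENCIES = [
    2.5496671199798584,
    1.2753117084503174,
    1.099461317062378,
    1.50203537940979,
    1.5526762008666992,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation.

    The fractional rank (n - 1) * q / 100 is split into its two
    neighbouring order statistics, which are then blended linearly.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(LATENCIES, q)}")
    print(f"mean time: {sum(LATENCIES) / len(LATENCIES)}")
```

With only five samples, the upper percentiles are dominated by the single slowest request (about 2.55 s), so they say little about tail latency under real load.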
Running pipeline stage DaemonicModelEvalScorer
Pipeline stage DaemonicModelEvalScorer completed in 0.04s
Running pipeline stage DaemonicSafetyScorer
Running M-Eval for topic stay_in_character
Pipeline stage DaemonicSafetyScorer completed in 0.06s
M-Eval Dataset for topic stay_in_character is loaded
AUTO_DEACTIVATION: submission %s deactivated %s
khanhnto-khanhnto_v49 status is now inactive due to auto deactivation removed underperforming models
AUTO_DEACTIVATION: submission %s deactivated %s
khanhnto-khanhnto_v49 status is now deployed due to admin request
khanhnto-khanhnto_v49 status is now inactive due to auto deactivation removed underperforming models
