submission_id: khanhnto-khanhnto_v61
developer_uid: chai_backend_admin
status: inactive
model_repo: khanhnto/khanhnto
reward_repo: rirv938/gpt2_ties_merge_preference_plus_classic_e2_density_99
generation_params: {'temperature': 1.2, 'top_p': 0.7, 'top_k': 50, 'presence_penalty': 0.8, 'frequency_penalty': 0.2, 'stopping_words': ['\n', '<\\s>', '###'], 'max_input_tokens': 512, 'best_of': 8, 'max_output_tokens': 128}
formatter: {'memory_template': "### Instruction:\n\n{bot_name}'s Persona: {memory}.\n\nPlay the role of {bot_name}. Engage in a chat with {user_name} while stay in character. Do not write dialogues and narration for {user_name}. {bot_name} should response with messages of medium length.", 'prompt_template': '{prompt}\n\n', 'bot_template': '### Response:\n\n{bot_name}: {message}\n\n', 'user_template': '### Input:\n\n{user_name}: {message}\n\n', 'response_template': '### Response:\n\n{bot_name}:'}
reward_formatter: {'memory_template': 'Memory: {memory}\n', 'prompt_template': '{prompt}\n', 'bot_template': 'Bot: {message}\n', 'user_template': 'User: {message}\n', 'response_template': 'Bot:'}
timestamp: 2024-03-31T01:30:18+00:00
model_name: khanhnto-128
model_eval_status: success
safety_score: 0.99
entertaining: 6.66
stay_in_character: 8.58
user_preference: 7.18
double_thumbs_up: 1
thumbs_up: 0
thumbs_down: 0
num_battles: 127
num_wins: 62
win_ratio: 0.4881889763779528
celo_rating: None
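Note on the formatter field: the record lists the templates but not how they are combined. The sketch below shows one plausible assembly, assuming the pieces are concatenated in the order memory, prompt, chat turns, response cue. The function name build_prompt, its arguments, and that ordering are illustrative assumptions, not the pipeline's confirmed behaviour; only the template strings come from this record.

```python
# Templates copied verbatim from the `formatter` field of this submission.
FORMATTER = {
    "memory_template": (
        "### Instruction:\n\n{bot_name}'s Persona: {memory}.\n\n"
        "Play the role of {bot_name}. Engage in a chat with {user_name} "
        "while stay in character. Do not write dialogues and narration for "
        "{user_name}. {bot_name} should response with messages of medium length."
    ),
    "prompt_template": "{prompt}\n\n",
    "bot_template": "### Response:\n\n{bot_name}: {message}\n\n",
    "user_template": "### Input:\n\n{user_name}: {message}\n\n",
    "response_template": "### Response:\n\n{bot_name}:",
}


def build_prompt(formatter, bot_name, user_name, memory, prompt, turns):
    """Hypothetical assembly: memory, prompt, chat turns, then the response cue.

    `turns` is a list of (speaker, message) pairs with speaker "bot" or "user".
    """
    names = {"bot_name": bot_name, "user_name": user_name}
    parts = [
        formatter["memory_template"].format(memory=memory, **names),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        # Picks "bot_template" or "user_template" based on the speaker.
        template = formatter[f"{speaker}_template"]
        parts.append(template.format(message=message, **names))
    # The trailing cue leaves the bot's next message for the model to complete.
    parts.append(formatter["response_template"].format(**names))
    return "".join(parts)


if __name__ == "__main__":
    # Example values are made up for illustration.
    print(build_prompt(
        FORMATTER,
        bot_name="Ava",
        user_name="Sam",
        memory="a cheerful barista",
        prompt="A cafe at noon.",
        turns=[("user", "Hi"), ("bot", "Hello!")],
    ))
```

With the generation_params above, the rendered text would presumably be truncated to max_input_tokens (512) before sampling; how that truncation is done is not shown in this record.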
Running pipeline stage MKMLizer
Starting job with name khanhnto-khanhnto-v61-mkmlizer
Waiting for job on khanhnto-khanhnto-v61-mkmlizer to finish
khanhnto-khanhnto-v61-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
khanhnto-khanhnto-v61-mkmlizer: ║ _____ __ __ ║
khanhnto-khanhnto-v61-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
khanhnto-khanhnto-v61-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
khanhnto-khanhnto-v61-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
khanhnto-khanhnto-v61-mkmlizer: ║ /___/ ║
khanhnto-khanhnto-v61-mkmlizer: ║ ║
khanhnto-khanhnto-v61-mkmlizer: ║ Version: 0.6.11 ║
khanhnto-khanhnto-v61-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
khanhnto-khanhnto-v61-mkmlizer: ║ ║
khanhnto-khanhnto-v61-mkmlizer: ║ The license key for the current software has been verified as ║
khanhnto-khanhnto-v61-mkmlizer: ║ belonging to: ║
khanhnto-khanhnto-v61-mkmlizer: ║ ║
khanhnto-khanhnto-v61-mkmlizer: ║ Chai Research Corp. ║
khanhnto-khanhnto-v61-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
khanhnto-khanhnto-v61-mkmlizer: ║ Expiration: 2024-07-15 23:59:59 ║
khanhnto-khanhnto-v61-mkmlizer: ║ ║
khanhnto-khanhnto-v61-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
khanhnto-khanhnto-v61-mkmlizer: .gitattributes: 100%|██████████| 1.52k/1.52k [00:00<00:00, 18.9MB/s]
khanhnto-khanhnto-v61-mkmlizer: added_tokens.json: 100%|██████████| 21.0/21.0 [00:00<00:00, 173kB/s]
khanhnto-khanhnto-v61-mkmlizer: config.json: 100%|██████████| 702/702 [00:00<00:00, 5.56MB/s]
khanhnto-khanhnto-v61-mkmlizer: generation_config.json: 100%|██████████| 137/137 [00:00<00:00, 1.12MB/s]
khanhnto-khanhnto-v61-mkmlizer: model-00001-of-00006.safetensors: 100%|█████████▉| 4.98G/4.98G [00:05<00:00, 959MB/s]
khanhnto-khanhnto-v61-mkmlizer: model-00002-of-00006.safetensors: 100%|█████████▉| 4.97G/4.97G [00:03<00:00, 1.27GB/s]
khanhnto-khanhnto-v60-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
khanhnto-khanhnto-v60-mkmlizer: ║ _____ __ __ ║
khanhnto-khanhnto-v60-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
khanhnto-khanhnto-v60-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
khanhnto-khanhnto-v60-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
khanhnto-khanhnto-v60-mkmlizer: ║ /___/ ║
khanhnto-khanhnto-v60-mkmlizer: ║ ║
khanhnto-khanhnto-v60-mkmlizer: ║ Version: 0.6.11 ║
khanhnto-khanhnto-v60-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
khanhnto-khanhnto-v60-mkmlizer: ║ ║
khanhnto-khanhnto-v60-mkmlizer: ║ The license key for the current software has been verified as ║
khanhnto-khanhnto-v60-mkmlizer: ║ belonging to: ║
khanhnto-khanhnto-v60-mkmlizer: ║ ║
khanhnto-khanhnto-v60-mkmlizer: ║ Chai Research Corp. ║
khanhnto-khanhnto-v60-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
khanhnto-khanhnto-v60-mkmlizer: ║ Expiration: 2024-07-15 23:59:59 ║
khanhnto-khanhnto-v60-mkmlizer: ║ ║
khanhnto-khanhnto-v60-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
khanhnto-khanhnto-v60-mkmlizer: .gitattributes: 100%|██████████| 1.52k/1.52k [00:00<00:00, 17.9MB/s]
khanhnto-khanhnto-v60-mkmlizer: added_tokens.json: 100%|██████████| 21.0/21.0 [00:00<00:00, 178kB/s]
khanhnto-khanhnto-v60-mkmlizer: config.json: 100%|██████████| 702/702 [00:00<00:00, 3.74MB/s]
khanhnto-khanhnto-v60-mkmlizer: generation_config.json: 100%|██████████| 137/137 [00:00<00:00, 1.73MB/s]
khanhnto-khanhnto-v61-mkmlizer: model-00003-of-00006.safetensors: 100%|█████████▉| 4.97G/4.97G [00:07<00:00, 626MB/s]
khanhnto-khanhnto-v61-mkmlizer: model-00004-of-00006.safetensors: 100%|█████████▉| 4.93G/4.93G [00:05<00:00, 913MB/s]
khanhnto-khanhnto-v60-mkmlizer: model-00002-of-00006.safetensors: 100%|█████████▉| 4.97G/4.97G [00:03<00:00, 1.38GB/s]
khanhnto-khanhnto-v60-mkmlizer: model-00003-of-00006.safetensors: 100%|█████████▉| 4.97G/4.97G [00:04<00:00, 1.06GB/s]
khanhnto-khanhnto-v61-mkmlizer: model-00005-of-00006.safetensors: 100%|█████████▉| 4.93G/4.93G [00:04<00:00, 1.14GB/s]
khanhnto-khanhnto-v61-mkmlizer: model-00006-of-00006.safetensors: 100%|█████████▉| 1.25G/1.25G [00:02<00:00, 573MB/s]
khanhnto-khanhnto-v61-mkmlizer: model.safetensors.index.json: 100%|██████████| 29.9k/29.9k [00:00<00:00, 39.7MB/s]
khanhnto-khanhnto-v61-mkmlizer: special_tokens_map.json: 100%|██████████| 548/548 [00:00<00:00, 7.39MB/s]
khanhnto-khanhnto-v61-mkmlizer: tokenizer.json: 100%|██████████| 1.84M/1.84M [00:00<00:00, 10.4MB/s]
khanhnto-khanhnto-v61-mkmlizer: tokenizer.model: 100%|██████████| 500k/500k [00:00<00:00, 5.55MB/s]
khanhnto-khanhnto-v61-mkmlizer: tokenizer_config.json: 100%|██████████| 1.02k/1.02k [00:00<00:00, 9.25MB/s]
khanhnto-khanhnto-v61-mkmlizer: Downloaded to shared memory in 32.179s
khanhnto-khanhnto-v61-mkmlizer: quantizing model to /dev/shm/model_cache
khanhnto-khanhnto-v61-mkmlizer: Saving mkml model at /dev/shm/model_cache
khanhnto-khanhnto-v61-mkmlizer: Reading /tmp/tmpuomiz0tb/model.safetensors.index.json
khanhnto-khanhnto-v60-mkmlizer: model-00004-of-00006.safetensors: 100%|█████████▉| 4.93G/4.93G [00:04<00:00, 1.10GB/s]
khanhnto-khanhnto-v60-mkmlizer: model-00005-of-00006.safetensors: 100%|█████████▉| 4.93G/4.93G [00:03<00:00, 1.38GB/s]
khanhnto-khanhnto-v60-mkmlizer: model-00006-of-00006.safetensors: 100%|█████████▉| 1.25G/1.25G [00:00<00:00, 1.61GB/s]
khanhnto-khanhnto-v60-mkmlizer: model.safetensors.index.json: 100%|██████████| 29.9k/29.9k [00:00<00:00, 15.1MB/s]
khanhnto-khanhnto-v60-mkmlizer: special_tokens_map.json: 100%|██████████| 548/548 [00:00<00:00, 6.02MB/s]
khanhnto-khanhnto-v60-mkmlizer: tokenizer.json: 100%|██████████| 1.84M/1.84M [00:00<00:00, 26.6MB/s]
khanhnto-khanhnto-v60-mkmlizer: tokenizer.model: 100%|██████████| 500k/500k [00:00<00:00, 59.4MB/s]
khanhnto-khanhnto-v60-mkmlizer: tokenizer_config.json: 100%|██████████| 1.02k/1.02k [00:00<00:00, 8.32MB/s]
khanhnto-khanhnto-v60-mkmlizer: Downloaded to shared memory in 22.738s
khanhnto-khanhnto-v60-mkmlizer: quantizing model to /dev/shm/model_cache
khanhnto-khanhnto-v60-mkmlizer: Saving mkml model at /dev/shm/model_cache
khanhnto-khanhnto-v60-mkmlizer: Reading /tmp/tmpfn6_xfzi/model.safetensors.index.json
khanhnto-khanhnto-v61-mkmlizer: Profiling: 100%|██████████| 363/363 [00:07<00:00, 45.81it/s]
khanhnto-khanhnto-v60-mkmlizer: Profiling: 100%|██████████| 363/363 [00:07<00:00, 48.03it/s]
khanhnto-khanhnto-v61-mkmlizer: Processed model khanhnto/khanhnto in 61.966s
khanhnto-khanhnto-v61-mkmlizer: creating bucket guanaco-mkml-models
khanhnto-khanhnto-v61-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
khanhnto-khanhnto-v61-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/khanhnto-khanhnto-v61
khanhnto-khanhnto-v61-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/khanhnto-khanhnto-v61/config.json
khanhnto-khanhnto-v61-mkmlizer: cp /dev/shm/model_cache/added_tokens.json s3://guanaco-mkml-models/khanhnto-khanhnto-v61/added_tokens.json
khanhnto-khanhnto-v61-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/khanhnto-khanhnto-v61/special_tokens_map.json
khanhnto-khanhnto-v61-mkmlizer: cp /dev/shm/model_cache/tokenizer.model s3://guanaco-mkml-models/khanhnto-khanhnto-v61/tokenizer.model
khanhnto-khanhnto-v61-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/khanhnto-khanhnto-v61/tokenizer_config.json
khanhnto-khanhnto-v61-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/khanhnto-khanhnto-v61/tokenizer.json
khanhnto-khanhnto-v61-mkmlizer: cp /dev/shm/model_cache/mkml_model.tensors s3://guanaco-mkml-models/khanhnto-khanhnto-v61/mkml_model.tensors
khanhnto-khanhnto-v61-mkmlizer: loading reward model from rirv938/gpt2_ties_merge_preference_plus_classic_e2_density_99
khanhnto-khanhnto-v61-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py:1067: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
khanhnto-khanhnto-v61-mkmlizer: warnings.warn(
khanhnto-khanhnto-v61-mkmlizer: config.json: 100%|██████████| 983/983 [00:00<00:00, 10.3MB/s]
khanhnto-khanhnto-v61-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py:690: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
khanhnto-khanhnto-v61-mkmlizer: warnings.warn(
khanhnto-khanhnto-v61-mkmlizer: tokenizer_config.json: 100%|██████████| 445/445 [00:00<00:00, 5.57MB/s]
khanhnto-khanhnto-v61-mkmlizer: vocab.json: 100%|██████████| 798k/798k [00:00<00:00, 8.49MB/s]
khanhnto-khanhnto-v61-mkmlizer: merges.txt: 100%|██████████| 456k/456k [00:00<00:00, 10.1MB/s]
khanhnto-khanhnto-v61-mkmlizer: tokenizer.json: 100%|██████████| 2.11M/2.11M [00:00<00:00, 21.9MB/s]
khanhnto-khanhnto-v61-mkmlizer: special_tokens_map.json: 100%|██████████| 441/441 [00:00<00:00, 3.25MB/s]
khanhnto-khanhnto-v61-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:472: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
khanhnto-khanhnto-v61-mkmlizer: warnings.warn(
khanhnto-khanhnto-v60-mkmlizer: quantized model in 25.920s
khanhnto-khanhnto-v60-mkmlizer: Processed model khanhnto/khanhnto in 50.331s
khanhnto-khanhnto-v60-mkmlizer: creating bucket guanaco-mkml-models
khanhnto-khanhnto-v60-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
khanhnto-khanhnto-v60-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/khanhnto-khanhnto-v60
khanhnto-khanhnto-v60-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/khanhnto-khanhnto-v60/config.json
khanhnto-khanhnto-v60-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/khanhnto-khanhnto-v60/special_tokens_map.json
khanhnto-khanhnto-v60-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/khanhnto-khanhnto-v60/tokenizer_config.json
khanhnto-khanhnto-v60-mkmlizer: cp /dev/shm/model_cache/added_tokens.json s3://guanaco-mkml-models/khanhnto-khanhnto-v60/added_tokens.json
khanhnto-khanhnto-v60-mkmlizer: cp /dev/shm/model_cache/tokenizer.model s3://guanaco-mkml-models/khanhnto-khanhnto-v60/tokenizer.model
khanhnto-khanhnto-v60-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/khanhnto-khanhnto-v60/tokenizer.json
khanhnto-khanhnto-v61-mkmlizer: Bucket 's3://guanaco-reward-models/' created
khanhnto-khanhnto-v61-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/khanhnto-khanhnto-v61_reward
khanhnto-khanhnto-v61-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/khanhnto-khanhnto-v61_reward/config.json
khanhnto-khanhnto-v61-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/khanhnto-khanhnto-v61_reward/tokenizer_config.json
khanhnto-khanhnto-v61-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/khanhnto-khanhnto-v61_reward/special_tokens_map.json
khanhnto-khanhnto-v61-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/khanhnto-khanhnto-v61_reward/vocab.json
khanhnto-khanhnto-v61-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/khanhnto-khanhnto-v61_reward/merges.txt
khanhnto-khanhnto-v61-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/khanhnto-khanhnto-v61_reward/tokenizer.json
khanhnto-khanhnto-v61-mkmlizer: cp /tmp/reward_cache/reward.tensors s3://guanaco-reward-models/khanhnto-khanhnto-v61_reward/reward.tensors
Job khanhnto-khanhnto-v61-mkmlizer completed after 87.57s with status: succeeded
Stopping job with name khanhnto-khanhnto-v61-mkmlizer
Pipeline stage MKMLizer completed in 88.17s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.15s
Running pipeline stage ISVCDeployer
Creating inference service khanhnto-khanhnto-v61
Waiting for inference service khanhnto-khanhnto-v61 to be ready
khanhnto-khanhnto-v60-mkmlizer: loading reward model from rirv938/gpt2_ties_merge_preference_plus_classic_e2_density_99
khanhnto-khanhnto-v60-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py:1067: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
khanhnto-khanhnto-v60-mkmlizer: warnings.warn(
khanhnto-khanhnto-v60-mkmlizer: config.json: 100%|██████████| 983/983 [00:00<00:00, 10.9MB/s]
khanhnto-khanhnto-v60-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py:690: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
khanhnto-khanhnto-v60-mkmlizer: warnings.warn(
khanhnto-khanhnto-v60-mkmlizer: tokenizer_config.json: 100%|██████████| 445/445 [00:00<00:00, 5.00MB/s]
khanhnto-khanhnto-v60-mkmlizer: vocab.json: 100%|██████████| 798k/798k [00:00<00:00, 34.2MB/s]
khanhnto-khanhnto-v60-mkmlizer: merges.txt: 100%|██████████| 456k/456k [00:00<00:00, 6.38MB/s]
khanhnto-khanhnto-v60-mkmlizer: tokenizer.json: 100%|██████████| 2.11M/2.11M [00:00<00:00, 43.9MB/s]
khanhnto-khanhnto-v60-mkmlizer: special_tokens_map.json: 100%|██████████| 441/441 [00:00<00:00, 4.84MB/s]
khanhnto-khanhnto-v60-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:472: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
khanhnto-khanhnto-v60-mkmlizer: warnings.warn(
khanhnto-khanhnto-v60-mkmlizer: model.safetensors.index.json: 100%|██████████| 10.5k/10.5k [00:00<00:00, 45.4MB/s]
khanhnto-khanhnto-v60-mkmlizer: model-00001-of-00001.safetensors: 100%|█████████▉| 249M/249M [00:00<00:00, 1.12GB/s]
khanhnto-khanhnto-v60-mkmlizer: Downloading shards: 100%|██████████| 1/1 [00:00<00:00, 2.66it/s]
khanhnto-khanhnto-v60-mkmlizer: Saving model to /tmp/reward_cache/reward.tensors
khanhnto-khanhnto-v60-mkmlizer: Saving duration: 0.087s
khanhnto-khanhnto-v60-mkmlizer: Processed model rirv938/gpt2_ties_merge_preference_plus_classic_e2_density_99 in 2.219s
khanhnto-khanhnto-v60-mkmlizer: creating bucket guanaco-reward-models
khanhnto-khanhnto-v60-mkmlizer: Bucket 's3://guanaco-reward-models/' created
khanhnto-khanhnto-v60-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/khanhnto-khanhnto-v60_reward
khanhnto-khanhnto-v60-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/khanhnto-khanhnto-v60_reward/tokenizer_config.json
khanhnto-khanhnto-v60-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/khanhnto-khanhnto-v60_reward/config.json
khanhnto-khanhnto-v60-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/khanhnto-khanhnto-v60_reward/special_tokens_map.json
khanhnto-khanhnto-v60-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/khanhnto-khanhnto-v60_reward/merges.txt
khanhnto-khanhnto-v60-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/khanhnto-khanhnto-v60_reward/vocab.json
khanhnto-khanhnto-v60-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/khanhnto-khanhnto-v60_reward/tokenizer.json
khanhnto-khanhnto-v60-mkmlizer: cp /tmp/reward_cache/reward.tensors s3://guanaco-reward-models/khanhnto-khanhnto-v60_reward/reward.tensors
Job khanhnto-khanhnto-v60-mkmlizer completed after 111.73s with status: succeeded
Stopping job with name khanhnto-khanhnto-v60-mkmlizer
Pipeline stage MKMLizer completed in 113.56s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.15s
Running pipeline stage ISVCDeployer
Creating inference service khanhnto-khanhnto-v60
Waiting for inference service khanhnto-khanhnto-v60 to be ready
Inference service khanhnto-khanhnto-v61 ready after 50.34753775596619s
Pipeline stage ISVCDeployer completed in 56.29s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.5024983882904053s
Received healthy response to inference request in 1.695281744003296s
Received healthy response to inference request in 1.293116807937622s
Received healthy response to inference request in 1.9686028957366943s
Received healthy response to inference request in 1.2665250301361084s
5 requests
0 failed requests
5th percentile: 1.271843385696411
10th percentile: 1.2771617412567138
20th percentile: 1.2877984523773194
30th percentile: 1.3349931240081787
40th percentile: 1.418745756149292
50th percentile: 1.5024983882904053
Inference service khanhnto-khanhnto-v60 ready after 50.33349347114563s
60th percentile: 1.5796117305755615
Pipeline stage ISVCDeployer completed in 56.18s
70th percentile: 1.6567250728607177
Running pipeline stage StressChecker
80th percentile: 1.7499459743499757
90th percentile: 1.859274435043335
95th percentile: 1.9139386653900146
99th percentile: 1.9576700496673585
mean time: 1.5452049732208253
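
Note on the StressChecker figures above: the percentiles and mean can be recomputed from the five "Received healthy response" latencies. The sketch below is a minimal illustration, not the pipeline's own code. It assumes linear interpolation between closest ranks, which the log does not state but which appears consistent with the reported values; the function name `percentile` and the variable `latencies` are hypothetical.

```python
import math

# Latencies in seconds, copied from the five
# "Received healthy response to inference request" lines.
latencies = [
    1.5024983882904053,
    1.695281744003296,
    1.293116807937622,
    1.9686028957366943,
    1.2665250301361084,
]


def percentile(samples, q):
    """Return the q-th percentile (0 <= q <= 100) of samples.

    Assumption: linear interpolation between the two closest ranks.
    The log does not say which method the StressChecker uses.
    """
    ordered = sorted(samples)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


# Print the same rows the StressChecker reports.
for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples, every percentile is an interpolation between two neighbouring measurements, so the upper percentiles (95th, 99th) mostly reflect the single slowest request.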
Pipeline stage StressChecker completed in 10.07s
Running pipeline stage DaemonicModelEvalScorer
Pipeline stage DaemonicModelEvalScorer completed in 0.12s
Running M-Eval for topic stay_in_character
Running pipeline stage DaemonicSafetyScorer
M-Eval Dataset for topic stay_in_character is loaded
Pipeline stage DaemonicSafetyScorer completed in 0.22s
khanhnto-khanhnto_v61 status is now deployed due to DeploymentManager action
khanhnto-khanhnto_v61 status is now inactive due to admin request