submission_id: lex-hue-delexa-7b_v1
developer_uid: Meliodia
status: torndown
model_repo: lex-hue/Delexa-7b
reward_repo: ChaiML/reward_gpt2_medium_preference_24m_e2
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
reward_formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
timestamp: 2024-04-16T05:08:07+00:00
model_name: lex-hue-delexa-7b_v1
model_eval_status: success
model_group: lex-hue/Delexa-7b
num_battles: 5896
num_wins: 3209
celo_rating: 1175.11
propriety_score: 0.0
propriety_total_count: 0.0
submission_type: basic
model_architecture: MistralForCausalLM
model_num_parameters: 7241732096.0
best_of: 16
max_input_tokens: 512
max_output_tokens: 64
display_name: lex-hue-delexa-7b_v1
ineligible_reason: propriety_total_count < 800
language_model: lex-hue/Delexa-7b
model_size: 7B
reward_model: ChaiML/reward_gpt2_medium_preference_24m_e2
us_pacific_date: 2024-04-15
win_ratio: 0.5442672998643148
preference_data_url: None
Resubmit model
Running pipeline stage MKMLizer
Starting job with name lex-hue-delexa-7b-v1-mkmlizer
Waiting for job on lex-hue-delexa-7b-v1-mkmlizer to finish
lex-hue-delexa-7b-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
lex-hue-delexa-7b-v1-mkmlizer: ║ _____ __ __ ║
lex-hue-delexa-7b-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
lex-hue-delexa-7b-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
lex-hue-delexa-7b-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
lex-hue-delexa-7b-v1-mkmlizer: ║ /___/ ║
lex-hue-delexa-7b-v1-mkmlizer: ║ ║
lex-hue-delexa-7b-v1-mkmlizer: ║ Version: 0.6.11 ║
lex-hue-delexa-7b-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
lex-hue-delexa-7b-v1-mkmlizer: ║ ║
lex-hue-delexa-7b-v1-mkmlizer: ║ The license key for the current software has been verified as ║
lex-hue-delexa-7b-v1-mkmlizer: ║ belonging to: ║
lex-hue-delexa-7b-v1-mkmlizer: ║ ║
lex-hue-delexa-7b-v1-mkmlizer: ║ Chai Research Corp. ║
lex-hue-delexa-7b-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
lex-hue-delexa-7b-v1-mkmlizer: ║ Expiration: 2024-07-15 23:59:59 ║
lex-hue-delexa-7b-v1-mkmlizer: ║ ║
lex-hue-delexa-7b-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
lex-hue-delexa-7b-v1-mkmlizer: .gitattributes: 0%| | 0.00/1.52k [00:00<?, ?B/s] .gitattributes: 100%|██████████| 1.52k/1.52k [00:00<00:00, 17.0MB/s]
lex-hue-delexa-7b-v1-mkmlizer: README.md: 0%| | 0.00/5.88k [00:00<?, ?B/s] README.md: 100%|██████████| 5.88k/5.88k [00:00<00:00, 48.5MB/s]
lex-hue-delexa-7b-v1-mkmlizer: config.json: 0%| | 0.00/954 [00:00<?, ?B/s] config.json: 100%|██████████| 954/954 [00:00<00:00, 11.1MB/s]
lex-hue-delexa-7b-v1-mkmlizer: configuration_mistral.py: 0%| | 0.00/8.90k [00:00<?, ?B/s] configuration_mistral.py: 100%|██████████| 8.90k/8.90k [00:00<00:00, 95.7MB/s]
lex-hue-delexa-7b-v1-mkmlizer: model-00001-of-00002.safetensors: 0%| | 0.00/9.94G [00:00<?, ?B/s] model-00001-of-00002.safetensors: 0%| | 10.5M/9.94G [00:02<32:28, 5.10MB/s] model-00001-of-00002.safetensors: 0%| | 21.0M/9.94G [00:02<17:02, 9.70MB/s] model-00001-of-00002.safetensors: 0%| | 31.5M/9.94G [00:02<10:05, 16.4MB/s] model-00001-of-00002.safetensors: 0%| | 41.9M/9.94G [00:02<07:51, 21.0MB/s] model-00001-of-00002.safetensors: 2%|▏ | 189M/9.94G [00:02<00:59, 164MB/s] model-00001-of-00002.safetensors: 2%|▏ | 241M/9.94G [00:03<00:51, 189MB/s] model-00001-of-00002.safetensors: 3%|▎ | 283M/9.94G [00:03<00:58, 164MB/s] model-00001-of-00002.safetensors: 4%|▍ | 419M/9.94G [00:03<00:30, 315MB/s] model-00001-of-00002.safetensors: 10%|▉ | 954M/9.94G [00:03<00:08, 1.09GB/s] model-00001-of-00002.safetensors: 12%|█▏ | 1.16G/9.94G [00:03<00:07, 1.13GB/s] model-00001-of-00002.safetensors: 14%|█▎ | 1.35G/9.94G [00:04<00:08, 1.04GB/s] model-00001-of-00002.safetensors: 15%|█▌ | 1.51G/9.94G [00:04<00:11, 705MB/s] model-00001-of-00002.safetensors: 17%|█▋ | 1.68G/9.94G [00:04<00:09, 836MB/s] model-00001-of-00002.safetensors: 18%|█▊ | 1.81G/9.94G [00:04<00:09, 821MB/s] model-00001-of-00002.safetensors: 19%|█▉ | 1.93G/9.94G [00:05<00:10, 778MB/s] model-00001-of-00002.safetensors: 21%|██▏ | 2.12G/9.94G [00:05<00:08, 960MB/s] model-00001-of-00002.safetensors: 23%|██▎ | 2.32G/9.94G [00:05<00:06, 1.16GB/s] model-00001-of-00002.safetensors: 26%|██▋ | 2.61G/9.94G [00:05<00:04, 1.53GB/s] model-00001-of-00002.safetensors: 29%|██▉ | 2.88G/9.94G [00:05<00:04, 1.57GB/s] model-00001-of-00002.safetensors: 31%|███ | 3.06G/9.94G [00:05<00:06, 1.05GB/s] model-00001-of-00002.safetensors: 32%|███▏ | 3.21G/9.94G [00:06<00:07, 904MB/s] model-00001-of-00002.safetensors: 34%|███▎ | 3.36G/9.94G [00:06<00:07, 878MB/s] model-00001-of-00002.safetensors: 35%|███▍ | 3.47G/9.94G [00:06<00:07, 863MB/s] model-00001-of-00002.safetensors: 36%|███▌ | 3.58G/9.94G [00:06<00:08, 714MB/s] model-00001-of-00002.safetensors: 37%|███▋ | 3.68G/9.94G [00:06<00:08, 749MB/s] model-00001-of-00002.safetensors: 38%|███▊ | 3.77G/9.94G [00:06<00:08, 761MB/s] model-00001-of-00002.safetensors: 39%|███▉ | 3.86G/9.94G [00:06<00:07, 767MB/s] model-00001-of-00002.safetensors: 40%|███▉ | 3.94G/9.94G [00:07<00:08, 717MB/s] model-00001-of-00002.safetensors: 41%|████ | 4.04G/9.94G [00:07<00:07, 752MB/s] model-00001-of-00002.safetensors: 42%|████▏ | 4.19G/9.94G [00:07<00:06, 884MB/s] model-00001-of-00002.safetensors: 43%|████▎ | 4.29G/9.94G [00:07<00:06, 893MB/s] model-00001-of-00002.safetensors: 44%|████▍ | 4.38G/9.94G [00:07<00:06, 864MB/s] model-00001-of-00002.safetensors: 45%|████▌ | 4.48G/9.94G [00:07<00:06, 861MB/s] model-00001-of-00002.safetensors: 46%|████▌ | 4.57G/9.94G [00:07<00:07, 699MB/s] model-00001-of-00002.safetensors: 47%|████▋ | 4.66G/9.94G [00:07<00:07, 712MB/s] model-00001-of-00002.safetensors: 48%|████▊ | 4.74G/9.94G [00:08<00:07, 702MB/s] model-00001-of-00002.safetensors: 49%|████▊ | 4.84G/9.94G [00:08<00:06, 743MB/s] model-00001-of-00002.safetensors: 50%|████▉ | 4.96G/9.94G [00:08<00:06, 755MB/s] model-00001-of-00002.safetensors: 51%|█████ | 5.06G/9.94G [00:08<00:06, 801MB/s] model-00001-of-00002.safetensors: 52%|█████▏ | 5.16G/9.94G [00:08<00:05, 837MB/s] model-00001-of-00002.safetensors: 53%|█████▎ | 5.25G/9.94G [00:08<00:05, 793MB/s] model-00001-of-00002.safetensors: 54%|█████▎ | 5.34G/9.94G [00:08<00:05, 803MB/s] model-00001-of-00002.safetensors: 55%|█████▍ | 5.42G/9.94G [00:08<00:05, 760MB/s] model-00001-of-00002.safetensors: 55%|█████▌ | 5.51G/9.94G [00:09<00:06, 722MB/s] model-00001-of-00002.safetensors: 57%|█████▋ | 5.64G/9.94G [00:09<00:04, 876MB/s] model-00001-of-00002.safetensors: 58%|█████▊ | 5.74G/9.94G [00:09<00:06, 666MB/s] model-00001-of-00002.safetensors: 60%|█████▉ | 5.93G/9.94G [00:09<00:04, 959MB/s] model-00001-of-00002.safetensors: 61%|██████ | 6.05G/9.94G [00:09<00:04, 843MB/s] model-00001-of-00002.safetensors: 62%|██████▏ | 6.16G/9.94G [00:09<00:04, 848MB/s] model-00001-of-00002.safetensors: 63%|██████▎ | 6.25G/9.94G [00:09<00:04, 803MB/s] model-00001-of-00002.safetensors: 64%|██████▍ | 6.34G/9.94G [00:10<00:04, 792MB/s] model-00001-of-00002.safetensors: 65%|██████▍ | 6.46G/9.94G [00:10<00:04, 864MB/s] model-00001-of-00002.safetensors: 66%|██████▌ | 6.55G/9.94G [00:10<00:04, 808MB/s] model-00001-of-00002.safetensors: 67%|██████▋ | 6.65G/9.94G [00:10<00:04, 797MB/s] model-00001-of-00002.safetensors: 68%|██████▊ | 6.81G/9.94G [00:10<00:03, 975MB/s] model-00001-of-00002.safetensors: 69%|██████▉ | 6.91G/9.94G [00:10<00:03, 935MB/s] model-00001-of-00002.safetensors: 71%|███████ | 7.04G/9.94G [00:10<00:02, 989MB/s] model-00001-of-00002.safetensors: 72%|███████▏ | 7.14G/9.94G [00:10<00:02, 948MB/s] model-00001-of-00002.safetensors: 73%|███████▎ | 7.25G/9.94G [00:11<00:03, 860MB/s] model-00001-of-00002.safetensors: 74%|███████▍ | 7.34G/9.94G [00:11<00:03, 853MB/s] model-00001-of-00002.safetensors: 75%|███████▌ | 7.49G/9.94G [00:11<00:02, 1.01GB/s] model-00001-of-00002.safetensors: 77%|███████▋ | 7.64G/9.94G [00:11<00:02, 1.13GB/s] model-00001-of-00002.safetensors: 78%|███████▊ | 7.77G/9.94G [00:11<00:02, 994MB/s] model-00001-of-00002.safetensors: 79%|███████▉ | 7.90G/9.94G [00:11<00:02, 1.02GB/s] model-00001-of-00002.safetensors: 81%|████████ | 8.02G/9.94G [00:11<00:01, 1.08GB/s] model-00001-of-00002.safetensors: 82%|████████▏ | 8.14G/9.94G [00:12<00:02, 805MB/s] model-00001-of-00002.safetensors: 83%|████████▎ | 8.23G/9.94G [00:12<00:02, 774MB/s] model-00001-of-00002.safetensors: 85%|████████▌ | 8.46G/9.94G [00:12<00:01, 1.09GB/s] model-00001-of-00002.safetensors: 86%|████████▋ | 8.59G/9.94G [00:12<00:01, 970MB/s] model-00001-of-00002.safetensors: 88%|████████▊ | 8.70G/9.94G [00:12<00:01, 916MB/s] model-00001-of-00002.safetensors: 89%|████████▉ | 8.89G/9.94G [00:12<00:00, 1.13GB/s] model-00001-of-00002.safetensors: 91%|█████████ | 9.04G/9.94G [00:12<00:00, 1.20GB/s] model-00001-of-00002.safetensors: 92%|█████████▏| 9.18G/9.94G [00:12<00:00, 1.15GB/s] model-00001-of-00002.safetensors: 94%|█████████▎| 9.30G/9.94G [00:13<00:00, 980MB/s] model-00001-of-00002.safetensors: 96%|█████████▌| 9.51G/9.94G [00:13<00:00, 1.23GB/s] model-00001-of-00002.safetensors: 99%|█████████▉| 9.89G/9.94G [00:13<00:00, 1.69GB/s] model-00001-of-00002.safetensors: 100%|█████████▉| 9.94G/9.94G [00:16<00:00, 585MB/s]
lex-hue-delexa-7b-v1-mkmlizer: model-00002-of-00002.safetensors: 0%| | 0.00/4.54G [00:00<?, ?B/s] model-00002-of-00002.safetensors: 0%| | 10.5M/4.54G [00:01<13:17, 5.68MB/s] model-00002-of-00002.safetensors: 1%| | 31.5M/4.54G [00:02<04:47, 15.7MB/s] model-00002-of-00002.safetensors: 2%|▏ | 94.4M/4.54G [00:02<01:16, 58.3MB/s] model-00002-of-00002.safetensors: 3%|▎ | 157M/4.54G [00:02<00:40, 109MB/s] model-00002-of-00002.safetensors: 4%|▍ | 199M/4.54G [00:03<00:55, 78.8MB/s] model-00002-of-00002.safetensors: 13%|█▎ | 608M/4.54G [00:03<00:09, 406MB/s] model-00002-of-00002.safetensors: 27%|██▋ | 1.24G/4.54G [00:03<00:03, 878MB/s] model-00002-of-00002.safetensors: 31%|███▏ | 1.43G/4.54G [00:04<00:05, 570MB/s] model-00002-of-00002.safetensors: 34%|███▍ | 1.56G/4.54G [00:04<00:04, 612MB/s] model-00002-of-00002.safetensors: 41%|████ | 1.86G/4.54G [00:04<00:03, 847MB/s] model-00002-of-00002.safetensors: 45%|████▍ | 2.02G/4.54G [00:04<00:02, 941MB/s] model-00002-of-00002.safetensors: 48%|████▊ | 2.19G/4.54G [00:05<00:02, 988MB/s] model-00002-of-00002.safetensors: 54%|█████▎ | 2.43G/4.54G [00:05<00:01, 1.22GB/s] model-00002-of-00002.safetensors: 58%|█████▊ | 2.61G/4.54G [00:05<00:01, 1.19GB/s] model-00002-of-00002.safetensors: 61%|██████ | 2.78G/4.54G [00:05<00:01, 1.26GB/s] model-00002-of-00002.safetensors: 65%|██████▍ | 2.95G/4.54G [00:05<00:01, 1.33GB/s] model-00002-of-00002.safetensors: 69%|██████▊ | 3.11G/4.54G [00:05<00:01, 1.31GB/s] model-00002-of-00002.safetensors: 72%|███████▏ | 3.26G/4.54G [00:05<00:01, 1.26GB/s] model-00002-of-00002.safetensors: 75%|███████▍ | 3.40G/4.54G [00:06<00:01, 816MB/s] model-00002-of-00002.safetensors: 79%|███████▉ | 3.60G/4.54G [00:06<00:00, 1.03GB/s] model-00002-of-00002.safetensors: 82%|████████▏ | 3.73G/4.54G [00:06<00:00, 943MB/s] model-00002-of-00002.safetensors: 85%|████████▍ | 3.86G/4.54G [00:06<00:00, 940MB/s] model-00002-of-00002.safetensors: 88%|████████▊ | 4.02G/4.54G [00:06<00:00, 1.03GB/s] model-00002-of-00002.safetensors: 91%|█████████ | 4.14G/4.54G [00:06<00:00, 959MB/s] model-00002-of-00002.safetensors: 100%|█████████▉| 4.54G/4.54G [00:07<00:00, 646MB/s]
lex-hue-delexa-7b-v1-mkmlizer: model.safetensors.index.json: 0%| | 0.00/22.8k [00:00<?, ?B/s] model.safetensors.index.json: 100%|██████████| 22.8k/22.8k [00:00<00:00, 28.6MB/s]
lex-hue-delexa-7b-v1-mkmlizer: modeling_mistral_yarn.py: 0%| | 0.00/67.3k [00:00<?, ?B/s] modeling_mistral_yarn.py: 100%|██████████| 67.3k/67.3k [00:00<00:00, 248MB/s]
lex-hue-delexa-7b-v1-mkmlizer: special_tokens_map.json: 0%| | 0.00/487 [00:00<?, ?B/s] special_tokens_map.json: 100%|██████████| 487/487 [00:00<00:00, 3.87MB/s]
lex-hue-delexa-7b-v1-mkmlizer: tokenizer.json: 0%| | 0.00/1.80M [00:00<?, ?B/s] tokenizer.json: 100%|██████████| 1.80M/1.80M [00:00<00:00, 18.6MB/s]
lex-hue-delexa-7b-v1-mkmlizer: tokenizer.model: 0%| | 0.00/493k [00:00<?, ?B/s] tokenizer.model: 100%|██████████| 493k/493k [00:00<00:00, 16.9MB/s]
lex-hue-delexa-7b-v1-mkmlizer: tokenizer_config.json: 0%| | 0.00/1.00k [00:00<?, ?B/s] tokenizer_config.json: 100%|██████████| 1.00k/1.00k [00:00<00:00, 8.26MB/s]
lex-hue-delexa-7b-v1-mkmlizer: Downloaded to shared memory in 26.062s
lex-hue-delexa-7b-v1-mkmlizer: quantizing model to /dev/shm/model_cache
lex-hue-delexa-7b-v1-mkmlizer: Saving mkml model at /dev/shm/model_cache
lex-hue-delexa-7b-v1-mkmlizer: Reading /tmp/tmp34btuyf6/model.safetensors.index.json
lex-hue-delexa-7b-v1-mkmlizer: Profiling: 0%| | 0/291 [00:00<?, ?it/s] Profiling: 0%| | 1/291 [00:01<06:27, 1.33s/it] Profiling: 4%|▍ | 12/291 [00:01<00:24, 11.26it/s] Profiling: 8%|▊ | 22/291 [00:01<00:12, 21.59it/s] Profiling: 11%|█ | 32/291 [00:01<00:07, 32.51it/s] Profiling: 16%|█▋ | 48/291 [00:01<00:04, 53.50it/s] Profiling: 20%|██ | 59/291 [00:01<00:03, 63.50it/s] Profiling: 26%|██▌ | 75/291 [00:01<00:02, 84.08it/s] Profiling: 30%|███ | 88/291 [00:02<00:02, 92.87it/s] Profiling: 35%|███▌ | 102/291 [00:02<00:01, 104.12it/s] Profiling: 40%|███▉ | 115/291 [00:02<00:01, 98.84it/s] Profiling: 44%|████▍ | 129/291 [00:02<00:01, 107.32it/s] Profiling: 49%|████▉ | 143/291 [00:02<00:01, 113.17it/s] Profiling: 54%|█████▎ | 156/291 [00:02<00:01, 116.71it/s] Profiling: 58%|█████▊ | 169/291 [00:02<00:01, 119.34it/s] Profiling: 63%|██████▎ | 182/291 [00:02<00:00, 115.73it/s] Profiling: 67%|██████▋ | 196/291 [00:02<00:00, 117.94it/s] Profiling: 72%|███████▏ | 209/291 [00:04<00:04, 19.52it/s] Profiling: 76%|███████▌ | 221/291 [00:05<00:02, 25.22it/s] Profiling: 80%|████████ | 234/291 [00:05<00:01, 33.29it/s] Profiling: 85%|████████▍ | 247/291 [00:05<00:01, 42.74it/s] Profiling: 89%|████████▊ | 258/291 [00:05<00:00, 49.78it/s] Profiling: 94%|█████████▍| 274/291 [00:05<00:00, 64.56it/s] Profiling: 98%|█████████▊| 286/291 [00:05<00:00, 71.13it/s] Profiling: 100%|██████████| 291/291 [00:05<00:00, 49.33it/s]
lex-hue-delexa-7b-v1-mkmlizer: quantized model in 17.439s
lex-hue-delexa-7b-v1-mkmlizer: Processed model lex-hue/Delexa-7b in 44.472s
lex-hue-delexa-7b-v1-mkmlizer: creating bucket guanaco-mkml-models
lex-hue-delexa-7b-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
lex-hue-delexa-7b-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/lex-hue-delexa-7b-v1
lex-hue-delexa-7b-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/lex-hue-delexa-7b-v1/tokenizer_config.json
lex-hue-delexa-7b-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/lex-hue-delexa-7b-v1/config.json
lex-hue-delexa-7b-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/lex-hue-delexa-7b-v1/special_tokens_map.json
lex-hue-delexa-7b-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/lex-hue-delexa-7b-v1/tokenizer.json
lex-hue-delexa-7b-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.model s3://guanaco-mkml-models/lex-hue-delexa-7b-v1/tokenizer.model
lex-hue-delexa-7b-v1-mkmlizer: cp /dev/shm/model_cache/mkml_model.tensors s3://guanaco-mkml-models/lex-hue-delexa-7b-v1/mkml_model.tensors
lex-hue-delexa-7b-v1-mkmlizer: loading reward model from ChaiML/reward_gpt2_medium_preference_24m_e2
lex-hue-delexa-7b-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py:1067: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
lex-hue-delexa-7b-v1-mkmlizer: warnings.warn(
lex-hue-delexa-7b-v1-mkmlizer: config.json: 0%| | 0.00/1.05k [00:00<?, ?B/s] config.json: 100%|██████████| 1.05k/1.05k [00:00<00:00, 8.44MB/s]
lex-hue-delexa-7b-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py:690: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
lex-hue-delexa-7b-v1-mkmlizer: warnings.warn(
lex-hue-delexa-7b-v1-mkmlizer: tokenizer_config.json: 0%| | 0.00/234 [00:00<?, ?B/s] tokenizer_config.json: 100%|██████████| 234/234 [00:00<00:00, 2.91MB/s]
lex-hue-delexa-7b-v1-mkmlizer: vocab.json: 0%| | 0.00/1.04M [00:00<?, ?B/s] vocab.json: 100%|██████████| 1.04M/1.04M [00:00<00:00, 10.4MB/s]
lex-hue-delexa-7b-v1-mkmlizer: tokenizer.json: 0%| | 0.00/2.11M [00:00<?, ?B/s] tokenizer.json: 100%|██████████| 2.11M/2.11M [00:00<00:00, 21.0MB/s]
lex-hue-delexa-7b-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:472: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
lex-hue-delexa-7b-v1-mkmlizer: warnings.warn(
lex-hue-delexa-7b-v1-mkmlizer: pytorch_model.bin: 0%| | 0.00/1.44G [00:00<?, ?B/s] pytorch_model.bin: 1%|▏ | 21.0M/1.44G [00:00<00:08, 173MB/s] pytorch_model.bin: 10%|█ | 147M/1.44G [00:00<00:01, 679MB/s] pytorch_model.bin: 19%|█▉ | 273M/1.44G [00:00<00:01, 878MB/s] pytorch_model.bin: 30%|██▉ | 430M/1.44G [00:00<00:00, 1.09GB/s] pytorch_model.bin: 41%|████▏ | 598M/1.44G [00:00<00:00, 1.26GB/s] pytorch_model.bin: 51%|█████ | 734M/1.44G [00:00<00:00, 1.07GB/s] pytorch_model.bin: 60%|█████▉ | 860M/1.44G [00:00<00:00, 1.12GB/s] pytorch_model.bin: 70%|███████ | 1.01G/1.44G [00:00<00:00, 1.23GB/s] pytorch_model.bin: 83%|████████▎ | 1.20G/1.44G [00:01<00:00, 1.41GB/s] pytorch_model.bin: 99%|█████████▉| 1.43G/1.44G [00:01<00:00, 1.40GB/s] pytorch_model.bin: 100%|█████████▉| 1.44G/1.44G [00:05<00:00, 272MB/s]
lex-hue-delexa-7b-v1-mkmlizer: Saving model to /tmp/reward_cache/reward.tensors
lex-hue-delexa-7b-v1-mkmlizer: Saving duration: 0.303s
lex-hue-delexa-7b-v1-mkmlizer: Processed model ChaiML/reward_gpt2_medium_preference_24m_e2 in 9.426s
lex-hue-delexa-7b-v1-mkmlizer: creating bucket guanaco-reward-models
lex-hue-delexa-7b-v1-mkmlizer: Bucket 's3://guanaco-reward-models/' created
lex-hue-delexa-7b-v1-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/lex-hue-delexa-7b-v1_reward
lex-hue-delexa-7b-v1-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/lex-hue-delexa-7b-v1_reward/config.json
lex-hue-delexa-7b-v1-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/lex-hue-delexa-7b-v1_reward/tokenizer_config.json
lex-hue-delexa-7b-v1-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/lex-hue-delexa-7b-v1_reward/special_tokens_map.json
lex-hue-delexa-7b-v1-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/lex-hue-delexa-7b-v1_reward/vocab.json
lex-hue-delexa-7b-v1-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/lex-hue-delexa-7b-v1_reward/merges.txt
lex-hue-delexa-7b-v1-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/lex-hue-delexa-7b-v1_reward/tokenizer.json
lex-hue-delexa-7b-v1-mkmlizer: cp /tmp/reward_cache/reward.tensors s3://guanaco-reward-models/lex-hue-delexa-7b-v1_reward/reward.tensors
Job lex-hue-delexa-7b-v1-mkmlizer completed after 74.29s with status: succeeded
Stopping job with name lex-hue-delexa-7b-v1-mkmlizer
Pipeline stage MKMLizer completed in 79.37s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.10s
Running pipeline stage ISVCDeployer
Creating inference service lex-hue-delexa-7b-v1
Waiting for inference service lex-hue-delexa-7b-v1 to be ready
Inference service lex-hue-delexa-7b-v1 ready after 50.60946083068848s
Pipeline stage ISVCDeployer completed in 58.24s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.7288265228271484s
Received healthy response to inference request in 3.2375001907348633s
Received healthy response to inference request in 1.204747200012207s
Received healthy response to inference request in 1.2559306621551514s
Received healthy response to inference request in 1.1924784183502197s
5 requests
0 failed requests
5th percentile: 1.1949321746826171
10th percentile: 1.1973859310150146
20th percentile: 1.2022934436798096
30th percentile: 1.2149838924407959
40th percentile: 1.2354572772979737
50th percentile: 1.2559306621551514
60th percentile: 1.44508900642395
70th percentile: 1.6342473506927488
80th percentile: 2.0305612564086917
90th percentile: 2.6340307235717777
95th percentile: 2.93576545715332
99th percentile: 3.177153244018555
mean time: 1.723896598815918
Pipeline stage StressChecker completed in 9.48s
Running pipeline stage DaemonicModelEvalScorer
Pipeline stage DaemonicModelEvalScorer completed in 0.04s
Running pipeline stage DaemonicSafetyScorer
Running M-Eval for topic stay_in_character
Pipeline stage DaemonicSafetyScorer completed in 0.04s
M-Eval Dataset for topic stay_in_character is loaded
lex-hue-delexa-7b_v1 status is now deployed due to DeploymentManager action
lex-hue-delexa-7b_v1 status is now inactive due to auto deactivation removed underperforming models
admin requested tearing down of lex-hue-delexa-7b_v1
Running pipeline stage ISVCDeleter
Checking if service lex-hue-delexa-7b-v1 is running
Tearing down inference service lex-hue-delexa-7b-v1
Toredown service lex-hue-delexa-7b-v1
Pipeline stage ISVCDeleter completed in 4.08s
Running pipeline stage MKMLModelDeleter
Cleaning model data from S3
Cleaning model data from model cache
Deleting key lex-hue-delexa-7b-v1/config.json from bucket guanaco-mkml-models
Deleting key lex-hue-delexa-7b-v1/mkml_model.tensors from bucket guanaco-mkml-models
Deleting key lex-hue-delexa-7b-v1/special_tokens_map.json from bucket guanaco-mkml-models
Deleting key lex-hue-delexa-7b-v1/tokenizer.json from bucket guanaco-mkml-models
Deleting key lex-hue-delexa-7b-v1/tokenizer.model from bucket guanaco-mkml-models
Deleting key lex-hue-delexa-7b-v1/tokenizer_config.json from bucket guanaco-mkml-models
Cleaning model data from model cache
Deleting key lex-hue-delexa-7b-v1_reward/config.json from bucket guanaco-reward-models
Deleting key lex-hue-delexa-7b-v1_reward/merges.txt from bucket guanaco-reward-models
Deleting key lex-hue-delexa-7b-v1_reward/reward.tensors from bucket guanaco-reward-models
Deleting key lex-hue-delexa-7b-v1_reward/special_tokens_map.json from bucket guanaco-reward-models
Deleting key lex-hue-delexa-7b-v1_reward/tokenizer.json from bucket guanaco-reward-models
Deleting key lex-hue-delexa-7b-v1_reward/tokenizer_config.json from bucket guanaco-reward-models
Deleting key lex-hue-delexa-7b-v1_reward/vocab.json from bucket guanaco-reward-models
Pipeline stage MKMLModelDeleter completed in 2.58s
lex-hue-delexa-7b_v1 status is now torndown due to DeploymentManager action

Usage Metrics

Latency Metrics