submission_id: undi95-bigl-7b_v1
developer_uid: Undi95
status: inactive
model_repo: Undi95/BigL-7B
reward_repo: ChaiML/reward_gpt2_medium_preference_24m_e2
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:'}
reward_formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:'}
timestamp: 2024-03-25T08:59:19+00:00
model_name: undi95-bigl-7b_v1
model_eval_status: pending
safety_score: 0.95
entertaining: None
stay_in_character: None
user_preference: None
double_thumbs_up: 455
thumbs_up: 561
thumbs_down: 323
num_battles: 67742
num_wins: 34445
win_ratio: 0.5084733252634998
celo_rating: 1163.31
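
The formatter templates above can be illustrated with a minimal sketch of how a prompt might be assembled from them. The template strings are copied from the `formatter` field; the function `build_prompt`, the concatenation order (memory, prompt, conversation turns, response prefix), and the example names and messages are assumptions for illustration, not the platform's confirmed implementation. Truncation to `max_input_tokens` is not modelled.

```python
# Template strings copied from the `formatter` field of this submission.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Join the templates into one model input (hypothetical helper).

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The ordering used here is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response prefix has no trailing newline, so the model continues
    # the bot's line and the '\n' stopping word ends the reply.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Invented example values, not taken from the submission.
example = build_prompt(
    bot_name="Aria",
    user_name="Sam",
    memory="A cheerful librarian.",
    prompt="Aria greets a visitor.",
    turns=[("user", "Hi there!")],
)
print(example)
```

Because `reward_formatter` is identical to `formatter` in this record, the same sketch would apply to the text passed to the reward model.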
Running pipeline stage MKMLizer
Starting job with name undi95-bigl-7b-v1-mkmlizer
Waiting for job on undi95-bigl-7b-v1-mkmlizer to finish
undi95-bigl-7b-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
undi95-bigl-7b-v1-mkmlizer: ║ _____ __ __ ║
undi95-bigl-7b-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
undi95-bigl-7b-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
undi95-bigl-7b-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
undi95-bigl-7b-v1-mkmlizer: ║ /___/ ║
undi95-bigl-7b-v1-mkmlizer: ║ ║
undi95-bigl-7b-v1-mkmlizer: ║ Version: 0.6.11 ║
undi95-bigl-7b-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
undi95-bigl-7b-v1-mkmlizer: ║ ║
undi95-bigl-7b-v1-mkmlizer: ║ The license key for the current software has been verified as ║
undi95-bigl-7b-v1-mkmlizer: ║ belonging to: ║
undi95-bigl-7b-v1-mkmlizer: ║ ║
undi95-bigl-7b-v1-mkmlizer: ║ Chai Research Corp. ║
undi95-bigl-7b-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
undi95-bigl-7b-v1-mkmlizer: ║ Expiration: 2024-04-15 23:59:59 ║
undi95-bigl-7b-v1-mkmlizer: ║ ║
undi95-bigl-7b-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
undi95-bigl-7b-v1-mkmlizer: .gitattributes: 100%|██████████| 1.52k/1.52k [00:00<00:00, 17.8MB/s]
undi95-bigl-7b-v1-mkmlizer: config.json: 100%|██████████| 653/653 [00:00<00:00, 5.37MB/s]
undi95-bigl-7b-v1-mkmlizer: model-00001-of-00002.safetensors: 100%|█████████▉| 9.94G/9.94G [00:10<00:00, 912MB/s]
undi95-bigl-7b-v1-mkmlizer: model-00002-of-00002.safetensors: 100%|█████████▉| 4.54G/4.54G [00:06<00:00, 699MB/s]
undi95-bigl-7b-v1-mkmlizer: model.safetensors.index.json: 100%|██████████| 22.8k/22.8k [00:00<00:00, 151MB/s]
undi95-bigl-7b-v1-mkmlizer: special_tokens_map.json: 100%|██████████| 414/414 [00:00<00:00, 5.38MB/s]
undi95-bigl-7b-v1-mkmlizer: tokenizer.json: 100%|██████████| 1.80M/1.80M [00:00<00:00, 9.04MB/s]
undi95-bigl-7b-v1-mkmlizer: tokenizer.model: 100%|██████████| 493k/493k [00:00<00:00, 53.3MB/s]
undi95-bigl-7b-v1-mkmlizer: tokenizer_config.json: 100%|██████████| 960/960 [00:00<00:00, 12.9MB/s]
undi95-bigl-7b-v1-mkmlizer: Downloaded to shared memory in 19.702s
undi95-bigl-7b-v1-mkmlizer: quantizing model to /dev/shm/model_cache
undi95-bigl-7b-v1-mkmlizer: Saving mkml model at /dev/shm/model_cache
undi95-bigl-7b-v1-mkmlizer: Reading /tmp/tmpzawnyx1u/model.safetensors.index.json
undi95-bigl-7b-v1-mkmlizer: Profiling: 100%|██████████| 291/291 [00:04<00:00, 65.78it/s]
undi95-bigl-7b-v1-mkmlizer: quantized model in 14.620s
undi95-bigl-7b-v1-mkmlizer: Processed model Undi95/BigL-7B in 35.138s
undi95-bigl-7b-v1-mkmlizer: creating bucket guanaco-mkml-models
undi95-bigl-7b-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
undi95-bigl-7b-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/undi95-bigl-7b-v1
undi95-bigl-7b-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/undi95-bigl-7b-v1/special_tokens_map.json
undi95-bigl-7b-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/undi95-bigl-7b-v1/config.json
undi95-bigl-7b-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/undi95-bigl-7b-v1/tokenizer_config.json
undi95-bigl-7b-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.model s3://guanaco-mkml-models/undi95-bigl-7b-v1/tokenizer.model
undi95-bigl-7b-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/undi95-bigl-7b-v1/tokenizer.json
undi95-bigl-7b-v1-mkmlizer: cp /dev/shm/model_cache/mkml_model.tensors s3://guanaco-mkml-models/undi95-bigl-7b-v1/mkml_model.tensors
undi95-bigl-7b-v1-mkmlizer: loading reward model from ChaiML/reward_gpt2_medium_preference_24m_e2
undi95-bigl-7b-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py:1067: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
undi95-bigl-7b-v1-mkmlizer: warnings.warn(
undi95-bigl-7b-v1-mkmlizer: config.json: 100%|██████████| 1.05k/1.05k [00:00<00:00, 10.1MB/s]
undi95-bigl-7b-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py:690: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
undi95-bigl-7b-v1-mkmlizer: warnings.warn(
undi95-bigl-7b-v1-mkmlizer: tokenizer_config.json: 100%|██████████| 234/234 [00:00<00:00, 2.79MB/s]
undi95-bigl-7b-v1-mkmlizer: vocab.json: 100%|██████████| 1.04M/1.04M [00:00<00:00, 15.1MB/s]
undi95-bigl-7b-v1-mkmlizer: tokenizer.json: 100%|██████████| 2.11M/2.11M [00:00<00:00, 36.9MB/s]
undi95-bigl-7b-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:472: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
undi95-bigl-7b-v1-mkmlizer: warnings.warn(
undi95-bigl-7b-v1-mkmlizer: pytorch_model.bin: 100%|█████████▉| 1.44G/1.44G [00:01<00:00, 926MB/s]
undi95-bigl-7b-v1-mkmlizer: Saving model to /tmp/reward_cache/reward.tensors
undi95-bigl-7b-v1-mkmlizer: Saving duration: 0.230s
undi95-bigl-7b-v1-mkmlizer: Processed model ChaiML/reward_gpt2_medium_preference_24m_e2 in 5.991s
undi95-bigl-7b-v1-mkmlizer: creating bucket guanaco-reward-models
undi95-bigl-7b-v1-mkmlizer: Bucket 's3://guanaco-reward-models/' created
undi95-bigl-7b-v1-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/undi95-bigl-7b-v1_reward
undi95-bigl-7b-v1-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/undi95-bigl-7b-v1_reward/config.json
undi95-bigl-7b-v1-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/undi95-bigl-7b-v1_reward/tokenizer_config.json
undi95-bigl-7b-v1-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/undi95-bigl-7b-v1_reward/special_tokens_map.json
undi95-bigl-7b-v1-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/undi95-bigl-7b-v1_reward/merges.txt
undi95-bigl-7b-v1-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/undi95-bigl-7b-v1_reward/vocab.json
undi95-bigl-7b-v1-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/undi95-bigl-7b-v1_reward/tokenizer.json
Job undi95-bigl-7b-v1-mkmlizer completed after 63.87s with status: succeeded
Stopping job with name undi95-bigl-7b-v1-mkmlizer
Pipeline stage MKMLizer completed in 67.51s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.12s
Running pipeline stage ISVCDeployer
Creating inference service undi95-bigl-7b-v1
Waiting for inference service undi95-bigl-7b-v1 to be ready
Inference service undi95-bigl-7b-v1 ready after 40.284982681274414s
Pipeline stage ISVCDeployer completed in 47.19s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.7854809761047363s
Received healthy response to inference request in 1.1564664840698242s
Received healthy response to inference request in 1.0207245349884033s
Received healthy response to inference request in 1.1743042469024658s
Received healthy response to inference request in 1.218320608139038s
5 requests
0 failed requests
5th percentile: 1.0478729248046874
10th percentile: 1.0750213146209717
20th percentile: 1.1293180942535401
30th percentile: 1.1600340366363526
40th percentile: 1.1671691417694092
50th percentile: 1.1743042469024658
60th percentile: 1.1919107913970948
70th percentile: 1.2095173358917237
80th percentile: 1.3317526817321779
90th percentile: 1.558616828918457
95th percentile: 1.6720489025115965
99th percentile: 1.7627945613861085
mean time: 1.2710593700408936
Pipeline stage StressChecker completed in 7.90s
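
The StressChecker percentiles and mean above are consistent with linear interpolation over the five logged response times. The sketch below reproduces them with the standard library only; the helper name `percentile` is hypothetical, and the interpolation rule is inferred from the logged numbers rather than confirmed from the pipeline's source.

```python
import math

# The five response times (seconds) logged by the StressChecker stage.
latencies = [
    1.7854809761047363,
    1.1564664840698242,
    1.0207245349884033,
    1.1743042469024658,
    1.218320608139038,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation
    between the two nearest order statistics."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


mean_time = sum(latencies) / len(latencies)

for q in (5, 50, 90, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only five samples, the upper percentiles are dominated by the single slowest request (about 1.79 s), so they say little about tail latency under real load.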
Running pipeline stage DaemonicModelEvalScorer
Pipeline stage DaemonicModelEvalScorer completed in 0.05s
Running pipeline stage DaemonicSafetyScorer
Running M-Eval for topic stay_in_character
Pipeline stage DaemonicSafetyScorer completed in 0.05s
M-Eval Dataset for topic stay_in_character is loaded
undi95-bigl-7b_v1 status is now inactive: auto-deactivation removed underperforming models
