submission_id: cgato-thespis-7b-v0-2-sfttest_v5
developer_uid: chaiwill
status: inactive
model_repo: cgato/Thespis-7b-v0.2-SFTTest-3Epoch
reward_repo: rirv938/reward_gpt2_medium_preference_24m_e2
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:'}
reward_formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:'}
timestamp: 2024-03-05T22:32:01+00:00
model_name: cgato-thespis-7b-v0-2-sfttest_v5
model_eval_status: success
safety_score: 0.75
entertaining: 6.94
stay_in_character: 8.46
user_preference: 7.34
double_thumbs_up: 2047
thumbs_up: 3215
thumbs_down: 1325
num_battles: 112843
num_wins: 59525
win_ratio: 0.5275028136437351
celo_rating: 1177.12
Resubmit model
Running pipeline stage MKMLizer
Starting job with name cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer
Waiting for job on cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer to finish
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: ║ _____ __ __ ║
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: ║ /___/ ║
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: ║ ║
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: ║ Version: 0.6.11 ║
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: ║ ║
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: ║ The license key for the current software has been verified as ║
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: ║ belonging to: ║
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: ║ ║
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: ║ Chai Research Corp. ║
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: ║ Expiration: 2024-04-15 23:59:59 ║
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: ║ ║
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: .gitattributes: 0%| | 0.00/1.52k [00:00<?, ?B/s] .gitattributes: 100%|██████████| 1.52k/1.52k [00:00<00:00, 15.5MB/s]
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: README.md: 0%| | 0.00/1.15k [00:00<?, ?B/s] README.md: 100%|██████████| 1.15k/1.15k [00:00<00:00, 13.0MB/s]
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: config.json: 0%| | 0.00/648 [00:00<?, ?B/s] config.json: 100%|██████████| 648/648 [00:00<00:00, 4.55MB/s]
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: generation_config.json: 0%| | 0.00/137 [00:00<?, ?B/s] generation_config.json: 100%|██████████| 137/137 [00:00<00:00, 2.25MB/s]
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: model-00001-of-00003.safetensors: 0%| | 0.00/4.94G [00:00<?, ?B/s] model-00001-of-00003.safetensors: 0%| | 10.5M/4.94G [00:00<01:09, 71.4MB/s] model-00001-of-00003.safetensors: 2%|▏ | 105M/4.94G [00:00<00:13, 352MB/s] model-00001-of-00003.safetensors: 5%|▌ | 252M/4.94G [00:00<00:06, 715MB/s] model-00001-of-00003.safetensors: 8%|▊ | 388M/4.94G [00:00<00:05, 903MB/s] model-00001-of-00003.safetensors: 10%|█ | 503M/4.94G [00:00<00:04, 963MB/s] model-00001-of-00003.safetensors: 13%|█▎ | 640M/4.94G [00:00<00:04, 1.07GB/s] model-00001-of-00003.safetensors: 15%|█▌ | 755M/4.94G [00:00<00:03, 1.06GB/s] model-00001-of-00003.safetensors: 19%|█▉ | 954M/4.94G [00:00<00:03, 1.28GB/s] model-00001-of-00003.safetensors: 26%|██▌ | 1.28G/4.94G [00:01<00:01, 1.84GB/s] model-00001-of-00003.safetensors: 32%|███▏ | 1.60G/4.94G [00:01<00:01, 2.23GB/s] model-00001-of-00003.safetensors: 37%|███▋ | 1.85G/4.94G [00:01<00:01, 2.27GB/s] model-00001-of-00003.safetensors: 42%|████▏ | 2.09G/4.94G [00:01<00:01, 2.21GB/s] model-00001-of-00003.safetensors: 49%|████▊ | 2.40G/4.94G [00:01<00:01, 2.48GB/s] model-00001-of-00003.safetensors: 56%|█████▌ | 2.77G/4.94G [00:01<00:00, 2.81GB/s] model-00001-of-00003.safetensors: 62%|██████▏ | 3.06G/4.94G [00:01<00:00, 2.21GB/s] model-00001-of-00003.safetensors: 67%|██████▋ | 3.31G/4.94G [00:02<00:01, 1.48GB/s] model-00001-of-00003.safetensors: 71%|███████ | 3.51G/4.94G [00:02<00:00, 1.46GB/s] model-00001-of-00003.safetensors: 75%|███████▍ | 3.69G/4.94G [00:02<00:00, 1.36GB/s] model-00001-of-00003.safetensors: 78%|███████▊ | 3.85G/4.94G [00:02<00:00, 1.27GB/s] model-00001-of-00003.safetensors: 82%|████████▏ | 4.05G/4.94G [00:02<00:00, 1.41GB/s] model-00001-of-00003.safetensors: 85%|████████▌ | 4.21G/4.94G [00:02<00:00, 1.32GB/s] model-00001-of-00003.safetensors: 88%|████████▊ | 4.36G/4.94G [00:03<00:00, 812MB/s] model-00001-of-00003.safetensors: 90%|█████████ | 4.47G/4.94G [00:03<00:00, 867MB/s] model-00001-of-00003.safetensors: 93%|█████████▎| 4.59G/4.94G [00:03<00:00, 850MB/s] model-00001-of-00003.safetensors: 100%|█████████▉| 4.94G/4.94G [00:05<00:00, 944MB/s]
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: model-00002-of-00003.safetensors: 0%| | 0.00/5.00G [00:00<?, ?B/s] model-00002-of-00003.safetensors: 0%| | 10.5M/5.00G [00:00<01:19, 62.8MB/s] model-00002-of-00003.safetensors: 1%|▏ | 73.4M/5.00G [00:00<00:15, 323MB/s] model-00002-of-00003.safetensors: 6%|▌ | 294M/5.00G [00:00<00:04, 1.07GB/s] model-00002-of-00003.safetensors: 9%|▊ | 430M/5.00G [00:00<00:03, 1.17GB/s] model-00002-of-00003.safetensors: 11%|█▏ | 566M/5.00G [00:00<00:04, 978MB/s] model-00002-of-00003.safetensors: 15%|█▍ | 744M/5.00G [00:00<00:03, 1.18GB/s] model-00002-of-00003.safetensors: 20%|█▉ | 996M/5.00G [00:00<00:02, 1.55GB/s] model-00002-of-00003.safetensors: 27%|██▋ | 1.33G/5.00G [00:00<00:01, 2.01GB/s] model-00002-of-00003.safetensors: 33%|███▎ | 1.67G/5.00G [00:01<00:01, 2.36GB/s] model-00002-of-00003.safetensors: 39%|███▊ | 1.93G/5.00G [00:01<00:01, 2.39GB/s] model-00002-of-00003.safetensors: 46%|████▌ | 2.28G/5.00G [00:01<00:01, 2.69GB/s] model-00002-of-00003.safetensors: 51%|█████ | 2.56G/5.00G [00:01<00:00, 2.53GB/s] model-00002-of-00003.safetensors: 56%|█████▋ | 2.82G/5.00G [00:01<00:01, 1.67GB/s] model-00002-of-00003.safetensors: 61%|██████ | 3.03G/5.00G [00:02<00:01, 1.21GB/s] model-00002-of-00003.safetensors: 64%|██████▍ | 3.20G/5.00G [00:02<00:01, 1.04GB/s] model-00002-of-00003.safetensors: 67%|██████▋ | 3.33G/5.00G [00:02<00:01, 1.05GB/s] model-00002-of-00003.safetensors: 69%|██████▉ | 3.47G/5.00G [00:02<00:01, 1.06GB/s] model-00002-of-00003.safetensors: 72%|███████▏ | 3.60G/5.00G [00:02<00:01, 1.09GB/s] model-00002-of-00003.safetensors: 74%|███████▍ | 3.72G/5.00G [00:02<00:01, 1.10GB/s] model-00002-of-00003.safetensors: 78%|███████▊ | 3.91G/5.00G [00:02<00:00, 1.28GB/s] model-00002-of-00003.safetensors: 83%|████████▎ | 4.13G/5.00G [00:02<00:00, 1.48GB/s] model-00002-of-00003.safetensors: 87%|████████▋ | 4.35G/5.00G [00:03<00:00, 1.64GB/s] model-00002-of-00003.safetensors: 91%|█████████ | 4.53G/5.00G [00:03<00:00, 1.55GB/s] model-00002-of-00003.safetensors: 97%|█████████▋| 4.87G/5.00G [00:03<00:00, 2.04GB/s] model-00002-of-00003.safetensors: 100%|█████████▉| 5.00G/5.00G [00:03<00:00, 1.31GB/s]
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: model-00003-of-00003.safetensors: 0%| | 0.00/4.54G [00:00<?, ?B/s] model-00003-of-00003.safetensors: 0%| | 10.5M/4.54G [00:00<00:54, 82.7MB/s] model-00003-of-00003.safetensors: 2%|▏ | 94.4M/4.54G [00:00<00:10, 425MB/s] model-00003-of-00003.safetensors: 3%|▎ | 147M/4.54G [00:00<00:09, 463MB/s] model-00003-of-00003.safetensors: 6%|▌ | 283M/4.54G [00:00<00:05, 772MB/s] model-00003-of-00003.safetensors: 9%|▉ | 419M/4.54G [00:00<00:04, 916MB/s] model-00003-of-00003.safetensors: 12%|█▏ | 566M/4.54G [00:00<00:03, 1.02GB/s] model-00003-of-00003.safetensors: 17%|█▋ | 755M/4.54G [00:00<00:03, 1.25GB/s] model-00003-of-00003.safetensors: 25%|██▍ | 1.12G/4.54G [00:00<00:01, 1.95GB/s] model-00003-of-00003.safetensors: 32%|███▏ | 1.46G/4.54G [00:01<00:01, 2.35GB/s] model-00003-of-00003.safetensors: 38%|███▊ | 1.73G/4.54G [00:01<00:01, 2.44GB/s] model-00003-of-00003.safetensors: 45%|████▍ | 2.03G/4.54G [00:01<00:01, 2.44GB/s] model-00003-of-00003.safetensors: 50%|█████ | 2.29G/4.54G [00:01<00:00, 2.46GB/s] model-00003-of-00003.safetensors: 58%|█████▊ | 2.64G/4.54G [00:01<00:00, 2.60GB/s] model-00003-of-00003.safetensors: 64%|██████▍ | 2.90G/4.54G [00:01<00:00, 1.98GB/s] model-00003-of-00003.safetensors: 69%|██████▉ | 3.12G/4.54G [00:01<00:00, 1.62GB/s] model-00003-of-00003.safetensors: 73%|███████▎ | 3.31G/4.54G [00:02<00:00, 1.51GB/s] model-00003-of-00003.safetensors: 77%|███████▋ | 3.48G/4.54G [00:02<00:00, 1.33GB/s] model-00003-of-00003.safetensors: 80%|███████▉ | 3.63G/4.54G [00:02<00:00, 1.28GB/s] model-00003-of-00003.safetensors: 83%|████████▎ | 3.76G/4.54G [00:02<00:00, 1.02GB/s] model-00003-of-00003.safetensors: 85%|████████▌ | 3.88G/4.54G [00:02<00:00, 761MB/s] model-00003-of-00003.safetensors: 88%|████████▊ | 3.97G/4.54G [00:02<00:00, 784MB/s] model-00003-of-00003.safetensors: 90%|████████▉ | 4.07G/4.54G [00:03<00:00, 793MB/s] model-00003-of-00003.safetensors: 92%|█████████▏| 4.16G/4.54G [00:03<00:00, 683MB/s] model-00003-of-00003.safetensors: 100%|█████████▉| 4.54G/4.54G [00:03<00:00, 1.34GB/s]
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: model.safetensors.index.json: 0%| | 0.00/25.1k [00:00<?, ?B/s] model.safetensors.index.json: 100%|██████████| 25.1k/25.1k [00:00<00:00, 159MB/s]
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: pytorch_model.bin.index.json: 0%| | 0.00/23.9k [00:00<?, ?B/s] pytorch_model.bin.index.json: 100%|██████████| 23.9k/23.9k [00:00<00:00, 131MB/s]
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: special_tokens_map.json: 0%| | 0.00/437 [00:00<?, ?B/s] special_tokens_map.json: 100%|██████████| 437/437 [00:00<00:00, 5.27MB/s]
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: tokenizer.model: 0%| | 0.00/493k [00:00<?, ?B/s] tokenizer.model: 100%|██████████| 493k/493k [00:00<00:00, 53.6MB/s]
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: tokenizer_config.json: 0%| | 0.00/1.02k [00:00<?, ?B/s] tokenizer_config.json: 100%|██████████| 1.02k/1.02k [00:00<00:00, 10.5MB/s]
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: Downloaded to shared memory in 14.658s
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: quantizing model to /dev/shm/model_cache
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: Saving mkml model at /dev/shm/model_cache
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: Reading /tmp/tmpp1qqtgqh/model.safetensors.index.json
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: Profiling: 0%| | 0/291 [00:00<?, ?it/s] Profiling: 0%| | 1/291 [00:01<09:05, 1.88s/it] Profiling: 7%|▋ | 21/291 [00:01<00:18, 14.50it/s] Profiling: 15%|█▌ | 45/291 [00:02<00:07, 34.70it/s] Profiling: 22%|██▏ | 65/291 [00:02<00:04, 53.03it/s] Profiling: 31%|███ | 90/291 [00:02<00:02, 79.55it/s] Profiling: 38%|███▊ | 110/291 [00:02<00:02, 75.42it/s] Profiling: 45%|████▍ | 130/291 [00:02<00:01, 92.71it/s] Profiling: 51%|█████ | 149/291 [00:02<00:01, 109.12it/s] Profiling: 60%|█████▉ | 174/291 [00:02<00:00, 135.29it/s] Profiling: 67%|██████▋ | 194/291 [00:03<00:00, 145.92it/s] Profiling: 73%|███████▎ | 213/291 [00:04<00:02, 36.42it/s] Profiling: 81%|████████ | 235/291 [00:04<00:01, 49.68it/s] Profiling: 88%|████████▊ | 256/291 [00:04<00:00, 64.55it/s] Profiling: 95%|█████████▍| 276/291 [00:04<00:00, 79.42it/s] Profiling: 100%|██████████| 291/291 [00:05<00:00, 57.01it/s]
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: quantized model in 15.063s
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: Processed model cgato/Thespis-7b-v0.2-SFTTest-3Epoch in 30.575s
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: creating bucket guanaco-mkml-models
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/cgato-thespis-7b-v0-2-sfttest-v5
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/cgato-thespis-7b-v0-2-sfttest-v5/tokenizer_config.json
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: cp /dev/shm/model_cache/tokenizer.model s3://guanaco-mkml-models/cgato-thespis-7b-v0-2-sfttest-v5/tokenizer.model
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/cgato-thespis-7b-v0-2-sfttest-v5/special_tokens_map.json
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/cgato-thespis-7b-v0-2-sfttest-v5/tokenizer.json
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/cgato-thespis-7b-v0-2-sfttest-v5/config.json
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: cp /dev/shm/model_cache/mkml_model.tensors s3://guanaco-mkml-models/cgato-thespis-7b-v0-2-sfttest-v5/mkml_model.tensors
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: loading reward model from rirv938/reward_gpt2_medium_preference_24m_e2
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py:1067: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: warnings.warn(
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: config.json: 0%| | 0.00/1.05k [00:00<?, ?B/s] config.json: 100%|██████████| 1.05k/1.05k [00:00<00:00, 12.2MB/s]
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py:690: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: warnings.warn(
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: tokenizer_config.json: 0%| | 0.00/234 [00:00<?, ?B/s] tokenizer_config.json: 100%|██████████| 234/234 [00:00<00:00, 3.35MB/s]
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: vocab.json: 0%| | 0.00/1.04M [00:00<?, ?B/s] vocab.json: 100%|██████████| 1.04M/1.04M [00:00<00:00, 48.6MB/s]
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: tokenizer.json: 0%| | 0.00/2.11M [00:00<?, ?B/s] tokenizer.json: 100%|██████████| 2.11M/2.11M [00:00<00:00, 13.1MB/s] tokenizer.json: 100%|██████████| 2.11M/2.11M [00:00<00:00, 13.1MB/s]
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:472: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: warnings.warn(
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: pytorch_model.bin: 0%| | 0.00/1.44G [00:00<?, ?B/s] pytorch_model.bin: 1%| | 10.5M/1.44G [00:00<00:38, 36.9MB/s] pytorch_model.bin: 4%|▎ | 52.4M/1.44G [00:00<00:09, 149MB/s] pytorch_model.bin: 7%|▋ | 105M/1.44G [00:00<00:05, 259MB/s] pytorch_model.bin: 10%|█ | 147M/1.44G [00:00<00:04, 286MB/s] pytorch_model.bin: 23%|██▎ | 325M/1.44G [00:00<00:01, 709MB/s] pytorch_model.bin: 29%|██▉ | 419M/1.44G [00:00<00:01, 716MB/s] pytorch_model.bin: 37%|███▋ | 535M/1.44G [00:00<00:01, 799MB/s] pytorch_model.bin: 49%|████▊ | 703M/1.44G [00:01<00:00, 1.04GB/s] pytorch_model.bin: 78%|███████▊ | 1.12G/1.44G [00:01<00:00, 1.88GB/s] pytorch_model.bin: 100%|█████████▉| 1.44G/1.44G [00:01<00:00, 1.16GB/s]
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: Saving model to /tmp/reward_cache/reward.tensors
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: Saving duration: 0.228s
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: Processed model rirv938/reward_gpt2_medium_preference_24m_e2 in 4.949s
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: creating bucket guanaco-reward-models
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: Bucket 's3://guanaco-reward-models/' created
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/cgato-thespis-7b-v0-2-sfttest-v5_reward
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/cgato-thespis-7b-v0-2-sfttest-v5_reward/config.json
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/cgato-thespis-7b-v0-2-sfttest-v5_reward/special_tokens_map.json
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/cgato-thespis-7b-v0-2-sfttest-v5_reward/merges.txt
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/cgato-thespis-7b-v0-2-sfttest-v5_reward/vocab.json
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/cgato-thespis-7b-v0-2-sfttest-v5_reward/tokenizer_config.json
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/cgato-thespis-7b-v0-2-sfttest-v5_reward/tokenizer.json
cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer: cp /tmp/reward_cache/reward.tensors s3://guanaco-reward-models/cgato-thespis-7b-v0-2-sfttest-v5_reward/reward.tensors
Job cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer completed after 65.71s with status: succeeded
Stopping job with name cgato-thespis-7b-v0-2-sfttest-v5-mkmlizer
Pipeline stage MKMLizer completed in 70.42s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.29s
Running pipeline stage ISVCDeployer
Creating inference service cgato-thespis-7b-v0-2-sfttest-v5
Waiting for inference service cgato-thespis-7b-v0-2-sfttest-v5 to be ready
Inference service cgato-thespis-7b-v0-2-sfttest-v5 ready after 50.40709686279297s
Pipeline stage ISVCDeployer completed in 60.19s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.6795005798339844s
Received healthy response to inference request in 1.198026180267334s
Received healthy response to inference request in 1.23563814163208s
Received healthy response to inference request in 1.1974036693572998s
Received healthy response to inference request in 1.2044241428375244s
5 requests
0 failed requests
5th percentile: 1.1975281715393067
10th percentile: 1.1976526737213136
20th percentile: 1.197901678085327
30th percentile: 1.199305772781372
40th percentile: 1.2018649578094482
50th percentile: 1.2044241428375244
60th percentile: 1.2169097423553468
70th percentile: 1.229395341873169
80th percentile: 1.3244106292724611
90th percentile: 1.5019556045532227
95th percentile: 1.5907280921936033
99th percentile: 1.6617460823059083
mean time: 1.3029985427856445
Pipeline stage StressChecker completed in 7.61s
Running pipeline stage DaemonicModelEvalScorer
Pipeline stage DaemonicModelEvalScorer completed in 0.05s
Running pipeline stage DaemonicSafetyScorer
Running M-Eval for topic stay_in_character
Pipeline stage DaemonicSafetyScorer completed in 0.05s
M-Eval Dataset for topic stay_in_character is loaded
cgato-thespis-7b-v0-2-sfttest_v5 status is now inactive due to auto deactivation removed underperforming models

Usage Metrics

Latency Metrics