submission_id: chaiml-llama-8b-pairwis_8189_v38
developer_uid: chai_backend_admin
best_of: 1
celo_rating: 1246.87
display_name: chaiml-llama-8b-pairwis_8189_v38
family_friendly_score: 0.5937985558906864
family_friendly_standard_error: 0.0057321372650887305
formatter: {'memory_template': '', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '', 'truncate_by_message': True}
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 64, 'best_of': 1, 'max_output_tokens': 1}
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/llama_8b_pairwise_64m_256_tokens_step_249984
max_input_tokens: 64
max_output_tokens: 1
model_architecture: LlamaForSequenceClassification
model_group: ChaiML/llama_8b_pairwise
model_name: chaiml-llama-8b-pairwis_8189_v38
model_num_parameters: 8030261248.0
model_repo: ChaiML/llama_8b_pairwise_64m_256_tokens_step_249984
model_size: 8B
num_battles: 15333
num_wins: 7528
ranking_group: single
status: torndown
submission_type: basic
timestamp: 2024-10-14T14:14:54+00:00
us_pacific_date: 2024-10-14
win_ratio: 0.4909671949390204
Download Preference Data
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-llama-8b-pairwis-8189-v38-mkmlizer
Waiting for job on chaiml-llama-8b-pairwis-8189-v38-mkmlizer to finish
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: ║ _____ __ __ ║
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: ║ /___/ ║
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: ║ ║
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: ║ Version: 0.11.12 ║
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: ║ https://mk1.ai ║
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: ║ ║
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: ║ belonging to: ║
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: ║ ║
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: ║ Chai Research Corp. ║
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: ║ ║
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: Downloaded to shared memory in 21.347s
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: quantizing model to /dev/shm/model_cache, profile:t0, folder:/tmp/tmp79ykddv9, device:0
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: quantized model in 86.750s
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: Processed model ChaiML/llama_8b_pairwise_64m_256_tokens_step_249984 in 108.098s
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: creating bucket guanaco-mkml-models
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-llama-8b-pairwis-8189-v38
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-llama-8b-pairwis-8189-v38/special_tokens_map.json
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-llama-8b-pairwis-8189-v38/config.json
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-llama-8b-pairwis-8189-v38/tokenizer_config.json
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-llama-8b-pairwis-8189-v38/tokenizer.json
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-llama-8b-pairwis-8189-v38/flywheel_model.0.safetensors
chaiml-llama-8b-pairwis-8189-v38-mkmlizer: Loading 0: 0%| | 0/291 [00:00<?, ?it/s] Loading 0: 1%| | 3/291 [00:00<00:57, 4.97it/s] Loading 0: 1%|▏ | 4/291 [00:01<01:33, 3.06it/s] Loading 0: 2%|▏ | 5/291 [00:01<02:05, 2.28it/s] Loading 0: 3%|▎ | 8/291 [00:02<01:04, 4.37it/s] Loading 0: 3%|▎ | 9/291 [00:02<01:04, 4.40it/s] Loading 0: 3%|▎ | 10/291 [00:02<00:55, 5.02it/s] Loading 0: 4%|▍ | 12/291 [00:03<01:06, 4.19it/s] Loading 0: 4%|▍ | 13/291 [00:03<01:28, 3.16it/s] Loading 0: 5%|▍ | 14/291 [00:04<01:51, 2.49it/s] Loading 0: 6%|▌ | 17/291 [00:04<01:04, 4.28it/s] Loading 0: 6%|▌ | 18/291 [00:04<01:00, 4.52it/s] Loading 0: 7%|▋ | 19/291 [00:04<00:53, 5.09it/s] Loading 0: 7%|▋ | 21/291 [00:05<01:03, 4.22it/s] Loading 0: 8%|▊ | 22/291 [00:05<01:24, 3.18it/s] Loading 0: 8%|▊ | 23/291 [00:06<01:47, 2.49it/s] Loading 0: 9%|▉ | 26/291 [00:06<01:02, 4.24it/s] Loading 0: 9%|▉ | 27/291 [00:07<00:59, 4.45it/s] Loading 0: 10%|▉ | 28/291 [00:07<00:53, 4.96it/s] Loading 0: 10%|█ | 30/291 [00:07<01:01, 4.21it/s] Loading 0: 11%|█ | 31/291 [00:08<01:21, 3.19it/s] Loading 0: 11%|█ | 32/291 [00:09<01:42, 2.53it/s] Loading 0: 12%|█▏ | 35/291 [00:09<00:59, 4.31it/s] Loading 0: 12%|█▏ | 36/291 [00:09<00:56, 4.54it/s] Loading 0: 13%|█▎ | 37/291 [00:09<00:49, 5.13it/s] Loading 0: 13%|█▎ | 39/291 [00:10<00:58, 4.27it/s] Loading 0: 14%|█▎ | 40/291 [00:10<01:17, 3.22it/s] Loading 0: 14%|█▍ | 41/291 [00:11<01:39, 2.52it/s] Loading 0: 15%|█▌ | 44/291 [00:11<00:57, 4.29it/s] Loading 0: 15%|█▌ | 45/291 [00:11<00:54, 4.54it/s] Loading 0: 16%|█▌ | 46/291 [00:11<00:47, 5.13it/s] Loading 0: 16%|█▋ | 48/291 [00:12<00:56, 4.28it/s] Loading 0: 17%|█▋ | 49/291 [00:13<01:14, 3.23it/s] Loading 0: 17%|█▋ | 50/291 [00:13<01:35, 2.52it/s] Loading 0: 18%|█▊ | 53/291 [00:13<00:54, 4.34it/s] Loading 0: 19%|█▊ | 54/291 [00:14<00:51, 4.57it/s] Loading 0: 19%|█▉ | 55/291 [00:14<00:45, 5.14it/s] Loading 0: 20%|█▉ | 57/291 [00:14<00:54, 4.29it/s] Loading 0: 20%|█▉ | 58/291 [00:15<01:11, 3.25it/s] Loading 0: 20%|██ | 59/291 [00:16<01:30, 2.55it/s] Loading 0: 21%|██▏ | 62/291 [00:16<00:52, 4.38it/s] Loading 0: 22%|██▏ | 63/291 [00:16<00:49, 4.57it/s] Loading 0: 22%|██▏ | 64/291 [00:16<00:45, 5.03it/s] Loading 0: 23%|██▎ | 66/291 [00:17<00:52, 4.25it/s] Loading 0: 23%|██▎ | 67/291 [00:17<01:09, 3.22it/s] Loading 0: 23%|██▎ | 68/291 [00:18<01:27, 2.56it/s] Loading 0: 24%|██▍ | 71/291 [00:18<00:50, 4.35it/s] Loading 0: 25%|██▍ | 72/291 [00:18<00:47, 4.59it/s] Loading 0: 25%|██▌ | 73/291 [00:18<00:42, 5.11it/s] Loading 0: 26%|██▌ | 75/291 [00:19<00:50, 4.29it/s] Loading 0: 26%|██▌ | 76/291 [00:20<01:06, 3.24it/s] Loading 0: 26%|██▋ | 77/291 [00:20<01:23, 2.57it/s] Loading 0: 27%|██▋ | 80/291 [00:20<00:48, 4.35it/s] Loading 0: 28%|██▊ | 81/291 [00:21<00:45, 4.59it/s] Loading 0: 28%|██▊ | 82/291 [00:21<00:40, 5.16it/s] Loading 0: 29%|██▊ | 83/291 [00:21<00:42, 4.95it/s] Loading 0: 29%|██▉ | 84/291 [00:21<01:02, 3.34it/s] Loading 0: 29%|██▉ | 85/291 [00:22<01:17, 2.66it/s] Loading 0: 30%|██▉ | 86/291 [00:23<01:32, 2.21it/s] Loading 0: 31%|███ | 89/291 [00:23<00:49, 4.07it/s] Loading 0: 31%|███ | 90/291 [00:23<00:46, 4.33it/s] Loading 0: 31%|███▏ | 91/291 [00:23<00:40, 4.92it/s] Loading 0: 32%|███▏ | 93/291 [00:24<00:47, 4.19it/s] Loading 0: 32%|███▏ | 94/291 [00:24<01:01, 3.19it/s] Loading 0: 33%|███▎ | 95/291 [00:25<01:17, 2.53it/s] Loading 0: 34%|███▎ | 98/291 [00:25<00:44, 4.36it/s] Loading 0: 34%|███▍ | 99/291 [00:25<00:41, 4.59it/s] Loading 0: 34%|███▍ | 100/291 [00:26<00:37, 5.16it/s] Loading 0: 35%|███▌ | 102/291 [00:26<00:43, 4.32it/s] Loading 0: 35%|███▌ | 103/291 [00:27<00:57, 3.25it/s] Loading 0: 36%|███▌ | 104/291 [00:27<01:13, 2.55it/s] Loading 0: 37%|███▋ | 107/291 [00:28<00:41, 4.39it/s] Loading 0: 37%|███▋ | 108/291 [00:28<00:39, 4.63it/s] Loading 0: 37%|███▋ | 109/291 [00:28<00:35, 5.20it/s] Loading 0: 38%|███▊ | 111/291 [00:28<00:41, 4.33it/s] Loading 0: 38%|███▊ | 112/291 [00:29<00:54, 3.26it/s] Loading 0: 39%|███▉ | 113/291 [00:30<01:09, 2.57it/s] Loading 0: 40%|███▉ | 116/291 [00:30<00:39, 4.38it/s] Loading 0: 40%|████ | 117/291 [00:30<00:37, 4.62it/s] Loading 0: 41%|████ | 118/291 [00:30<00:33, 5.21it/s] Loading 0: 41%|████ | 120/291 [00:31<00:39, 4.34it/s] Loading 0: 42%|████▏ | 121/291 [00:31<00:51, 3.28it/s] Loading 0: 42%|████▏ | 122/291 [00:32<01:05, 2.58it/s] Loading 0: 43%|████▎ | 125/291 [00:32<00:37, 4.37it/s] Loading 0: 43%|████▎ | 126/291 [00:32<00:35, 4.61it/s] Loading 0: 44%|████▎ | 127/291 [00:32<00:31, 5.19it/s] Loading 0: 44%|████▍ | 129/291 [00:33<00:37, 4.34it/s] Loading 0: 45%|████▍ | 130/291 [00:34<00:49, 3.27it/s] Loading 0: 45%|████▌ | 131/291 [00:34<01:02, 2.57it/s] Loading 0: 46%|████▌ | 134/291 [00:34<00:35, 4.42it/s] Loading 0: 46%|████▋ | 135/291 [00:35<00:33, 4.65it/s] Loading 0: 47%|████▋ | 136/291 [00:35<00:29, 5.22it/s] Loading 0: 47%|████▋ | 138/291 [00:35<00:35, 4.34it/s] Loading 0: 48%|████▊ | 139/291 [00:36<00:46, 3.26it/s] Loading 0: 48%|████▊ | 140/291 [00:37<00:58, 2.56it/s] Loading 0: 49%|████▉ | 143/291 [00:37<00:33, 4.39it/s] Loading 0: 49%|████▉ | 144/291 [00:37<00:31, 4.63it/s] Loading 0: 50%|████▉ | 145/291 [00:37<00:28, 5.17it/s] Loading 0: 51%|█████ | 147/291 [00:38<00:33, 4.32it/s] Loading 0: 51%|█████ | 148/291 [00:38<00:43, 3.27it/s] Loading 0: 51%|█████ | 149/291 [00:39<00:54, 2.59it/s] Loading 0: 52%|█████▏ | 152/291 [00:39<00:31, 4.42it/s] Loading 0: 53%|█████▎ | 153/291 [00:39<00:29, 4.64it/s] Loading 0: 53%|█████▎ | 154/291 [00:39<00:26, 5.20it/s] Loading 0: 54%|█████▎ | 156/291 [00:40<00:31, 4.34it/s] Loading 0: 54%|█████▍ | 157/291 [00:40<00:40, 3.27it/s] Loading 0: 54%|█████▍ | 158/291 [00:41<00:51, 2.58it/s] Loading 0: 55%|█████▌ | 161/291 [00:41<00:29, 4.41it/s] Loading 0: 56%|█████▌ | 162/291 [00:42<00:27, 4.63it/s] Loading 0: 56%|█████▌ | 163/291 [00:42<00:24, 5.22it/s] Loading 0: 57%|█████▋ | 165/291 [00:42<00:28, 4.35it/s] Loading 0: 57%|█████▋ | 166/291 [00:43<00:38, 3.28it/s] Loading 0: 57%|█████▋ | 167/291 [00:43<00:47, 2.59it/s] Loading 0: 58%|█████▊ | 170/291 [00:44<00:27, 4.41it/s] Loading 0: 59%|█████▉ | 171/291 [00:44<00:25, 4.62it/s] Loading 0: 59%|█████▉ | 172/291 [00:44<00:22, 5.20it/s] Loading 0: 59%|█████▉ | 173/291 [00:44<00:33, 3.53it/s] Loading 0: 60%|██████ | 175/291 [00:45<00:24, 4.81it/s] Loading 0: 60%|██████ | 176/291 [00:45<00:22, 5.03it/s] Loading 0: 61%|██████ | 177/291 [00:45<00:20, 5.59it/s] Loading 0: 62%|██████▏ | 179/291 [00:46<00:25, 4.47it/s] Loading 0: 62%|██████▏ | 180/291 [00:46<00:33, 3.30it/s] Loading 0: 62%|██████▏ | 181/291 [00:47<00:42, 2.58it/s] Loading 0: 63%|██████▎ | 184/291 [00:47<00:23, 4.52it/s] Loading 0: 64%|██████▎ | 185/291 [00:47<00:22, 4.75it/s] Loading 0: 64%|██████▍ | 186/291 [00:47<00:19, 5.35it/s] Loading 0: 64%|██████▍ | 187/291 [00:47<00:19, 5.36it/s] Loading 0: 65%|██████▍ | 188/291 [00:48<00:29, 3.50it/s] Loading 0: 65%|██████▍ | 189/291 [00:49<00:39, 2.61it/s] Loading 0: 66%|██████▌ | 192/291 [00:49<00:27, 3.55it/s] Loading 0: 66%|██████▋ | 193/291 [00:50<00:33, 2.92it/s] Loading 0: 67%|██████▋ | 194/291 [00:50<00:39, 2.44it/s] Loading 0: 68%|██████▊ | 197/291 [00:51<00:22, 4.12it/s] Loading 0: 68%|██████▊ | 198/291 [00:51<00:21, 4.39it/s] Loading 0: 68%|██████▊ | 199/291 [00:51<00:18, 4.98it/s] Loading 0: 69%|██████▉ | 201/291 [00:51<00:21, 4.25it/s] Loading 0: 69%|██████▉ | 202/291 [00:52<00:27, 3.24it/s] Loading 0: 70%|██████▉ | 203/291 [00:53<00:34, 2.56it/s] Loading 0: 71%|███████ | 206/291 [00:53<00:19, 4.34it/s] Loading 0: 71%|███████ | 207/291 [00:53<00:18, 4.57it/s] Loading 0: 71%|███████▏ | 208/291 [00:53<00:16, 5.17it/s] Loading 0: 72%|███████▏ | 210/291 [00:54<00:18, 4.33it/s] Loading 0: 73%|███████▎ | 211/291 [00:54<00:24, 3.27it/s] Loading 0: 73%|███████▎ | 212/291 [00:55<00:30, 2.57it/s] Loading 0: 74%|███████▍ | 215/291 [00:55<00:17, 4.42it/s] Loading 0: 74%|███████▍ | 216/291 [00:55<00:16, 4.66it/s] Loading 0: 75%|███████▍ | 217/291 [00:56<00:14, 5.22it/s] Loading 0: 75%|███████▌ | 219/291 [00:56<00:16, 4.35it/s] Loading 0: 76%|███████▌ | 220/291 [00:57<00:21, 3.28it/s] Loading 0: 76%|███████▌ | 221/291 [00:57<00:26, 2.61it/s] Loading 0: 77%|███████▋ | 224/291 [00:58<00:15, 4.42it/s] Loading 0: 77%|███████▋ | 225/291 [00:58<00:14, 4.66it/s] Loading 0: 78%|███████▊ | 226/291 [00:58<00:12, 5.22it/s] Loading 0: 78%|███████▊ | 228/291 [00:58<00:14, 4.36it/s] Loading 0: 79%|███████▊ | 229/291 [00:59<00:18, 3.30it/s] Loading 0: 79%|███████▉ | 230/291 [01:00<00:23, 2.62it/s] Loading 0: 80%|████████ | 233/291 [01:00<00:12, 4.49it/s] Loading 0: 80%|████████ | 234/291 [01:00<00:12, 4.73it/s] Loading 0: 81%|████████ | 235/291 [01:00<00:10, 5.31it/s] Loading 0: 81%|████████▏ | 237/291 [01:01<00:12, 4.40it/s] Loading 0: 82%|████████▏ | 238/291 [01:01<00:15, 3.32it/s] Loading 0: 82%|████████▏ | 239/291 [01:02<00:19, 2.61it/s] Loading 0: 83%|████████▎ | 242/291 [01:02<00:10, 4.49it/s] Loading 0: 84%|████████▎ | 243/291 [01:02<00:10, 4.72it/s] Loading 0: 84%|████████▍ | 244/291 [01:02<00:08, 5.30it/s] Loading 0: 85%|████████▍ | 246/291 [01:03<00:10, 4.38it/s] Loading 0: 85%|████████▍ | 247/291 [01:03<00:13, 3.29it/s] Loading 0: 85%|████████▌ | 248/291 [01:04<00:16, 2.61it/s] Loading 0: 86%|████████▋ | 251/291 [01:04<00:08, 4.48it/s] Loading 0: 87%|████████▋ | 252/291 [01:04<00:08, 4.71it/s] Loading 0: 87%|████████▋ | 253/291 [01:05<00:07, 5.28it/s] Loading 0: 88%|████████▊ | 255/291 [01:05<00:08, 4.38it/s] Loading 0: 88%|████████▊ | 256/291 [01:06<00:10, 3.23it/s] Loading 0: 88%|████████▊ | 257/291 [01:06<00:13, 2.58it/s] Loading 0: 89%|████████▉ | 260/291 [01:07<00:06, 4.44it/s] Loading 0: 90%|████████▉ | 261/291 [01:07<00:06, 4.67it/s] Loading 0: 90%|█████████ | 262/291 [01:07<00:05, 5.26it/s] Loading 0: 91%|█████████ | 264/291 [01:07<00:06, 4.37it/s] Loading 0: 91%|█████████ | 265/291 [01:08<00:07, 3.30it/s] Loading 0: 91%|█████████▏| 266/291 [01:09<00:09, 2.61it/s] Loading 0: 92%|█████████▏| 269/291 [01:09<00:04, 4.49it/s] Loading 0: 93%|█████████▎| 270/291 [01:09<00:04, 4.80it/s] Loading 0: 93%|█████████▎| 271/291 [01:09<00:03, 5.39it/s] Loading 0: 94%|█████████▍| 273/291 [01:10<00:04, 4.42it/s] Loading 0: 94%|█████████▍| 274/291 [01:10<00:05, 3.33it/s] Loading 0: 95%|█████████▍| 275/291 [01:11<00:06, 2.60it/s] Loading 0: 96%|█████████▌| 278/291 [01:11<00:02, 4.43it/s] Loading 0: 96%|█████████▌| 279/291 [01:11<00:02, 4.67it/s] Loading 0: 96%|█████████▌| 280/291 [01:11<00:02, 5.26it/s] Loading 0: 97%|█████████▋| 281/291 [01:12<00:02, 3.55it/s] Loading 0: 97%|█████████▋| 282/291 [01:13<00:03, 2.69it/s] Loading 0: 98%|█████████▊| 284/291 [01:13<00:01, 3.87it/s] Loading 0: 98%|█████████▊| 285/291 [01:13<00:01, 4.22it/s] Loading 0: 98%|█████████▊| 286/291 [01:13<00:01, 4.85it/s] Loading 0: 99%|█████████▊| 287/291 [01:13<00:00, 5.04it/s] Loading 0: 99%|█████████▉| 288/291 [01:14<00:00, 3.30it/s]
Job chaiml-llama-8b-pairwis-8189-v38-mkmlizer completed after 131.37s with status: succeeded
Stopping job with name chaiml-llama-8b-pairwis-8189-v38-mkmlizer
Pipeline stage MKMLizer completed in 132.89s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.43s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-llama-8b-pairwis-8189-v38
Waiting for inference service chaiml-llama-8b-pairwis-8189-v38 to be ready
Inference service chaiml-llama-8b-pairwis-8189-v38 ready after 182.80422496795654s
Pipeline stage MKMLDeployer completed in 184.17s
run pipeline stage %s
Running pipeline stage StressChecker
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 4.7324888706207275s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 3.898766040802002s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 3.5399348735809326s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 3.011776924133301s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 3.980950117111206s
5 requests
0 failed requests
5th percentile: 3.117408514022827
10th percentile: 3.2230401039123535
20th percentile: 3.4343032836914062
30th percentile: 3.6117011070251466
40th percentile: 3.755233573913574
50th percentile: 3.898766040802002
60th percentile: 3.9316396713256836
70th percentile: 3.9645133018493652
80th percentile: 4.131257867813111
90th percentile: 4.431873369216919
95th percentile: 4.582181119918823
99th percentile: 4.702427320480346
mean time: 3.8327833652496337
%s, retrying in %s seconds...
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 3.209373950958252s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 2.642841100692749s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 3.559443950653076s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 3.8325979709625244s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 4.536367893218994s
5 requests
0 failed requests
5th percentile: 2.7561476707458494
10th percentile: 2.8694542407989503
20th percentile: 3.0960673809051515
30th percentile: 3.279387950897217
40th percentile: 3.4194159507751465
50th percentile: 3.559443950653076
60th percentile: 3.6687055587768556
70th percentile: 3.7779671669006345
80th percentile: 3.9733519554138184
90th percentile: 4.254859924316406
95th percentile: 4.3956139087677
99th percentile: 4.508217096328735
mean time: 3.556124973297119
%s, retrying in %s seconds...
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 4.656041383743286s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 4.923593759536743s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 4.218329191207886s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 3.358891010284424s
HTTP Request: %s %s "%s %d %s"
Received healthy response to inference request in 3.4003360271453857s
5 requests
0 failed requests
5th percentile: 3.367180013656616
10th percentile: 3.3754690170288084
20th percentile: 3.3920470237731934
30th percentile: 3.563934659957886
40th percentile: 3.891131925582886
50th percentile: 4.218329191207886
60th percentile: 4.393414068222046
70th percentile: 4.568498945236206
80th percentile: 4.709551858901977
90th percentile: 4.816572809219361
95th percentile: 4.870083284378052
99th percentile: 4.912891664505005
mean time: 4.111438274383545
clean up pipeline due to error=DeploymentChecksError('Unacceptable 70th percentile latency 4.568498945236206s')
Shutdown handler de-registered
chaiml-llama-8b-pairwis_8189_v38 status is now failed due to DeploymentManager action
admin requested tearing down of chaiml-llama-8b-pairwis_8189_v38
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLDeleter
Checking if service chaiml-llama-8b-pairwis-8189-v38 is running
%s, retrying in %s seconds...
Checking if service chaiml-llama-8b-pairwis-8189-v38 is running
%s, retrying in %s seconds...
Checking if service chaiml-llama-8b-pairwis-8189-v38 is running
clean up pipeline due to error=TeardownError('Exception when calling CustomObjectsApi->list_namespaced_custom_object: (403)\nReason: Forbidden\nHTTP response headers: HTTPHeaderDict({\'Audit-Id\': \'384016a8-552c-493d-b695-e1b7d0962806, 3089c821-9299-4939-8176-6d8ecca84457\', \'Cache-Control\': \'no-cache, private, no-cache, private\', \'Content-Length\': \'406\', \'Content-Type\': \'application/json\', \'Date\': \'Mon, 14 Oct 2024 14:21:35 GMT\', \'X-Content-Type-Options\': \'nosniff\', \'X-Kubernetes-Pf-Flowschema-Uid\': \'33f90070-eb4d-4b82-b71a-96fabdf69ad9\', \'X-Kubernetes-Pf-Prioritylevel-Uid\': \'07a050f1-ad60-4b87-8572-f4584856b92a\'})\nHTTP response body: {"kind":"Status","apiVersion":"v1","metadata":{},"status":"Failure","message":"inferenceservices.serving.kserve.io is forbidden: User \\"system:serviceaccount:guanaco-backend:default\\" cannot list resource \\"inferenceservices\\" in API group \\"serving.kserve.io\\" in the namespace \\"tenant-chaiml-guanaco\\"","reason":"Forbidden","details":{"group":"serving.kserve.io","kind":"inferenceservices"},"code":403}\n\n\n')
Shutdown handler de-registered
admin requested tearing down of chaiml-llama-8b-pairwis_8189_v38
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLDeleter
Checking if service chaiml-llama-8b-pairwis-8189-v38 is running
%s, retrying in %s seconds...
Checking if service chaiml-llama-8b-pairwis-8189-v38 is running
%s, retrying in %s seconds...
Checking if service chaiml-llama-8b-pairwis-8189-v38 is running
clean up pipeline due to error=TeardownError('Exception when calling CustomObjectsApi->list_namespaced_custom_object: (403)\nReason: Forbidden\nHTTP response headers: HTTPHeaderDict({\'Audit-Id\': \'ffbfa8ee-e6ed-47f8-9863-04a7632d58a2, e2cd54de-d823-4c0b-8993-17a54b24166c\', \'Cache-Control\': \'no-cache, private, no-cache, private\', \'Content-Length\': \'406\', \'Content-Type\': \'application/json\', \'Date\': \'Mon, 14 Oct 2024 14:26:39 GMT\', \'X-Content-Type-Options\': \'nosniff\', \'X-Kubernetes-Pf-Flowschema-Uid\': \'33f90070-eb4d-4b82-b71a-96fabdf69ad9\', \'X-Kubernetes-Pf-Prioritylevel-Uid\': \'07a050f1-ad60-4b87-8572-f4584856b92a\'})\nHTTP response body: {"kind":"Status","apiVersion":"v1","metadata":{},"status":"Failure","message":"inferenceservices.serving.kserve.io is forbidden: User \\"system:serviceaccount:guanaco-backend:default\\" cannot list resource \\"inferenceservices\\" in API group \\"serving.kserve.io\\" in the namespace \\"tenant-chaiml-guanaco\\"","reason":"Forbidden","details":{"group":"serving.kserve.io","kind":"inferenceservices"},"code":403}\n\n\n')
Shutdown handler de-registered
admin requested tearing down of chaiml-llama-8b-pairwis_8189_v38
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLDeleter
Checking if service chaiml-llama-8b-pairwis-8189-v38 is running
%s, retrying in %s seconds...
Checking if service chaiml-llama-8b-pairwis-8189-v38 is running
%s, retrying in %s seconds...
Checking if service chaiml-llama-8b-pairwis-8189-v38 is running
clean up pipeline due to error=TeardownError('Exception when calling CustomObjectsApi->list_namespaced_custom_object: (403)\nReason: Forbidden\nHTTP response headers: HTTPHeaderDict({\'Audit-Id\': \'a98b0f0d-c460-4ea8-984d-63a91b4e4cc6, ea3ee171-fdb3-45dc-9c24-238fb17c4b41\', \'Cache-Control\': \'no-cache, private, no-cache, private\', \'Content-Length\': \'406\', \'Content-Type\': \'application/json\', \'Date\': \'Mon, 14 Oct 2024 14:31:40 GMT\', \'X-Content-Type-Options\': \'nosniff\', \'X-Kubernetes-Pf-Flowschema-Uid\': \'33f90070-eb4d-4b82-b71a-96fabdf69ad9\', \'X-Kubernetes-Pf-Prioritylevel-Uid\': \'07a050f1-ad60-4b87-8572-f4584856b92a\'})\nHTTP response body: {"kind":"Status","apiVersion":"v1","metadata":{},"status":"Failure","message":"inferenceservices.serving.kserve.io is forbidden: User \\"system:serviceaccount:guanaco-backend:default\\" cannot list resource \\"inferenceservices\\" in API group \\"serving.kserve.io\\" in the namespace \\"tenant-chaiml-guanaco\\"","reason":"Forbidden","details":{"group":"serving.kserve.io","kind":"inferenceservices"},"code":403}\n\n\n')
Shutdown handler de-registered
chaiml-llama-8b-pairwis_8189_v38 status is now inactive due to auto deactivation removed underperforming models
chaiml-llama-8b-pairwis_8189_v38 status is now torndown due to DeploymentManager action