developer_uid: rica40325
submission_id: rica40325-lora32-first_v3
model_name: rica40325-lora32-first_v1
model_group: rica40325/lora32_first
status: torndown
timestamp: 2024-08-30T10:35:58+00:00
num_battles: 11322
num_wins: 5949
celo_rating: 1252.8
family_friendly_score: 0.0
submission_type: basic
model_repo: rica40325/lora32_first
model_architecture: LlamaForCausalLM
model_num_parameters: 8030261248.0
best_of: 16
max_input_tokens: 512
max_output_tokens: 64
display_name: rica40325-lora32-first_v1
is_internal_developer: False
language_model: rica40325/lora32_first
model_size: 8B
ranking_group: single
us_pacific_date: 2024-08-30
win_ratio: 0.5254372019077902
generation_params: {'temperature': 1.15, 'top_p': 0.95, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Running pipeline stage MKMLizer
Starting job with name rica40325-lora32-first-v3-mkmlizer
Waiting for job on rica40325-lora32-first-v3-mkmlizer to finish
Stopping job with name rica40325-lora32-first-v3-mkmlizer
%s, retrying in %s seconds...
Starting job with name rica40325-lora32-first-v3-mkmlizer
Waiting for job on rica40325-lora32-first-v3-mkmlizer to finish
Connection pool is full, discarding connection: %s. Connection pool size: %s
rica40325-lora32-first-v3-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
rica40325-lora32-first-v3-mkmlizer: ║ _____ __ __ ║
rica40325-lora32-first-v3-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
rica40325-lora32-first-v3-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
rica40325-lora32-first-v3-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
rica40325-lora32-first-v3-mkmlizer: ║ /___/ ║
rica40325-lora32-first-v3-mkmlizer: ║ ║
rica40325-lora32-first-v3-mkmlizer: ║ Version: 0.10.1 ║
rica40325-lora32-first-v3-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
rica40325-lora32-first-v3-mkmlizer: ║ https://mk1.ai ║
rica40325-lora32-first-v3-mkmlizer: ║ ║
rica40325-lora32-first-v3-mkmlizer: ║ The license key for the current software has been verified as ║
rica40325-lora32-first-v3-mkmlizer: ║ belonging to: ║
rica40325-lora32-first-v3-mkmlizer: ║ ║
rica40325-lora32-first-v3-mkmlizer: ║ Chai Research Corp. ║
rica40325-lora32-first-v3-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
rica40325-lora32-first-v3-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
rica40325-lora32-first-v3-mkmlizer: ║ ║
rica40325-lora32-first-v3-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
rica40325-lora32-first-v3-mkmlizer: Downloaded to shared memory in 19.586s
rica40325-lora32-first-v3-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpy147suvz, device:0
rica40325-lora32-first-v3-mkmlizer: Saving flywheel model at /dev/shm/model_cache
rica40325-lora32-first-v3-mkmlizer: quantized model in 26.754s
rica40325-lora32-first-v3-mkmlizer: Processed model rica40325/lora32_first in 46.340s
rica40325-lora32-first-v3-mkmlizer: creating bucket guanaco-mkml-models
rica40325-lora32-first-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/rica40325-lora32-first-v3/tokenizer.json
rica40325-lora32-first-v3-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/rica40325-lora32-first-v3/flywheel_model.0.safetensors
rica40325-lora32-first-v3-mkmlizer: Loading 0: 0%| | 0/291 [00:00<?, ?it/s] Loading 0: 1%|▏ | 4/291 [00:00<00:08, 34.99it/s] Loading 0: 3%|▎ | 8/291 [00:00<00:09, 30.93it/s] Loading 0: 5%|▌ | 16/291 [00:00<00:06, 44.72it/s] Loading 0: 9%|▊ | 25/291 [00:00<00:04, 56.39it/s] Loading 0: 12%|█▏ | 34/291 [00:00<00:03, 65.98it/s] Loading 0: 15%|█▍ | 43/291 [00:00<00:03, 65.59it/s] Loading 0: 20%|█▉ | 58/291 [00:00<00:02, 79.92it/s] Loading 0: 23%|██▎ | 67/291 [00:00<00:02, 82.13it/s] Loading 0: 26%|██▌ | 76/291 [00:01<00:02, 75.55it/s] Loading 0: 29%|██▉ | 84/291 [00:02<00:09, 21.53it/s] Loading 0: 31%|███ | 90/291 [00:02<00:08, 23.71it/s] Loading 0: 33%|███▎ | 97/291 [00:02<00:06, 28.16it/s] Loading 0: 36%|███▋ | 106/291 [00:02<00:05, 35.51it/s] Loading 0: 40%|███▉ | 115/291 [00:02<00:04, 40.92it/s] Loading 0: 43%|████▎ | 124/291 [00:02<00:03, 49.05it/s] Loading 0: 46%|████▌ | 133/291 [00:03<00:02, 54.10it/s] Loading 0: 49%|████▉ | 142/291 [00:03<00:02, 60.85it/s] Loading 0: 52%|█████▏ | 152/291 [00:03<00:01, 69.81it/s] Loading 0: 55%|█████▌ | 161/291 [00:03<00:01, 67.53it/s] Loading 0: 58%|█████▊ | 169/291 [00:03<00:01, 69.22it/s] Loading 0: 63%|██████▎ | 184/291 [00:03<00:01, 80.95it/s] Loading 0: 66%|██████▋ | 193/291 [00:04<00:04, 22.42it/s] Loading 0: 69%|██████▉ | 202/291 [00:04<00:03, 27.89it/s] Loading 0: 73%|███████▎ | 211/291 [00:05<00:02, 34.26it/s] Loading 0: 76%|███████▌ | 220/291 [00:05<00:01, 40.78it/s] Loading 0: 79%|███████▊ | 229/291 [00:05<00:01, 46.54it/s] Loading 0: 82%|████████▏ | 238/291 [00:05<00:01, 51.41it/s] Loading 0: 85%|████████▍ | 247/291 [00:05<00:00, 54.56it/s] Loading 0: 88%|████████▊ | 256/291 [00:05<00:00, 55.77it/s] Loading 0: 91%|█████████ | 265/291 [00:05<00:00, 58.34it/s] Loading 0: 94%|█████████▍| 274/291 [00:05<00:00, 62.19it/s] Loading 0: 97%|█████████▋| 283/291 [00:06<00:00, 67.89it/s] Loading 0: 100%|██████████| 291/291 [00:11<00:00, 5.12it/s]
Job rica40325-lora32-first-v3-mkmlizer completed after 74.13s with status: succeeded
Stopping job with name rica40325-lora32-first-v3-mkmlizer
Pipeline stage MKMLizer completed in 75.74s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.12s
Running pipeline stage ISVCDeployer
Creating inference service rica40325-lora32-first-v3
Waiting for inference service rica40325-lora32-first-v3 to be ready
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Inference service rica40325-lora32-first-v3 ready after 291.5949969291687s
Pipeline stage ISVCDeployer completed in 291.95s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.936044692993164s
Received healthy response to inference request in 1.9549837112426758s
Received healthy response to inference request in 1.8297874927520752s
Received healthy response to inference request in 1.6939373016357422s
Received healthy response to inference request in 1.6122667789459229s
5 requests
0 failed requests
5th percentile: 1.6286008834838868
10th percentile: 1.6449349880218507
20th percentile: 1.6776031970977783
30th percentile: 1.7211073398590089
40th percentile: 1.775447416305542
50th percentile: 1.8297874927520752
60th percentile: 1.8722903728485107
70th percentile: 1.9147932529449463
80th percentile: 1.9398324966430665
90th percentile: 1.947408103942871
95th percentile: 1.9511959075927734
99th percentile: 1.9542261505126952
mean time: 1.805403995513916
Pipeline stage StressChecker completed in 9.76s
rica40325-lora32-first_v3 status is now deployed due to DeploymentManager action
rica40325-lora32-first_v3 status is now inactive due to auto deactivation removed underperforming models
rica40325-lora32-first_v3 status is now torndown due to DeploymentManager action