developer_uid: rica40325
submission_id: rica40325-lora32-secend_v1
model_name: rica40325-lora32-secend_v1
model_group: rica40325/lora32_secend
status: torndown
timestamp: 2024-08-30T10:10:27+00:00
num_battles: 12811
num_wins: 5117
celo_rating: 1166.4
family_friendly_score: 0.0
submission_type: basic
model_repo: rica40325/lora32_secend
model_architecture: LlamaForCausalLM
model_num_parameters: 8030261248.0
best_of: 1
max_input_tokens: 512
max_output_tokens: 64
display_name: rica40325-lora32-secend_v1
is_internal_developer: False
language_model: rica40325/lora32_secend
model_size: 8B
ranking_group: single
us_pacific_date: 2024-08-30
win_ratio: 0.3994223713995785
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 512, 'best_of': 1, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Running pipeline stage MKMLizer
Starting job with name rica40325-lora32-secend-v1-mkmlizer
Waiting for job on rica40325-lora32-secend-v1-mkmlizer to finish
Stopping job with name rica40325-lora32-secend-v1-mkmlizer
%s, retrying in %s seconds...
Starting job with name rica40325-lora32-secend-v1-mkmlizer
Waiting for job on rica40325-lora32-secend-v1-mkmlizer to finish
rica40325-lora32-secend-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
rica40325-lora32-secend-v1-mkmlizer: ║ _____ __ __ ║
rica40325-lora32-secend-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
rica40325-lora32-secend-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
rica40325-lora32-secend-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
rica40325-lora32-secend-v1-mkmlizer: ║ /___/ ║
rica40325-lora32-secend-v1-mkmlizer: ║ ║
rica40325-lora32-secend-v1-mkmlizer: ║ Version: 0.10.1 ║
rica40325-lora32-secend-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
rica40325-lora32-secend-v1-mkmlizer: ║ https://mk1.ai ║
rica40325-lora32-secend-v1-mkmlizer: ║ ║
rica40325-lora32-secend-v1-mkmlizer: ║ The license key for the current software has been verified as ║
rica40325-lora32-secend-v1-mkmlizer: ║ belonging to: ║
rica40325-lora32-secend-v1-mkmlizer: ║ ║
rica40325-lora32-secend-v1-mkmlizer: ║ Chai Research Corp. ║
rica40325-lora32-secend-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
rica40325-lora32-secend-v1-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
rica40325-lora32-secend-v1-mkmlizer: ║ ║
rica40325-lora32-secend-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
rica40325-lora32-secend-v1-mkmlizer: Downloaded to shared memory in 34.516s
rica40325-lora32-secend-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp7_ynw12o, device:0
rica40325-lora32-secend-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
rica40325-lora32-secend-v1-mkmlizer: quantized model in 25.469s
rica40325-lora32-secend-v1-mkmlizer: Processed model rica40325/lora32_secend in 59.985s
rica40325-lora32-secend-v1-mkmlizer: creating bucket guanaco-mkml-models
rica40325-lora32-secend-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
rica40325-lora32-secend-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/rica40325-lora32-secend-v1
rica40325-lora32-secend-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/rica40325-lora32-secend-v1/tokenizer.json
rica40325-lora32-secend-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/rica40325-lora32-secend-v1/flywheel_model.0.safetensors
rica40325-lora32-secend-v1-mkmlizer: Loading 0: 0%| | 0/291 [00:00<?, ?it/s] Loading 0: 1%|▏ | 4/291 [00:00<00:07, 36.55it/s] Loading 0: 4%|▍ | 13/291 [00:00<00:04, 65.88it/s] Loading 0: 9%|▊ | 25/291 [00:00<00:03, 79.55it/s] Loading 0: 14%|█▎ | 40/291 [00:00<00:02, 92.02it/s] Loading 0: 18%|█▊ | 52/291 [00:00<00:02, 91.74it/s] Loading 0: 23%|██▎ | 67/291 [00:00<00:02, 98.43it/s] Loading 0: 26%|██▋ | 77/291 [00:00<00:02, 92.13it/s] Loading 0: 30%|██▉ | 87/291 [00:02<00:08, 24.89it/s] Loading 0: 32%|███▏ | 94/291 [00:02<00:06, 28.18it/s] Loading 0: 35%|███▌ | 103/291 [00:02<00:05, 34.37it/s] Loading 0: 40%|███▉ | 115/291 [00:02<00:03, 44.50it/s] Loading 0: 45%|████▍ | 130/291 [00:02<00:02, 58.67it/s] Loading 0: 49%|████▉ | 142/291 [00:02<00:02, 65.60it/s] Loading 0: 54%|█████▍ | 157/291 [00:02<00:01, 77.72it/s] Loading 0: 58%|█████▊ | 169/291 [00:02<00:01, 81.98it/s] Loading 0: 62%|██████▏ | 179/291 [00:03<00:01, 79.04it/s] Loading 0: 65%|██████▍ | 188/291 [00:04<00:04, 24.82it/s] Loading 0: 69%|██████▉ | 202/291 [00:04<00:02, 33.78it/s] Loading 0: 73%|███████▎ | 211/291 [00:04<00:02, 38.94it/s] Loading 0: 77%|███████▋ | 223/291 [00:04<00:01, 46.62it/s] Loading 0: 80%|███████▉ | 232/291 [00:04<00:01, 51.99it/s] Loading 0: 85%|████████▍ | 247/291 [00:04<00:00, 65.21it/s] Loading 0: 89%|████████▉ | 259/291 [00:04<00:00, 71.38it/s] Loading 0: 94%|█████████▍| 274/291 [00:05<00:00, 81.94it/s] Loading 0: 98%|█████████▊| 286/291 [00:05<00:00, 81.32it/s]
Job rica40325-lora32-secend-v1-mkmlizer completed after 85.25s with status: succeeded
Stopping job with name rica40325-lora32-secend-v1-mkmlizer
Pipeline stage MKMLizer completed in 86.66s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.12s
Running pipeline stage ISVCDeployer
Creating inference service rica40325-lora32-secend-v1
Waiting for inference service rica40325-lora32-secend-v1 to be ready
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Inference service rica40325-lora32-secend-v1 ready after 190.73108434677124s
Pipeline stage ISVCDeployer completed in 191.17s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.0132215023040771s
Received healthy response to inference request in 1.1633458137512207s
Received healthy response to inference request in 1.0344722270965576s
Received healthy response to inference request in 1.9600436687469482s
Received healthy response to inference request in 2.1720683574676514s
5 requests
0 failed requests
5th percentile: 1.0174716472625733
10th percentile: 1.0217217922210693
20th percentile: 1.0302220821380614
30th percentile: 1.0602469444274902
40th percentile: 1.1117963790893555
50th percentile: 1.1633458137512207
60th percentile: 1.4820249557495115
70th percentile: 1.8007040977478026
80th percentile: 2.0024486064910887
90th percentile: 2.08725848197937
95th percentile: 2.129663419723511
99th percentile: 2.1635873699188233
mean time: 1.468630313873291
Pipeline stage StressChecker completed in 8.20s
rica40325-lora32-secend_v1 status is now deployed due to DeploymentManager action
rica40325-lora32-secend_v1 status is now inactive due to auto deactivation removed underperforming models
rica40325-lora32-secend_v1 status is now torndown due to DeploymentManager action