submission_id: rica40325-dpo-1008-5162s_v2
developer_uid: rica40325
best_of: 8
celo_rating: 1294.27
display_name: rica40325-dpo-1008-5162s_v2
family_friendly_score: 0.5816983122362869
family_friendly_standard_error: 0.006180800235253562
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: False
language_model: rica40325/dpo_1008_5162s
max_input_tokens: 1024
max_output_tokens: 64
model_architecture: MistralForCausalLM
model_group: rica40325/dpo_1008_5162s
model_name: rica40325-dpo-1008-5162s_v2
model_num_parameters: 12772070400.0
model_repo: rica40325/dpo_1008_5162s
model_size: 13B
num_battles: 6577
num_wins: 3620
ranking_group: single
status: torndown
submission_type: basic
timestamp: 2024-10-08T12:24:58+00:00
us_pacific_date: 2024-10-08
win_ratio: 0.5504029192641021
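The win_ratio field above is consistent with num_wins divided by num_battles. A minimal sketch of that check follows; the helper name `win_ratio` and the zero-battle guard are illustrative assumptions, not part of the platform's code.

```python
def win_ratio(num_wins: int, num_battles: int) -> float:
    """Fraction of battles won; returns 0.0 when no battles were played."""
    if num_battles == 0:
        return 0.0
    return num_wins / num_battles


# Values copied from the metadata fields num_wins and num_battles.
ratio = win_ratio(3620, 6577)
print(f"win_ratio: {ratio:.4f}")  # prints "win_ratio: 0.5504"
```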
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name rica40325-dpo-1008-5162s-v2-mkmlizer
Waiting for job on rica40325-dpo-1008-5162s-v2-mkmlizer to finish
rica40325-dpo-1008-5162s-v2-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
rica40325-dpo-1008-5162s-v2-mkmlizer: ║ _____ __ __ ║
rica40325-dpo-1008-5162s-v2-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
rica40325-dpo-1008-5162s-v2-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
rica40325-dpo-1008-5162s-v2-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
rica40325-dpo-1008-5162s-v2-mkmlizer: ║ /___/ ║
rica40325-dpo-1008-5162s-v2-mkmlizer: ║ ║
rica40325-dpo-1008-5162s-v2-mkmlizer: ║ Version: 0.11.12 ║
rica40325-dpo-1008-5162s-v2-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
rica40325-dpo-1008-5162s-v2-mkmlizer: ║ https://mk1.ai ║
rica40325-dpo-1008-5162s-v2-mkmlizer: ║ ║
rica40325-dpo-1008-5162s-v2-mkmlizer: ║ The license key for the current software has been verified as ║
rica40325-dpo-1008-5162s-v2-mkmlizer: ║ belonging to: ║
rica40325-dpo-1008-5162s-v2-mkmlizer: ║ ║
rica40325-dpo-1008-5162s-v2-mkmlizer: ║ Chai Research Corp. ║
rica40325-dpo-1008-5162s-v2-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
rica40325-dpo-1008-5162s-v2-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
rica40325-dpo-1008-5162s-v2-mkmlizer: ║ ║
rica40325-dpo-1008-5162s-v2-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
rica40325-dpo-1008-5162s-v2-mkmlizer: Downloaded to shared memory in 113.622s
rica40325-dpo-1008-5162s-v2-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpegmmrihv, device:0
rica40325-dpo-1008-5162s-v2-mkmlizer: Saving flywheel model at /dev/shm/model_cache
rica40325-dpo-1008-5162s-v2-mkmlizer: quantized model in 44.397s
rica40325-dpo-1008-5162s-v2-mkmlizer: Processed model rica40325/dpo_1008_5162s in 158.020s
rica40325-dpo-1008-5162s-v2-mkmlizer: creating bucket guanaco-mkml-models
rica40325-dpo-1008-5162s-v2-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
rica40325-dpo-1008-5162s-v2-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/rica40325-dpo-1008-5162s-v2
rica40325-dpo-1008-5162s-v2-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/rica40325-dpo-1008-5162s-v2/config.json
rica40325-dpo-1008-5162s-v2-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/rica40325-dpo-1008-5162s-v2/special_tokens_map.json
rica40325-dpo-1008-5162s-v2-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/rica40325-dpo-1008-5162s-v2/tokenizer_config.json
rica40325-dpo-1008-5162s-v2-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/rica40325-dpo-1008-5162s-v2/tokenizer.json
rica40325-dpo-1008-5162s-v2-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/rica40325-dpo-1008-5162s-v2/flywheel_model.0.safetensors
rica40325-dpo-1008-5162s-v2-mkmlizer: Loading 0: 100%|█████████▉| 362/363 [00:21<00:00, 6.31it/s]
Job rica40325-dpo-1008-5162s-v2-mkmlizer completed after 186.03s with status: succeeded
Stopping job with name rica40325-dpo-1008-5162s-v2-mkmlizer
Pipeline stage MKMLizer completed in 186.53s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service rica40325-dpo-1008-5162s-v2
Waiting for inference service rica40325-dpo-1008-5162s-v2 to be ready
Inference service rica40325-dpo-1008-5162s-v2 ready after 130.67207527160645s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Pipeline stage MKMLDeployer completed in 139.78s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 5.499078273773193s
Received healthy response to inference request in 12.990625381469727s
Received healthy response to inference request in 9.743576526641846s
Received healthy response to inference request in 2.093740463256836s
Received healthy response to inference request in 2.0065953731536865s
5 requests
0 failed requests
5th percentile: 2.0240243911743163
10th percentile: 2.041453409194946
20th percentile: 2.076311445236206
30th percentile: 2.774808025360107
40th percentile: 4.136943149566651
50th percentile: 5.499078273773193
60th percentile: 7.196877574920654
70th percentile: 8.894676876068115
80th percentile: 10.392986297607422
90th percentile: 11.691805839538574
95th percentile: 12.34121561050415
99th percentile: 12.860743427276612
mean time: 6.466723203659058
%s, retrying in %s seconds...
Received healthy response to inference request in 2.365415096282959s
Received healthy response to inference request in 1.705930233001709s
Received healthy response to inference request in 1.8279533386230469s
Received healthy response to inference request in 2.041987895965576s
Received healthy response to inference request in 1.829470157623291s
5 requests
0 failed requests
5th percentile: 1.7303348541259767
10th percentile: 1.754739475250244
20th percentile: 1.8035487174987792
30th percentile: 1.8282567024230958
40th percentile: 1.8288634300231934
50th percentile: 1.829470157623291
60th percentile: 1.914477252960205
70th percentile: 1.9994843482971192
80th percentile: 2.106673336029053
90th percentile: 2.2360442161560057
95th percentile: 2.3007296562194823
99th percentile: 2.3524780082702637
mean time: 1.9541513442993164
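The percentile and mean figures reported by StressChecker match linear interpolation between the sorted latency samples. The sketch below reproduces them from the five response times of each run; the function name `percentile` and the variable names are hypothetical, and the StressChecker's actual implementation is not shown in this log.

```python
def percentile(samples, q):
    """q-th percentile (0-100) using linear interpolation between sorted samples."""
    ordered = sorted(samples)
    if not ordered:
        raise ValueError("samples must not be empty")
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


# Response times in seconds, copied from the two StressChecker runs above.
first_run = [5.499078273773193, 12.990625381469727, 9.743576526641846,
             2.093740463256836, 2.0065953731536865]
second_run = [2.365415096282959, 1.705930233001709, 1.8279533386230469,
              2.041987895965576, 1.829470157623291]

for name, run in (("first", first_run), ("second", second_run)):
    mean = sum(run) / len(run)
    print(f"{name} run: p50={percentile(run, 50):.3f}s "
          f"p95={percentile(run, 95):.3f}s mean={mean:.3f}s")
```

With only five samples per run, the upper percentiles are dominated by the single slowest request, so the first run's 12.99 s response accounts for most of the gap between the two runs.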
Pipeline stage StressChecker completed in 45.64s
Shutdown handler de-registered
rica40325-dpo-1008-5162s_v2 status is now deployed due to DeploymentManager action
rica40325-dpo-1008-5162s_v2 status is now inactive due to auto deactivation removed underperforming models
rica40325-dpo-1008-5162s_v2 status is now torndown due to DeploymentManager action