submission_id: rica40325-dpo-1008-5162s_v4
developer_uid: rica40325
best_of: 8
celo_rating: 1297.21
display_name: rica40325-dpo-1008-5162s_v4
family_friendly_score: 0.5844610730775355
family_friendly_standard_error: 0.006189247695120724
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: False
language_model: rica40325/dpo_1008_5162s
max_input_tokens: 1024
max_output_tokens: 64
model_architecture: MistralForCausalLM
model_group: rica40325/dpo_1008_5162s
model_name: rica40325-dpo-1008-5162s_v4
model_num_parameters: 12772070400.0
model_repo: rica40325/dpo_1008_5162s
model_size: 13B
num_battles: 6562
num_wins: 3646
ranking_group: single
status: torndown
submission_type: basic
timestamp: 2024-10-08T12:25:00+00:00
us_pacific_date: 2024-10-08
win_ratio: 0.5556232855836635
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name rica40325-dpo-1008-5162s-v4-mkmlizer
Waiting for job on rica40325-dpo-1008-5162s-v4-mkmlizer to finish
rica40325-dpo-1008-5162s-v4-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
rica40325-dpo-1008-5162s-v4-mkmlizer: ║ _____ __ __ ║
rica40325-dpo-1008-5162s-v4-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
rica40325-dpo-1008-5162s-v4-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
rica40325-dpo-1008-5162s-v4-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
rica40325-dpo-1008-5162s-v4-mkmlizer: ║ /___/ ║
rica40325-dpo-1008-5162s-v4-mkmlizer: ║ ║
rica40325-dpo-1008-5162s-v4-mkmlizer: ║ Version: 0.11.12 ║
rica40325-dpo-1008-5162s-v4-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
rica40325-dpo-1008-5162s-v4-mkmlizer: ║ https://mk1.ai ║
rica40325-dpo-1008-5162s-v4-mkmlizer: ║ ║
rica40325-dpo-1008-5162s-v4-mkmlizer: ║ The license key for the current software has been verified as ║
rica40325-dpo-1008-5162s-v4-mkmlizer: ║ belonging to: ║
rica40325-dpo-1008-5162s-v4-mkmlizer: ║ ║
rica40325-dpo-1008-5162s-v4-mkmlizer: ║ Chai Research Corp. ║
rica40325-dpo-1008-5162s-v4-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
rica40325-dpo-1008-5162s-v4-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
rica40325-dpo-1008-5162s-v4-mkmlizer: ║ ║
rica40325-dpo-1008-5162s-v4-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
rica40325-dpo-1008-5162s-v4-mkmlizer: Downloaded to shared memory in 107.033s
rica40325-dpo-1008-5162s-v4-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp4bztl6xq, device:0
rica40325-dpo-1008-5162s-v4-mkmlizer: Saving flywheel model at /dev/shm/model_cache
rica40325-dpo-1008-5162s-v4-mkmlizer: quantized model in 44.248s
rica40325-dpo-1008-5162s-v4-mkmlizer: Processed model rica40325/dpo_1008_5162s in 151.280s
rica40325-dpo-1008-5162s-v4-mkmlizer: creating bucket guanaco-mkml-models
rica40325-dpo-1008-5162s-v4-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
rica40325-dpo-1008-5162s-v4-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/rica40325-dpo-1008-5162s-v4
rica40325-dpo-1008-5162s-v4-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/rica40325-dpo-1008-5162s-v4/special_tokens_map.json
rica40325-dpo-1008-5162s-v4-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/rica40325-dpo-1008-5162s-v4/config.json
rica40325-dpo-1008-5162s-v4-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/rica40325-dpo-1008-5162s-v4/tokenizer_config.json
rica40325-dpo-1008-5162s-v4-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/rica40325-dpo-1008-5162s-v4/tokenizer.json
rica40325-dpo-1008-5162s-v4-mkmlizer: Loading 0: 0%| | 0/363 [00:00<?, ?it/s] Loading 0: 1%|▏ | 5/363 [00:00<00:18, 19.04it/s] Loading 0: 3%|▎ | 10/363 [00:00<00:14, 24.76it/s] Loading 0: 4%|▍ | 14/363 [00:00<00:15, 21.83it/s] Loading 0: 5%|▌ | 19/363 [00:00<00:12, 27.88it/s] Loading 0: 6%|▋ | 23/363 [00:01<00:16, 20.86it/s] Loading 0: 7%|▋ | 26/363 [00:01<00:18, 18.24it/s] Loading 0: 9%|▊ | 31/363 [00:01<00:13, 23.80it/s] Loading 0: 9%|▉ | 34/363 [00:01<00:13, 23.51it/s] Loading 0: 10%|█ | 37/363 [00:01<00:13, 24.60it/s] Loading 0: 11%|█▏ | 41/363 [00:01<00:14, 22.02it/s] Loading 0: 13%|█▎ | 46/363 [00:01<00:11, 26.85it/s] Loading 0: 14%|█▍ | 50/363 [00:02<00:13, 23.40it/s] Loading 0: 15%|█▌ | 55/363 [00:02<00:11, 27.99it/s] Loading 0: 17%|█▋ | 60/363 [00:02<00:10, 28.58it/s] Loading 0: 18%|█▊ | 64/363 [00:02<00:16, 18.56it/s] Loading 0: 19%|█▉ | 69/363 [00:02<00:12, 22.85it/s] Loading 0: 20%|██ | 73/363 [00:03<00:12, 23.11it/s] Loading 0: 21%|██ | 77/363 [00:03<00:13, 21.74it/s] Loading 0: 23%|██▎ | 82/363 [00:03<00:10, 26.06it/s] Loading 0: 24%|██▎ | 86/363 [00:03<00:11, 23.43it/s] Loading 0: 25%|██▌ | 91/363 [00:03<00:09, 27.53it/s] Loading 0: 26%|██▌ | 95/363 [00:04<00:11, 24.34it/s] Loading 0: 28%|██▊ | 100/363 [00:04<00:09, 28.51it/s] Loading 0: 29%|██▊ | 104/363 [00:04<00:13, 19.60it/s] Loading 0: 30%|███ | 109/363 [00:04<00:10, 23.78it/s] Loading 0: 31%|███ | 113/363 [00:04<00:11, 21.93it/s] Loading 0: 33%|███▎ | 118/363 [00:04<00:09, 26.24it/s] Loading 0: 34%|███▎ | 122/363 [00:05<00:10, 23.53it/s] Loading 0: 35%|███▍ | 127/363 [00:05<00:08, 27.75it/s] Loading 0: 36%|███▌ | 131/363 [00:05<00:09, 24.36it/s] Loading 0: 37%|███▋ | 136/363 [00:05<00:07, 28.55it/s] Loading 0: 39%|███▉ | 141/363 [00:05<00:07, 29.24it/s] Loading 0: 40%|███▉ | 145/363 [00:06<00:10, 20.86it/s] Loading 0: 41%|████ | 149/363 [00:06<00:10, 19.97it/s] Loading 0: 42%|████▏ | 154/363 [00:06<00:08, 24.37it/s] Loading 0: 44%|████▎ | 158/363 [00:06<00:09, 22.26it/s] Loading 0: 45%|████▍ | 163/363 [00:06<00:07, 26.33it/s] Loading 0: 46%|████▌ | 167/363 [00:07<00:08, 23.46it/s] Loading 0: 47%|████▋ | 172/363 [00:07<00:06, 27.51it/s] Loading 0: 48%|████▊ | 176/363 [00:07<00:07, 24.15it/s] Loading 0: 50%|████▉ | 181/363 [00:07<00:06, 28.23it/s] Loading 0: 51%|█████ | 185/363 [00:07<00:09, 19.03it/s] Loading 0: 52%|█████▏ | 190/363 [00:07<00:07, 23.29it/s] Loading 0: 53%|█████▎ | 194/363 [00:08<00:07, 21.73it/s] Loading 0: 55%|█████▍ | 199/363 [00:08<00:06, 25.94it/s] Loading 0: 56%|█████▌ | 203/363 [00:08<00:06, 23.22it/s] Loading 0: 57%|█████▋ | 208/363 [00:08<00:05, 27.48it/s] Loading 0: 58%|█████▊ | 212/363 [00:08<00:06, 24.17it/s] Loading 0: 60%|█████▉ | 217/363 [00:08<00:05, 28.46it/s] Loading 0: 61%|██████ | 222/363 [00:09<00:04, 29.24it/s] Loading 0: 62%|██████▏ | 226/363 [00:09<00:06, 20.85it/s] Loading 0: 63%|██████▎ | 230/363 [00:09<00:06, 20.06it/s] Loading 0: 65%|██████▍ | 235/363 [00:09<00:05, 24.64it/s] Loading 0: 66%|██████▌ | 239/363 [00:10<00:05, 22.38it/s] Loading 0: 67%|██████▋ | 244/363 [00:10<00:04, 26.74it/s] Loading 0: 68%|██████▊ | 248/363 [00:10<00:04, 23.77it/s] Loading 0: 70%|██████▉ | 253/363 [00:10<00:03, 28.00it/s] Loading 0: 71%|███████ | 257/363 [00:10<00:04, 24.34it/s] Loading 0: 72%|███████▏ | 262/363 [00:10<00:03, 28.42it/s] Loading 0: 73%|███████▎ | 266/363 [00:11<00:05, 19.14it/s] Loading 0: 75%|███████▍ | 271/363 [00:11<00:03, 23.52it/s] Loading 0: 76%|███████▌ | 275/363 [00:11<00:04, 21.91it/s] Loading 0: 77%|███████▋ | 280/363 [00:11<00:03, 26.33it/s] Loading 0: 78%|███████▊ | 284/363 [00:11<00:03, 23.40it/s] Loading 0: 80%|███████▉ | 289/363 [00:11<00:02, 27.76it/s] Loading 0: 81%|████████ | 293/363 [00:12<00:02, 24.10it/s] Loading 0: 82%|████████▏ | 298/363 [00:12<00:02, 28.49it/s] Loading 0: 83%|████████▎ | 303/363 [00:12<00:02, 29.03it/s] Loading 0: 85%|████████▍ | 307/363 [00:12<00:02, 20.85it/s] Loading 0: 86%|████████▌ | 311/363 [00:13<00:02, 20.03it/s] Loading 0: 87%|████████▋ | 316/363 [00:13<00:01, 24.47it/s] Loading 0: 88%|████████▊ | 320/363 [00:13<00:01, 22.43it/s] Loading 0: 90%|████████▉ | 325/363 [00:13<00:01, 26.87it/s] Loading 0: 91%|█████████ | 329/363 [00:13<00:01, 23.87it/s] Loading 0: 92%|█████████▏| 334/363 [00:13<00:01, 28.19it/s] Loading 0: 93%|█████████▎| 338/363 [00:14<00:01, 24.63it/s] Loading 0: 94%|█████████▍| 343/363 [00:14<00:00, 28.51it/s] Loading 0: 96%|█████████▌| 347/363 [00:21<00:08, 1.95it/s] Loading 0: 96%|█████████▋| 350/363 [00:21<00:05, 2.45it/s] Loading 0: 97%|█████████▋| 353/363 [00:21<00:03, 3.12it/s] Loading 0: 98%|█████████▊| 357/363 [00:21<00:01, 4.26it/s] Loading 0: 100%|█████████▉| 362/363 [00:21<00:00, 6.34it/s]
Job rica40325-dpo-1008-5162s-v4-mkmlizer completed after 186.2s with status: succeeded
Stopping job with name rica40325-dpo-1008-5162s-v4-mkmlizer
Pipeline stage MKMLizer completed in 186.79s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.16s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service rica40325-dpo-1008-5162s-v4
Waiting for inference service rica40325-dpo-1008-5162s-v4 to be ready
Inference service rica40325-dpo-1008-5162s-v4 ready after 130.43958377838135s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Pipeline stage MKMLDeployer completed in 134.69s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 5.891027450561523s
Received healthy response to inference request in 5.69687294960022s
Received healthy response to inference request in 5.3210248947143555s
Received healthy response to inference request in 4.25731897354126s
Received healthy response to inference request in 3.451618194580078s
5 requests
0 failed requests
5th percentile: 3.6127583503723146
10th percentile: 3.7738985061645507
20th percentile: 4.096178817749023
30th percentile: 4.470060157775879
40th percentile: 4.8955425262451175
50th percentile: 5.3210248947143555
60th percentile: 5.471364116668701
70th percentile: 5.621703338623047
80th percentile: 5.735703849792481
90th percentile: 5.813365650177002
95th percentile: 5.852196550369262
99th percentile: 5.883261270523072
mean time: 4.9235724925994875
%s, retrying in %s seconds...
Received healthy response to inference request in 1.7029902935028076s
Received healthy response to inference request in 1.8041858673095703s
Received healthy response to inference request in 2.670348882675171s
Received healthy response to inference request in 2.7134742736816406s
Received healthy response to inference request in 2.065708875656128s
5 requests
0 failed requests
5th percentile: 1.7232294082641602
10th percentile: 1.7434685230255127
20th percentile: 1.7839467525482178
30th percentile: 1.856490468978882
40th percentile: 1.961099672317505
50th percentile: 2.065708875656128
60th percentile: 2.307564878463745
70th percentile: 2.549420881271362
80th percentile: 2.6789739608764647
90th percentile: 2.6962241172790526
95th percentile: 2.704849195480347
99th percentile: 2.711749258041382
mean time: 2.1913416385650635
Pipeline stage StressChecker completed in 39.74s
Shutdown handler de-registered
rica40325-dpo-1008-5162s_v4 status is now deployed due to DeploymentManager action
rica40325-dpo-1008-5162s_v4 status is now inactive due to auto deactivation removed underperforming models
rica40325-dpo-1008-5162s_v4 status is now torndown due to DeploymentManager action