submission_id: riverise-dpo-second_v2
developer_uid: Riverise
best_of: 16
celo_rating: 1239.1
display_name: riverise-dpo-second_v1
family_friendly_score: 0.0
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 0.8, 'min_p': 0.2, 'top_k': 30, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
is_internal_developer: False
language_model: Riverise/dpo-second
max_input_tokens: 512
max_output_tokens: 64
model_architecture: LlamaForCausalLM
model_group: Riverise/dpo-second
model_name: riverise-dpo-second_v1
model_num_parameters: 8030261248.0
model_repo: Riverise/dpo-second
model_size: 8B
num_battles: 11103
num_wins: 5667
ranking_group: single
status: torndown
submission_type: basic
timestamp: 2024-09-01T07:58:59+00:00
us_pacific_date: 2024-09-01
win_ratio: 0.5104025938935423
Resubmit model
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name riverise-dpo-second-v2-mkmlizer
Waiting for job on riverise-dpo-second-v2-mkmlizer to finish
Stopping job with name riverise-dpo-second-v2-mkmlizer
%s, retrying in %s seconds...
Starting job with name riverise-dpo-second-v2-mkmlizer
Waiting for job on riverise-dpo-second-v2-mkmlizer to finish
riverise-dpo-second-v2-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
riverise-dpo-second-v2-mkmlizer: ║ _____ __ __ ║
riverise-dpo-second-v2-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
riverise-dpo-second-v2-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
riverise-dpo-second-v2-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
riverise-dpo-second-v2-mkmlizer: ║ /___/ ║
riverise-dpo-second-v2-mkmlizer: ║ ║
riverise-dpo-second-v2-mkmlizer: ║ Version: 0.10.1 ║
riverise-dpo-second-v2-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
riverise-dpo-second-v2-mkmlizer: ║ https://mk1.ai ║
riverise-dpo-second-v2-mkmlizer: ║ ║
riverise-dpo-second-v2-mkmlizer: ║ The license key for the current software has been verified as ║
riverise-dpo-second-v2-mkmlizer: ║ belonging to: ║
riverise-dpo-second-v2-mkmlizer: ║ ║
riverise-dpo-second-v2-mkmlizer: ║ Chai Research Corp. ║
riverise-dpo-second-v2-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
riverise-dpo-second-v2-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
riverise-dpo-second-v2-mkmlizer: ║ ║
riverise-dpo-second-v2-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
riverise-dpo-second-v2-mkmlizer: Downloaded to shared memory in 47.239s
riverise-dpo-second-v2-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp43k_tyos, device:0
riverise-dpo-second-v2-mkmlizer: Saving flywheel model at /dev/shm/model_cache
riverise-dpo-second-v2-mkmlizer: quantized model in 28.716s
riverise-dpo-second-v2-mkmlizer: Processed model Riverise/dpo-second in 75.956s
riverise-dpo-second-v2-mkmlizer: creating bucket guanaco-mkml-models
riverise-dpo-second-v2-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
riverise-dpo-second-v2-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/riverise-dpo-second-v2
riverise-dpo-second-v2-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/riverise-dpo-second-v2/config.json
riverise-dpo-second-v2-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/riverise-dpo-second-v2/tokenizer_config.json
riverise-dpo-second-v2-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/riverise-dpo-second-v2/tokenizer.json
riverise-dpo-second-v2-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/riverise-dpo-second-v2/flywheel_model.0.safetensors
riverise-dpo-second-v2-mkmlizer: Loading 0: 0%| | 0/291 [00:00<?, ?it/s] Loading 0: 2%|▏ | 5/291 [00:00<00:10, 26.27it/s] Loading 0: 4%|▍ | 12/291 [00:00<00:07, 36.04it/s] Loading 0: 5%|▌ | 16/291 [00:00<00:08, 33.52it/s] Loading 0: 7%|▋ | 21/291 [00:00<00:07, 37.23it/s] Loading 0: 9%|▊ | 25/291 [00:00<00:07, 34.26it/s] Loading 0: 11%|█ | 31/291 [00:00<00:06, 40.90it/s] Loading 0: 12%|█▏ | 36/291 [00:01<00:10, 24.80it/s] Loading 0: 14%|█▍ | 41/291 [00:01<00:09, 26.70it/s] Loading 0: 16%|█▋ | 48/291 [00:01<00:07, 33.07it/s] Loading 0: 18%|█▊ | 52/291 [00:01<00:07, 32.36it/s] Loading 0: 20%|█▉ | 57/291 [00:01<00:06, 35.61it/s] Loading 0: 21%|██ | 61/291 [00:01<00:06, 34.13it/s] Loading 0: 23%|██▎ | 66/291 [00:01<00:06, 36.79it/s] Loading 0: 24%|██▍ | 70/291 [00:02<00:06, 35.03it/s] Loading 0: 25%|██▌ | 74/291 [00:02<00:06, 35.14it/s] Loading 0: 27%|██▋ | 78/291 [00:02<00:06, 34.75it/s] Loading 0: 28%|██▊ | 82/291 [00:02<00:08, 24.50it/s] Loading 0: 29%|██▉ | 85/291 [00:02<00:08, 25.22it/s] Loading 0: 31%|███ | 90/291 [00:02<00:06, 28.91it/s] Loading 0: 32%|███▏ | 94/291 [00:02<00:06, 28.84it/s] Loading 0: 34%|███▍ | 99/291 [00:03<00:05, 32.08it/s] Loading 0: 35%|███▌ | 103/291 [00:03<00:05, 31.64it/s] Loading 0: 37%|███▋ | 108/291 [00:03<00:05, 34.77it/s] Loading 0: 38%|███▊ | 112/291 [00:03<00:05, 33.22it/s] Loading 0: 40%|███▉ | 116/291 [00:03<00:05, 32.65it/s] Loading 0: 42%|████▏ | 122/291 [00:03<00:04, 37.34it/s] Loading 0: 44%|████▎ | 127/291 [00:03<00:04, 35.07it/s] Loading 0: 46%|████▌ | 133/291 [00:04<00:05, 29.18it/s] Loading 0: 47%|████▋ | 137/291 [00:04<00:05, 29.45it/s] Loading 0: 48%|████▊ | 141/291 [00:04<00:05, 27.57it/s] Loading 0: 51%|█████ | 147/291 [00:04<00:04, 32.18it/s] Loading 0: 52%|█████▏ | 151/291 [00:04<00:04, 31.24it/s] Loading 0: 54%|█████▎ | 156/291 [00:04<00:03, 34.41it/s] Loading 0: 55%|█████▍ | 160/291 [00:05<00:03, 33.01it/s] Loading 0: 57%|█████▋ | 165/291 [00:05<00:03, 35.49it/s] Loading 0: 58%|█████▊ | 169/291 [00:05<00:03, 33.20it/s] Loading 0: 60%|█████▉ | 174/291 [00:05<00:03, 34.62it/s] Loading 0: 61%|██████ | 178/291 [00:05<00:03, 32.67it/s] Loading 0: 63%|██████▎ | 184/291 [00:05<00:02, 38.69it/s] Loading 0: 65%|██████▍ | 189/291 [00:06<00:04, 24.05it/s] Loading 0: 67%|██████▋ | 194/291 [00:06<00:03, 25.29it/s] Loading 0: 69%|██████▉ | 201/291 [00:06<00:02, 31.80it/s] Loading 0: 70%|███████ | 205/291 [00:06<00:02, 31.36it/s] Loading 0: 72%|███████▏ | 210/291 [00:06<00:02, 34.38it/s] Loading 0: 74%|███████▎ | 214/291 [00:06<00:02, 32.31it/s] Loading 0: 75%|███████▌ | 219/291 [00:06<00:02, 34.47it/s] Loading 0: 77%|███████▋ | 223/291 [00:06<00:02, 33.39it/s] Loading 0: 78%|███████▊ | 227/291 [00:07<00:01, 34.17it/s] Loading 0: 79%|███████▉ | 231/291 [00:07<00:01, 34.26it/s] Loading 0: 81%|████████ | 235/291 [00:07<00:02, 25.34it/s] Loading 0: 82%|████████▏ | 239/291 [00:07<00:02, 25.27it/s] Loading 0: 85%|████████▍ | 246/291 [00:07<00:01, 33.22it/s] Loading 0: 86%|████████▌ | 250/291 [00:07<00:01, 32.50it/s] Loading 0: 88%|████████▊ | 255/291 [00:07<00:01, 35.38it/s] Loading 0: 89%|████████▉ | 259/291 [00:08<00:00, 33.08it/s] Loading 0: 91%|█████████ | 264/291 [00:08<00:00, 35.65it/s] Loading 0: 92%|█████████▏| 268/291 [00:08<00:00, 34.21it/s] Loading 0: 94%|█████████▍| 273/291 [00:08<00:00, 36.84it/s] Loading 0: 95%|█████████▌| 277/291 [00:08<00:00, 34.81it/s] Loading 0: 97%|█████████▋| 281/291 [00:08<00:00, 35.20it/s] Loading 0: 98%|█████████▊| 286/291 [00:14<00:01, 2.61it/s] Loading 0: 99%|█████████▉| 289/291 [00:14<00:00, 3.24it/s]
Job riverise-dpo-second-v2-mkmlizer completed after 94.96s with status: succeeded
Stopping job with name riverise-dpo-second-v2-mkmlizer
Pipeline stage MKMLizer completed in 95.73s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.11s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service riverise-dpo-second-v2
Waiting for inference service riverise-dpo-second-v2 to be ready
Inference service riverise-dpo-second-v2 ready after 190.42451357841492s
Pipeline stage MKMLDeployer completed in 190.71s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.914672613143921s
Received healthy response to inference request in 2.0294547080993652s
Received healthy response to inference request in 2.277837038040161s
Received healthy response to inference request in 1.7501881122589111s
Received healthy response to inference request in 1.953500509262085s
5 requests
0 failed requests
5th percentile: 1.783085012435913
10th percentile: 1.815981912612915
20th percentile: 1.881775712966919
30th percentile: 1.9224381923675538
40th percentile: 1.9379693508148192
50th percentile: 1.953500509262085
60th percentile: 1.983882188796997
70th percentile: 2.014263868331909
80th percentile: 2.0791311740875242
90th percentile: 2.1784841060638427
95th percentile: 2.228160572052002
99th percentile: 2.267901744842529
mean time: 1.9851305961608887
Pipeline stage StressChecker completed in 10.63s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
starting trigger_guanaco_pipeline %s
Pipeline stage TriggerMKMLProfilingPipeline completed in 1.82s
riverise-dpo-second_v2 status is now deployed due to DeploymentManager action
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLProfilerTemplater
Pipeline stage MKMLProfilerTemplater completed in 0.13s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeployer
Creating inference service riverise-dpo-second-v2-profiler
Waiting for inference service riverise-dpo-second-v2-profiler to be ready
Inference service riverise-dpo-second-v2-profiler ready after 180.42436575889587s
Pipeline stage MKMLProfilerDeployer completed in 180.80s
run pipeline stage %s
Running pipeline stage MKMLProfilerRunner
script pods %s
Pipeline stage MKMLProfilerRunner completed in 0.37s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Checking if service riverise-dpo-second-v2-profiler is running
Tearing down inference service riverise-dpo-second-v2-profiler
Service riverise-dpo-second-v2-profiler has been torndown
Pipeline stage MKMLProfilerDeleter completed in 2.08s
riverise-dpo-second_v2 status is now inactive due to auto deactivation removed underperforming models
riverise-dpo-second_v2 status is now torndown due to DeploymentManager action