developer_uid: Trace2333
submission_id: trace2333-dpo-v9-v1-reprompt_v2
model_name: trace2333-dpo-v9-v1-reprompt_v2
model_group: Trace2333/dpo_v9_v1_repr
status: torndown
timestamp: 2024-08-26T03:30:02+00:00
num_battles: 12069
num_wins: 6240
celo_rating: 1248.52
family_friendly_score: 0.0
submission_type: basic
model_repo: Trace2333/dpo_v9_v1_reprompt
model_architecture: LlamaForCausalLM
model_num_parameters: 8030261248.0
best_of: 16
max_input_tokens: 512
max_output_tokens: 64
display_name: trace2333-dpo-v9-v1-reprompt_v2
is_internal_developer: False
language_model: Trace2333/dpo_v9_v1_reprompt
model_size: 8B
ranking_group: single
us_pacific_date: 2024-08-25
win_ratio: 0.5170270942083023
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<|eot_id|>'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
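Note: the formatter above only lists the templates. As a rough illustration of how they might combine into one model input, here is a minimal sketch. The function name `build_prompt`, the assembly order (memory, prompt, turns, response prefix) and the sample values are assumptions for illustration, not the pipeline's confirmed behaviour; truncation to max_input_tokens is not modelled.

```python
# Templates copied from the formatter entry above.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template leaves the bot's line open for the model to complete.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    # Sample values are made up for the demonstration.
    print(build_prompt("Ava", "Sam", "A friendly guide.", "A chat in a cafe.",
                       [("user", "Hi"), ("bot", "Hello!")]))
```

Because '\n' is one of the stopping_words in generation_params, a prompt that ends with the open "{bot_name}:" prefix would yield a single-line reply under this reading.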
Resubmit model
Running pipeline stage MKMLizer
Starting job with name trace2333-dpo-v9-v1-reprompt-v2-mkmlizer
Waiting for job on trace2333-dpo-v9-v1-reprompt-v2-mkmlizer to finish
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: ║ ║
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: ║ Version: 0.10.1 ║
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: ║ https://mk1.ai ║
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: ║ ║
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: ║ The license key for the current software has been verified as ║
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: ║ belonging to: ║
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: ║ ║
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: ║ Chai Research Corp. ║
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: ║ ║
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Connection pool is full, discarding connection: %s. Connection pool size: %s
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: Downloaded to shared memory in 45.714s
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpfj1qtprd, device:0
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: Saving flywheel model at /dev/shm/model_cache
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: quantized model in 29.679s
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: Processed model Trace2333/dpo_v9_v1_reprompt in 75.394s
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: creating bucket guanaco-mkml-models
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/trace2333-dpo-v9-v1-reprompt-v2
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/trace2333-dpo-v9-v1-reprompt-v2/tokenizer_config.json
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/trace2333-dpo-v9-v1-reprompt-v2/tokenizer.json
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/trace2333-dpo-v9-v1-reprompt-v2/flywheel_model.0.safetensors
trace2333-dpo-v9-v1-reprompt-v2-mkmlizer: Loading 0: 99%|█████████▉| 289/291 [00:14<00:00, 3.17it/s]
Connection pool is full, discarding connection: %s. Connection pool size: %s
Job trace2333-dpo-v9-v1-reprompt-v2-mkmlizer completed after 94.02s with status: succeeded
Stopping job with name trace2333-dpo-v9-v1-reprompt-v2-mkmlizer
Pipeline stage MKMLizer completed in 95.28s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.09s
Running pipeline stage ISVCDeployer
Creating inference service trace2333-dpo-v9-v1-reprompt-v2
Waiting for inference service trace2333-dpo-v9-v1-reprompt-v2 to be ready
Failed to get response for submission chaiml-llama-8b-pairwis_8189_v19: ('http://chaiml-llama-8b-pairwis-8189-v19-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '{"error":"ValueError : [TypeError(\\"\'numpy.int64\' object is not iterable\\"), TypeError(\'vars() argument must have __dict__ attribute\')]"}')
Connection pool is full, discarding connection: %s. Connection pool size: %s
Inference service trace2333-dpo-v9-v1-reprompt-v2 ready after 160.58318901062012s
Pipeline stage ISVCDeployer completed in 160.98s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.9934217929840088s
Received healthy response to inference request in 1.657329797744751s
Received healthy response to inference request in 2.9331886768341064s
Received healthy response to inference request in 1.5069358348846436s
Received healthy response to inference request in 1.6242051124572754s
5 requests
0 failed requests
5th percentile: 1.5303896903991698
10th percentile: 1.5538435459136963
20th percentile: 1.6007512569427491
30th percentile: 1.6308300495147705
40th percentile: 1.6440799236297607
50th percentile: 1.657329797744751
60th percentile: 1.791766595840454
70th percentile: 1.9262033939361571
80th percentile: 2.1813751697540287
90th percentile: 2.5572819232940676
95th percentile: 2.7452353000640866
99th percentile: 2.8955980014801024
mean time: 1.943016242980957
Pipeline stage StressChecker completed in 10.53s
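Note: the percentile and mean figures reported by StressChecker are consistent with linear interpolation between the sorted response times of the five requests. The sketch below reproduces them from the logged latencies. The helper name `percentile` and the choice of interpolation method are assumptions inferred from the numbers, not taken from the StressChecker source.

```python
import math

# Response times (seconds) copied from the five "healthy response" lines above.
LATENCIES = [
    1.9934217929840088,
    1.657329797744751,
    2.9331886768341064,
    1.5069358348846436,
    1.6242051124572754,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation
    between the two nearest ranks of the sorted values."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(LATENCIES, q)}")
    print(f"mean time: {sum(LATENCIES) / len(LATENCIES)}")
```

With only five samples, the upper percentiles are dominated by the single slowest request (about 2.93s), so they say little about tail latency under real load.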
trace2333-dpo-v9-v1-reprompt_v2 status is now deployed due to DeploymentManager action
trace2333-dpo-v9-v1-reprompt_v2 status is now inactive due to auto deactivation removed underperforming models
trace2333-dpo-v9-v1-reprompt_v2 status is now torndown due to DeploymentManager action