developer_uid: Trace2333
submission_id: trace2333-ultra-dol-fd-_4247_v14
model_name: trace2333-ultra-dol-fd-_4247_v14
model_group: Trace2333/ultra_dol_fd_r
status: torndown
timestamp: 2024-08-24T12:44:21+00:00
num_battles: 6890
num_wins: 3543
celo_rating: 1244.17
family_friendly_score: 0.0
submission_type: basic
model_repo: Trace2333/ultra_dol_fd_r64a32_qkvo_epoch6_v1
model_architecture: LlamaForCausalLM
model_num_parameters: 8030261248.0
best_of: 16
max_input_tokens: 512
max_output_tokens: 64
display_name: trace2333-ultra-dol-fd-_4247_v14
is_internal_developer: False
language_model: Trace2333/ultra_dol_fd_r64a32_qkvo_epoch6_v1
model_size: 8B
ranking_group: single
us_pacific_date: 2024-08-24
win_ratio: 0.5142235123367199
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<|eot_id|>'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
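The `formatter` templates above can be read as a recipe for building the model input. The sketch below is one plausible assembly order (persona, then scenario prompt, then chat turns, then the response prefix); the order, the `build_prompt` helper, and the sample names and messages are assumptions for illustration, not taken from the platform's code. Truncation to `max_input_tokens` is not modelled.

```python
# Template strings copied from the `formatter` field of this submission.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Assemble one input string (hypothetical helper).

    `turns` is a list of (speaker, message) pairs, speaker being "bot" or
    "user". The assembly order is an assumption, not confirmed by the log.
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response prefix has no trailing newline, so the model continues the line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Made-up conversation, purely for illustration.
example = build_prompt(
    bot_name="Ava",
    user_name="Sam",
    memory="A cheerful guide.",
    prompt="Ava greets a traveller.",
    turns=[("user", "Hi!"), ("bot", "Hello there."), ("user", "Where am I?")],
)
print(example)
```

Because `stopping_words` contains `'\n'`, generation under this layout would end at the first newline, so each reply is a single line following the `{bot_name}:` prefix.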
Resubmit model
Running pipeline stage MKMLizer
Starting job with name trace2333-ultra-dol-fd-4247-v14-mkmlizer
Waiting for job on trace2333-ultra-dol-fd-4247-v14-mkmlizer to finish
trace2333-ultra-dol-fd-4247-v14-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
trace2333-ultra-dol-fd-4247-v14-mkmlizer: ║ flywheel ║
trace2333-ultra-dol-fd-4247-v14-mkmlizer: ║ ║
trace2333-ultra-dol-fd-4247-v14-mkmlizer: ║ Version: 0.10.1 ║
trace2333-ultra-dol-fd-4247-v14-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
trace2333-ultra-dol-fd-4247-v14-mkmlizer: ║ https://mk1.ai ║
trace2333-ultra-dol-fd-4247-v14-mkmlizer: ║ ║
trace2333-ultra-dol-fd-4247-v14-mkmlizer: ║ The license key for the current software has been verified as ║
trace2333-ultra-dol-fd-4247-v14-mkmlizer: ║ belonging to: ║
trace2333-ultra-dol-fd-4247-v14-mkmlizer: ║ ║
trace2333-ultra-dol-fd-4247-v14-mkmlizer: ║ Chai Research Corp. ║
trace2333-ultra-dol-fd-4247-v14-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
trace2333-ultra-dol-fd-4247-v14-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
trace2333-ultra-dol-fd-4247-v14-mkmlizer: ║ ║
trace2333-ultra-dol-fd-4247-v14-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
trace2333-ultra-dol-fd-4247-v14-mkmlizer: Downloaded to shared memory in 45.019s
trace2333-ultra-dol-fd-4247-v14-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp3yhrd0dw, device:0
trace2333-ultra-dol-fd-4247-v14-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Failed to get response for submission blend_lobuf_2024-08-22: ('http://zonemercy-lexical-nemov8-5966-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'EOF\n')
trace2333-ultra-dol-fd-4247-v14-mkmlizer: quantized model in 29.374s
trace2333-ultra-dol-fd-4247-v14-mkmlizer: Processed model Trace2333/ultra_dol_fd_r64a32_qkvo_epoch6_v1 in 74.394s
trace2333-ultra-dol-fd-4247-v14-mkmlizer: creating bucket guanaco-mkml-models
trace2333-ultra-dol-fd-4247-v14-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
trace2333-ultra-dol-fd-4247-v14-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/trace2333-ultra-dol-fd-4247-v14
trace2333-ultra-dol-fd-4247-v14-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/trace2333-ultra-dol-fd-4247-v14/config.json
trace2333-ultra-dol-fd-4247-v14-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/trace2333-ultra-dol-fd-4247-v14/special_tokens_map.json
trace2333-ultra-dol-fd-4247-v14-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/trace2333-ultra-dol-fd-4247-v14/tokenizer_config.json
trace2333-ultra-dol-fd-4247-v14-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/trace2333-ultra-dol-fd-4247-v14/tokenizer.json
trace2333-ultra-dol-fd-4247-v14-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/trace2333-ultra-dol-fd-4247-v14/flywheel_model.0.safetensors
trace2333-ultra-dol-fd-4247-v14-mkmlizer: Loading 0: 99%|█████████▉| 289/291 [00:14<00:00, 3.20it/s]
Job trace2333-ultra-dol-fd-4247-v14-mkmlizer completed after 99.65s with status: succeeded
Stopping job with name trace2333-ultra-dol-fd-4247-v14-mkmlizer
Pipeline stage MKMLizer completed in 100.69s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.09s
Running pipeline stage ISVCDeployer
Creating inference service trace2333-ultra-dol-fd-4247-v14
Waiting for inference service trace2333-ultra-dol-fd-4247-v14 to be ready
Failed to get response for submission blend_dedat_2024-08-16: ('http://zonemercy-graft-cogent-v-7573-v6-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'read tcp 127.0.0.1:52388->127.0.0.1:8080: read: connection reset by peer\n')
Failed to get response for submission blend_lobuf_2024-08-22: ('http://zonemercy-lexical-nemov8-5966-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'EOF\n')
Failed to get response for submission blend_berib_2024-08-16: ('http://zonemercy-lexical-nemov8-5966-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'upstream connect error or disconnect/reset before headers. reset reason: connection termination')
Connection pool is full, discarding connection: %s. Connection pool size: %s
Failed to get response for submission blend_lobuf_2024-08-22: ('http://zonemercy-lexical-nemov8-5966-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'EOF\n')
Inference service trace2333-ultra-dol-fd-4247-v14 ready after 283.03944754600525s
Pipeline stage ISVCDeployer completed in 283.92s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.416929244995117s
Received healthy response to inference request in 4.073570489883423s
Received healthy response to inference request in 2.4684197902679443s
Received healthy response to inference request in 2.083529472351074s
Received healthy response to inference request in 2.6958727836608887s
5 requests
0 failed requests
5th percentile: 2.150209426879883
10th percentile: 2.2168893814086914
20th percentile: 2.3502492904663086
30th percentile: 2.4272273540496827
40th percentile: 2.4478235721588133
50th percentile: 2.4684197902679443
60th percentile: 2.5594009876251222
70th percentile: 2.6503821849822997
80th percentile: 2.971412324905396
90th percentile: 3.5224914073944094
95th percentile: 3.7980309486389157
99th percentile: 4.018462581634521
mean time: 2.7476643562316894
Pipeline stage StressChecker completed in 14.49s
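The StressChecker figures above can be reproduced from the five logged latencies. The log does not say how the percentiles were computed; the values are consistent with linear interpolation between sorted samples (the default behaviour of NumPy's `percentile`), so that method is assumed in this sketch. The `percentile` helper is hypothetical and uses only the standard library.

```python
# Latencies (seconds) copied from the five "healthy response" lines.
latencies = [
    2.416929244995117,
    4.073570489883423,
    2.4684197902679443,
    2.083529472351074,
    2.6958727836608887,
]


def percentile(values, q):
    """q-th percentile (0-100) using linear interpolation between order statistics."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    frac = pos - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


mean_time = sum(latencies) / len(latencies)

for q in (5, 50, 90, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only five samples, the upper percentiles are dominated by the single 4.07 s request, so they say little about tail latency under real load.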
trace2333-ultra-dol-fd-_4247_v14 status is now deployed due to DeploymentManager action
trace2333-ultra-dol-fd-_4247_v14 status is now inactive due to auto deactivation removed underperforming models
trace2333-ultra-dol-fd-_4247_v14 status is now torndown due to DeploymentManager action
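To see where the deployment time went, the "Pipeline stage ... completed in ...s" lines can be parsed and summed. This is a minimal sketch: the regular expression and the `stage_durations` helper are illustrative assumptions about the line format seen in this log, not part of any platform tooling.

```python
import re

# Stage-completion lines copied from this log.
LOG_LINES = [
    "Pipeline stage MKMLizer completed in 100.69s",
    "Pipeline stage MKMLKubeTemplater completed in 0.09s",
    "Pipeline stage ISVCDeployer completed in 283.92s",
    "Pipeline stage StressChecker completed in 14.49s",
]

# Assumed line shape: "Pipeline stage <name> completed in <seconds>s".
STAGE_RE = re.compile(r"^Pipeline stage (\w+) completed in ([\d.]+)s$")


def stage_durations(lines):
    """Map each stage name to its reported duration in seconds."""
    durations = {}
    for line in lines:
        match = STAGE_RE.match(line)
        if match:
            durations[match.group(1)] = float(match.group(2))
    return durations


durations = stage_durations(LOG_LINES)
total = sum(durations.values())
slowest = max(durations, key=durations.get)
print(f"total: {total:.2f}s, slowest stage: {slowest}")
```

For this run the ISVCDeployer stage (waiting for the inference service to become ready) accounts for most of the total; quantization and upload in MKMLizer come second.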