submission_id: pawankrd-cosmosrp_v71
developer_uid: PawanOsman
alignment_samples: 0
best_of: 16
celo_rating: 1269.66
display_name: pawankrd-cosmosrp_v71
formatter: {'memory_template': "<|start_header_id|>system<|end_header_id|>\n\n{bot_name}'s Persona: {memory}\n\n", 'prompt_template': '{prompt}<|eot_id|>', 'bot_template': '<|start_header_id|>assistant<|end_header_id|>\n\n{bot_name}: {message}<|eot_id|>', 'user_template': '<|start_header_id|>user<|end_header_id|>\n\n{user_name}: {message}<|eot_id|>', 'response_template': '<|start_header_id|>assistant<|end_header_id|>\n\n{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 1.6, 'top_p': 0.95, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['<', '>'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
is_internal_developer: False
language_model: PawanKrd/CosmosRP
max_input_tokens: 512
max_output_tokens: 64
model_architecture: LlamaForCausalLM
model_group: PawanKrd/CosmosRP
model_name: pawankrd-cosmosrp_v71
model_num_parameters: 8030261248.0
model_repo: PawanKrd/CosmosRP
model_size: 8B
num_battles: 24610
num_wins: 15327
propriety_score: 0.7313860252004581
propriety_total_count: 1746.0
ranking_group: single
reward_formatter: {'bot_template': '{bot_name}: {message}\n', 'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'response_template': '{bot_name}:', 'truncate_by_message': False, 'user_template': '{user_name}: {message}\n'}
reward_repo: Jellywibble/gpt2_xl_pairwise_89m_step_347634
status: torndown
submission_type: basic
timestamp: 2024-07-17T21:10:03+00:00
us_pacific_date: 2024-07-17
win_ratio: 0.6227956115400244
Resubmit model
Running pipeline stage MKMLizer
Starting job with name pawankrd-cosmosrp-v71-mkmlizer
Waiting for job on pawankrd-cosmosrp-v71-mkmlizer to finish
pawankrd-cosmosrp-v71-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
pawankrd-cosmosrp-v71-mkmlizer: ║ _____ __ __ ║
pawankrd-cosmosrp-v71-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
pawankrd-cosmosrp-v71-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
pawankrd-cosmosrp-v71-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
pawankrd-cosmosrp-v71-mkmlizer: ║ /___/ ║
pawankrd-cosmosrp-v71-mkmlizer: ║ ║
pawankrd-cosmosrp-v71-mkmlizer: ║ Version: 0.9.5.post2 ║
pawankrd-cosmosrp-v71-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
pawankrd-cosmosrp-v71-mkmlizer: ║ https://mk1.ai ║
pawankrd-cosmosrp-v71-mkmlizer: ║ ║
pawankrd-cosmosrp-v71-mkmlizer: ║ The license key for the current software has been verified as ║
pawankrd-cosmosrp-v71-mkmlizer: ║ belonging to: ║
pawankrd-cosmosrp-v71-mkmlizer: ║ ║
pawankrd-cosmosrp-v71-mkmlizer: ║ Chai Research Corp. ║
pawankrd-cosmosrp-v71-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
pawankrd-cosmosrp-v71-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
pawankrd-cosmosrp-v71-mkmlizer: ║ ║
pawankrd-cosmosrp-v71-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Failed to get response for submission v000000-l3-8b-poppy-moon_1166_v: no entry with id "v000000-l3-8b-poppy-moon_1166_v" found on database!
Failed to get response for submission cgato-l3-thespice-8b-dpo_6378_v2azazelle-l3-tyche-8b-v1-0_v2: no entry with id "cgato-l3-thespice-8b-dpo_6378_v2azazelle-l3-tyche-8b-v1-0_v2" found on database!
pawankrd-cosmosrp-v71-mkmlizer: Downloaded to shared memory in 26.089s
pawankrd-cosmosrp-v71-mkmlizer: quantizing model to /dev/shm/model_cache
pawankrd-cosmosrp-v71-mkmlizer: Saving flywheel model at /dev/shm/model_cache
pawankrd-cosmosrp-v71-mkmlizer: lm_head.weight torch.Size([139542528])
pawankrd-cosmosrp-v71-mkmlizer: model.embed_tokens.weight torch.Size([139542528])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.0.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.0.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.0.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.0.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.0.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.0.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.1.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.1.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.1.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.1.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.1.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.1.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.10.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.10.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.10.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.10.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.10.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.10.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.11.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.11.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.11.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.11.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.11.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.11.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.12.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.12.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.12.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.12.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.12.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.12.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.13.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.13.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.13.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.13.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.13.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.13.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.14.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.14.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.14.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.14.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.14.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.14.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.15.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.15.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.15.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.15.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.15.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.15.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.16.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.16.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.16.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.16.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.16.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.16.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.17.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.17.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.17.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.17.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.17.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.17.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.18.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.18.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.18.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.18.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.18.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.18.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.19.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.19.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.19.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.19.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.19.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.19.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.2.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.2.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.2.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.2.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.2.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.2.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.20.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.20.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.20.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.20.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.20.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.20.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.21.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.21.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.21.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.21.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.21.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.21.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.22.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.22.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.22.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.22.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.22.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.22.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.23.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.23.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.23.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.23.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.23.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.23.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.24.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.24.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.24.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.24.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.24.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.24.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.25.input_layernorm.weight torch.Size([4096])
Failed to get response for submission v000000-l3-8b-poppy-moon_1166_v: no entry with id "v000000-l3-8b-poppy-moon_1166_v" found on database!
pawankrd-cosmosrp-v71-mkmlizer: model.layers.25.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.25.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.25.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.25.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.25.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.26.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.26.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.26.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.26.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.26.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.26.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.27.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.27.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.27.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.27.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.27.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.27.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.28.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.28.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.28.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.28.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.28.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.28.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.29.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.29.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.29.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.29.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.29.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.29.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.3.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.3.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.3.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.3.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.3.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.3.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.30.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.30.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.30.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: Loading 0: 0%| | 0/291 [00:00<?, ?it/s] Loading 0: 0%| | 1/291 [00:05<26:40, 5.52s/it] Loading 0: 1%| | 2/291 [00:05<13:17, 2.76s/it] Loading 0: 1%| | 2/291 [00:05<13:17, 2.76s/it] Loading 0: 1%| | 3/291 [00:05<13:14, 2.76s/it] Loading 0: 2%|▏ | 5/291 [00:05<13:09, 2.76s/it] Loading 0: 2%|▏ | 6/291 [00:05<13:06, 2.76s/it] Loading 0: 3%|▎ | 8/291 [00:05<13:01, 2.76s/it] Loading 0: 3%|▎ | 10/291 [00:05<12:55, 2.76s/it] Loading 0: 4%|▍ | 11/291 [00:05<12:52, 2.76s/it] Loading 0: 4%|▍ | 12/291 [00:05<12:50, 2.76s/it] Loading 0: 4%|▍ | 13/291 [00:05<01:28, 3.13it/s] Loading 0: 5%|▍ | 14/291 [00:05<01:28, 3.13it/s] Loading 0: 5%|▌ | 15/291 [00:05<01:28, 3.13it/s] Loading 0: 6%|▌ | 17/291 [00:05<01:27, 3.13it/s] Loading 0: 7%|▋ | 19/291 [00:05<01:26, 3.13it/s] Loading 0: 7%|▋ | 20/291 [00:05<01:26, 3.13it/s] Loading 0: 7%|▋ | 21/291 [00:05<01:26, 3.13it/s] Loading 0: 8%|▊ | 23/291 [00:05<01:25, 3.13it/s] Loading 0: 8%|▊ | 24/291 [00:05<00:39, 6.82it/s] Loading 0: 8%|▊ | 24/291 [00:05<00:39, 6.82it/s] Loading 0: 9%|▉ | 26/291 [00:05<00:38, 6.82it/s] Loading 0: 10%|▉ | 28/291 [00:05<00:38, 6.82it/s] Loading 0: 10%|▉ | 29/291 [00:05<00:38, 6.82it/s] Loading 0: 10%|█ | 30/291 [00:05<00:38, 6.82it/s] Loading 0: 11%|█ | 32/291 [00:05<00:37, 6.82it/s] Loading 0: 11%|█▏ | 33/291 [00:05<00:37, 6.82it/s] Loading 0: 12%|█▏ | 35/291 [00:05<00:37, 6.82it/s] Loading 0: 13%|█▎ | 37/291 [00:05<00:37, 6.82it/s] Loading 0: 13%|█▎ | 38/291 [00:05<00:37, 6.82it/s] Loading 0: 13%|█▎ | 39/291 [00:05<00:36, 6.82it/s] Loading 0: 14%|█▎ | 40/291 [00:05<00:17, 14.02it/s] Loading 0: 14%|█▍ | 41/291 [00:05<00:17, 14.02it/s] Loading 0: 14%|█▍ | 42/291 [00:05<00:17, 14.02it/s] Loading 0: 15%|█▌ | 44/291 [00:05<00:17, 14.02it/s] Loading 0: 16%|█▌ | 46/291 [00:05<00:17, 14.02it/s] Loading 0: 16%|█▌ | 47/291 [00:05<00:17, 14.02it/s] Loading 0: 16%|█▋ | 48/291 [00:05<00:17, 14.02it/s] Loading 0: 17%|█▋ | 50/291 [00:05<00:17, 14.02it/s] Loading 0: 18%|█▊ | 51/291 [00:05<00:12, 19.71it/s] Loading 0: 18%|█▊ | 51/291 [00:05<00:12, 19.71it/s] Loading 0: 18%|█▊ | 53/291 [00:05<00:12, 19.71it/s] Loading 0: 19%|█▉ | 55/291 [00:05<00:11, 19.71it/s] Loading 0: 19%|█▉ | 56/291 [00:05<00:11, 19.71it/s] Loading 0: 20%|█▉ | 57/291 [00:06<00:11, 19.71it/s] Loading 0: 20%|██ | 59/291 [00:06<00:11, 19.71it/s] Loading 0: 21%|██ | 60/291 [00:06<00:11, 19.71it/s] Loading 0: 21%|██▏ | 62/291 [00:06<00:10, 20.92it/s] Loading 0: 21%|██▏ | 62/291 [00:06<00:10, 20.92it/s] Loading 0: 22%|██▏ | 64/291 [00:06<00:10, 20.92it/s] Loading 0: 22%|██▏ | 65/291 [00:06<00:10, 20.92it/s] Loading 0: 23%|██▎ | 66/291 [00:06<00:10, 20.92it/s] Loading 0: 23%|██▎ | 68/291 [00:06<00:10, 20.92it/s] Loading 0: 24%|██▎ | 69/291 [00:06<00:10, 20.92it/s] Loading 0: 24%|██▍ | 71/291 [00:06<00:10, 20.92it/s] Loading 0: 25%|██▌ | 73/291 [00:06<00:10, 20.92it/s] Loading 0: 25%|██▌ | 74/291 [00:06<00:10, 20.92it/s] Loading 0: 26%|██▌ | 75/291 [00:06<00:10, 20.92it/s] Loading 0: 26%|██▌ | 76/291 [00:06<00:07, 30.47it/s] Loading 0: 26%|██▋ | 77/291 [00:06<00:07, 30.47it/s] Loading 0: 27%|██▋ | 78/291 [00:06<00:06, 30.47it/s] Loading 0: 27%|██▋ | 80/291 [00:06<00:06, 30.47it/s] Loading 0: 28%|██▊ | 82/291 [00:06<00:06, 30.47it/s] Loading 0: 29%|██▊ | 83/291 [00:06<00:06, 30.47it/s] Loading 0: 29%|██▉ | 84/291 [00:06<00:06, 30.47it/s] Loading 0: 30%|██▉ | 86/291 [00:06<00:06, 30.47it/s] Loading 0: 30%|██▉ | 87/291 [00:06<00:05, 38.14it/s] Loading 0: 30%|██▉ | 87/291 [00:06<00:05, 38.14it/s] Loading 0: 31%|███ | 89/291 [00:06<00:05, 38.14it/s] Loading 0: 31%|███▏ | 91/291 [00:06<00:05, 38.14it/s] Loading 0: 32%|███▏ | 92/291 [00:06<00:05, 38.14it/s] Loading 0: 32%|███▏ | 93/291 [00:06<00:05, 38.14it/s] Loading 0: 33%|███▎ | 95/291 [00:06<00:05, 38.14it/s] Loading 0: 33%|███▎ | 96/291 [00:06<00:05, 38.14it/s] Loading 0: 34%|███▎ | 98/291 [00:06<00:05, 38.14it/s] Loading 0: 34%|███▍ | 100/291 [00:06<00:05, 38.14it/s] Loading 0: 35%|███▍ | 101/291 [00:06<00:04, 38.14it/s] Loading 0: 35%|███▌ | 102/291 [00:06<00:04, 38.14it/s] Loading 0: 35%|███▌ | 103/291 [00:06<00:03, 53.30it/s] Loading 0: 36%|███▌ | 104/291 [00:06<00:03, 53.30it/s] Loading 0: 36%|███▌ | 105/291 [00:06<00:03, 53.30it/s] Loading 0: 37%|███▋ | 107/291 [00:06<00:03, 53.30it/s] Loading 0: 37%|███▋ | 109/291 [00:06<00:03, 53.30it/s] Loading 0: 38%|███▊ | 110/291 [00:06<00:03, 53.30it/s] Loading 0: 38%|███▊ | 111/291 [00:06<00:03, 53.30it/s] Loading 0: 39%|███▉ | 113/291 [00:06<00:03, 53.30it/s] Loading 0: 39%|███▉ | 114/291 [00:06<00:03, 53.30it/s] Loading 0: 40%|███▉ | 115/291 [00:06<00:02, 62.60it/s] Loading 0: 40%|███▉ | 116/291 [00:06<00:02, 62.60it/s] Loading 0: 41%|████ | 118/291 [00:06<00:02, 62.60it/s] Loading 0: 41%|████ | 119/291 [00:06<00:02, 62.60it/s] Loading 0: 41%|████ | 120/291 [00:06<00:02, 62.60it/s] Loading 0: 42%|████▏ | 122/291 [00:06<00:02, 62.60it/s] Loading 0: 42%|████▏ | 123/291 [00:06<00:02, 62.60it/s] Loading 0: 43%|████▎ | 125/291 [00:06<00:02, 62.60it/s] Loading 0: 44%|████▎ | 127/291 [00:06<00:02, 62.60it/s] Loading 0: 44%|████▍ | 128/291 [00:06<00:02, 62.60it/s] Loading 0: 44%|████▍ | 129/291 [00:06<00:02, 62.60it/s] Loading 0: 45%|████▍ | 130/291 [00:06<00:02, 77.23it/s] Loading 0: 45%|████▌ | 131/291 [00:07<00:02, 77.23it/s] Loading 0: 45%|████▌ | 132/291 [00:07<00:02, 77.23it/s] Loading 0: 46%|████▌ | 134/291 [00:07<00:02, 77.23it/s] Loading 0: 47%|████▋ | 136/291 [00:07<00:02, 77.23it/s] Loading 0: 47%|████▋ | 137/291 [00:07<00:01, 77.23it/s] Loading 0: 47%|████▋ | 138/291 [00:07<00:01, 77.23it/s] Loading 0: 48%|████▊ | 140/291 [00:07<00:01, 77.23it/s] Loading 0: 48%|████▊ | 141/291 [00:07<00:01, 77.23it/s] Loading 0: 49%|████▉ | 142/291 [00:07<00:01, 83.11it/s] Loading 0: 49%|████▉ | 143/291 [00:07<00:01, 83.11it/s] Loading 0: 50%|████▉ | 145/291 [00:07<00:01, 83.11it/s] Loading 0: 50%|█████ | 146/291 [00:07<00:01, 83.11it/s] Loading 0: 51%|█████ | 147/291 [00:07<00:01, 83.11it/s] Loading 0: 51%|█████ | 149/291 [00:07<00:01, 83.11it/s] Loading 0: 52%|█████▏ | 150/291 [00:07<00:01, 83.11it/s] Loading 0: 52%|█████▏ | 152/291 [00:07<00:01, 83.11it/s] Loading 0: 53%|█████▎ | 154/291 [00:07<00:01, 83.11it/s] Loading 0: 53%|█████▎ | 155/291 [00:07<00:01, 83.11it/s] Loading 0: 54%|█████▎ | 156/291 [00:07<00:01, 83.11it/s] Loading 0: 54%|█████▍ | 157/291 [00:07<00:01, 95.81it/s] Loading 0: 54%|█████▍ | 158/291 [00:07<00:01, 95.81it/s] Loading 0: 55%|█████▍ | 159/291 [00:07<00:01, 95.81it/s] Loading 0: 55%|█████▌ | 161/291 [00:07<00:01, 95.81it/s] Loading 0: 56%|█████▌ | 163/291 [00:07<00:01, 95.81it/s] Loading 0: 56%|█████▋ | 164/291 [00:07<00:01, 95.81it/s] Loading 0: 57%|█████▋ | 165/291 [00:07<00:01, 95.81it/s] Loading 0: 57%|█████▋ | 167/291 [00:07<00:01, 95.81it/s] Loading 0: 58%|█████▊ | 168/291 [00:07<00:01, 95.81it/s] Loading 0: 58%|█████▊ | 170/291 [00:07<00:02, 54.52it/s] Loading 0: 58%|█████▊ | 170/291 [00:07<00:02, 54.52it/s] Loading 0: 59%|█████▉ | 172/291 [00:07<00:02, 54.52it/s] Loading 0: 59%|█████▉ | 173/291 [00:07<00:02, 54.52it/s] Loading 0: 60%|█████▉ | 174/291 [00:07<00:02, 54.52it/s] Loading 0: 60%|██████ | 176/291 [00:07<00:02, 54.52it/s] Loading 0: 61%|██████ | 177/291 [00:07<00:02, 54.52it/s] Loading 0: 62%|██████▏ model.layers.30.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.30.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.30.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.31.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.31.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.31.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.31.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.31.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.31.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.4.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.4.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.4.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.4.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.4.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.4.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.5.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.5.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.5.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.5.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.5.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.5.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.6.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.6.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.6.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.6.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.6.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.6.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.7.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.7.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.7.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.7.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.7.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.7.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.8.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.8.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.8.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.8.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.8.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.8.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.9.input_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.9.mlp.down_proj.weight torch.Size([11927552])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.9.mlp.up_gate_proj.weight torch.Size([23855104])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.9.post_attention_layernorm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.9.self_attn.o_proj.weight torch.Size([3407872])
pawankrd-cosmosrp-v71-mkmlizer: model.layers.9.self_attn.qkv_proj.weight torch.Size([5111808])
pawankrd-cosmosrp-v71-mkmlizer: model.norm.weight torch.Size([4096])
pawankrd-cosmosrp-v71-mkmlizer: | 179/291 [00:07<00:02, 54.52it/s] Loading 0: 62%|██████▏ | 181/291 [00:07<00:02, 54.52it/s] Loading 0: 63%|██████▎ | 182/291 [00:07<00:01, 54.52it/s] Loading 0: 63%|██████▎ | 183/291 [00:07<00:01, 54.52it/s] Loading 0: 63%|██████▎ | 184/291 [00:07<00:01, 66.94it/s] Loading 0: 64%|██████▎ | 185/291 [00:07<00:01, 66.94it/s] Loading 0: 64%|██████▍ | 186/291 [00:07<00:01, 66.94it/s] Loading 0: 65%|██████▍ | 188/291 [00:07<00:01, 66.94it/s] Loading 0: 65%|██████▌ | 190/291 [00:07<00:01, 66.94it/s] Loading 0: 66%|██████▌ | 191/291 [00:07<00:01, 66.94it/s] Loading 0: 66%|██████▌ | 192/291 [00:07<00:01, 66.94it/s] Loading 0: 67%|██████▋ | 194/291 [00:07<00:01, 66.94it/s] Loading 0: 67%|██████▋ | 195/291 [00:07<00:01, 73.36it/s] Loading 0: 67%|██████▋ | 195/291 [00:07<00:01, 73.36it/s] Loading 0: 68%|██████▊ | 197/291 [00:07<00:01, 73.36it/s] Loading 0: 68%|██████▊ | 199/291 [00:07<00:01, 73.36it/s] Loading 0: 69%|██████▊ | 200/291 [00:07<00:01, 73.36it/s] Loading 0: 69%|██████▉ | 201/291 [00:07<00:01, 73.36it/s] Loading 0: 70%|██████▉ | 203/291 [00:07<00:01, 73.36it/s] Loading 0: 70%|███████ | 204/291 [00:07<00:01, 73.36it/s] Loading 0: 71%|███████ | 206/291 [00:07<00:01, 73.36it/s] Loading 0: 71%|███████▏ | 208/291 [00:07<00:01, 73.36it/s] Loading 0: 72%|███████▏ | 209/291 [00:07<00:01, 73.36it/s] Loading 0: 72%|███████▏ | 210/291 [00:08<00:01, 73.36it/s] Loading 0: 73%|███████▎ | 211/291 [00:08<00:00, 89.31it/s] Loading 0: 73%|███████▎ | 212/291 [00:08<00:00, 89.31it/s] Loading 0: 73%|███████▎ | 213/291 [00:08<00:00, 89.31it/s] Loading 0: 74%|███████▍ | 215/291 [00:08<00:00, 89.31it/s] Loading 0: 75%|███████▍ | 217/291 [00:08<00:00, 89.31it/s] Loading 0: 75%|███████▍ | 218/291 [00:08<00:00, 89.31it/s] Loading 0: 75%|███████▌ | 219/291 [00:08<00:00, 89.31it/s] Loading 0: 76%|███████▌ | 221/291 [00:08<00:00, 89.31it/s] Loading 0: 76%|███████▋ | 222/291 [00:08<00:00, 89.31it/s] Loading 0: 77%|███████▋ | 223/291 [00:08<00:00, 92.62it/s] Loading 0: 77%|███████▋ | 224/291 [00:08<00:00, 92.62it/s] Loading 0: 78%|███████▊ | 226/291 [00:08<00:00, 92.62it/s] Loading 0: 78%|███████▊ | 227/291 [00:08<00:00, 92.62it/s] Loading 0: 78%|███████▊ | 228/291 [00:08<00:00, 92.62it/s] Loading 0: 79%|███████▉ | 230/291 [00:08<00:00, 92.62it/s] Loading 0: 79%|███████▉ | 231/291 [00:08<00:00, 92.62it/s] Loading 0: 80%|████████ | 233/291 [00:08<00:00, 92.62it/s] Loading 0: 81%|████████ | 235/291 [00:08<00:00, 92.62it/s] Loading 0: 81%|████████ | 236/291 [00:08<00:00, 92.62it/s] Loading 0: 81%|████████▏ | 237/291 [00:08<00:00, 92.62it/s] Loading 0: 82%|████████▏ | 238/291 [00:08<00:00, 104.10it/s] Loading 0: 82%|████████▏ | 239/291 [00:08<00:00, 104.10it/s] Loading 0: 82%|████████▏ | 240/291 [00:08<00:00, 104.10it/s] Loading 0: 83%|████████▎ | 242/291 [00:08<00:00, 104.10it/s] Loading 0: 84%|████████▍ | 244/291 [00:08<00:00, 104.10it/s] Loading 0: 84%|████████▍ | 245/291 [00:08<00:00, 104.10it/s] Loading 0: 85%|████████▍ | 246/291 [00:08<00:00, 104.10it/s] Loading 0: 85%|████████▌ | 248/291 [00:08<00:00, 104.10it/s] Loading 0: 86%|████████▌ | 249/291 [00:08<00:00, 104.10it/s] Loading 0: 86%|████████▋ | 251/291 [00:08<00:00, 107.66it/s] Loading 0: 86%|████████▋ | 251/291 [00:08<00:00, 107.66it/s] Loading 0: 87%|████████▋ | 253/291 [00:08<00:00, 107.66it/s] Loading 0: 87%|████████▋ | 254/291 [00:08<00:00, 107.66it/s] Loading 0: 88%|████████▊ | 255/291 [00:08<00:00, 107.66it/s] Loading 0: 88%|████████▊ | 257/291 [00:08<00:00, 107.66it/s] Loading 0: 89%|████████▊ | 258/291 [00:08<00:00, 107.66it/s] Loading 0: 89%|████████▉ | 260/291 [00:08<00:00, 107.66it/s] Loading 0: 90%|█████████ | 262/291 [00:08<00:00, 107.66it/s] Loading 0: 90%|█████████ | 263/291 [00:08<00:00, 107.66it/s] Loading 0: 91%|█████████ | 264/291 [00:08<00:00, 107.66it/s] Loading 0: 91%|█████████ | 265/291 [00:08<00:00, 114.63it/s] Loading 0: 91%|█████████▏| 266/291 [00:08<00:00, 114.63it/s] Loading 0: 92%|█████████▏| 267/291 [00:08<00:00, 114.63it/s] Loading 0: 92%|█████████▏| 269/291 [00:08<00:00, 114.63it/s] Loading 0: 93%|█████████▎| 271/291 [00:08<00:00, 114.63it/s] Loading 0: 93%|█████████▎| 272/291 [00:08<00:00, 114.63it/s] Loading 0: 94%|█████████▍| 273/291 [00:08<00:00, 114.63it/s] Loading 0: 95%|█████████▍| 275/291 [00:08<00:00, 114.63it/s] Loading 0: 95%|█████████▍| 276/291 [00:08<00:00, 114.63it/s] Loading 0: 96%|█████████▌| 278/291 [00:08<00:00, 59.99it/s] Loading 0: 96%|█████████▌| 278/291 [00:08<00:00, 59.99it/s] Loading 0: 96%|█████████▌| 280/291 [00:08<00:00, 59.99it/s] Loading 0: 97%|█████████▋| 281/291 [00:08<00:00, 59.99it/s] Loading 0: 97%|█████████▋| 282/291 [00:08<00:00, 59.99it/s] Loading 0: 98%|█████████▊| 284/291 [00:08<00:00, 59.99it/s] Loading 0: 98%|█████████▊| 285/291 [00:08<00:00, 59.99it/s] Loading 0: 99%|█████████▊| 287/291 [00:08<00:00, 59.99it/s] Loading 0: 99%|█████████▉| 289/291 [00:08<00:00, 59.99it/s] Loading 0: 100%|█████████▉| 290/291 [00:08<00:00, 59.99it/s] Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
pawankrd-cosmosrp-v71-mkmlizer: quantized model in 29.027s
pawankrd-cosmosrp-v71-mkmlizer: Processed model PawanKrd/CosmosRP in 55.117s
pawankrd-cosmosrp-v71-mkmlizer: creating bucket guanaco-mkml-models
pawankrd-cosmosrp-v71-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
pawankrd-cosmosrp-v71-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/pawankrd-cosmosrp-v71
pawankrd-cosmosrp-v71-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/pawankrd-cosmosrp-v71/special_tokens_map.json
pawankrd-cosmosrp-v71-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/pawankrd-cosmosrp-v71/tokenizer_config.json
pawankrd-cosmosrp-v71-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/pawankrd-cosmosrp-v71/config.json
pawankrd-cosmosrp-v71-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/pawankrd-cosmosrp-v71/tokenizer.json
pawankrd-cosmosrp-v71-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/pawankrd-cosmosrp-v71/flywheel_model.0.safetensors
pawankrd-cosmosrp-v71-mkmlizer: loading reward model from Jellywibble/gpt2_xl_pairwise_89m_step_347634
pawankrd-cosmosrp-v71-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py:950: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
pawankrd-cosmosrp-v71-mkmlizer: warnings.warn(
pawankrd-cosmosrp-v71-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py:778: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
pawankrd-cosmosrp-v71-mkmlizer: warnings.warn(
pawankrd-cosmosrp-v71-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:469: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
pawankrd-cosmosrp-v71-mkmlizer: warnings.warn(
Failed to get response for submission v000000-l3-8b-poppy-moon_1166_v: no entry with id "v000000-l3-8b-poppy-moon_1166_v" found on database!
Failed to get response for submission cgato-l3-thespice-8b-dpo_6378_v2azazelle-l3-tyche-8b-v1-0_v2: no entry with id "cgato-l3-thespice-8b-dpo_6378_v2azazelle-l3-tyche-8b-v1-0_v2" found on database!
pawankrd-cosmosrp-v71-mkmlizer: Saving model to /tmp/reward_cache/reward.tensors
pawankrd-cosmosrp-v71-mkmlizer: Saving duration: 2.353s
pawankrd-cosmosrp-v71-mkmlizer: Processed model Jellywibble/gpt2_xl_pairwise_89m_step_347634 in 13.456s
pawankrd-cosmosrp-v71-mkmlizer: creating bucket guanaco-reward-models
pawankrd-cosmosrp-v71-mkmlizer: Bucket 's3://guanaco-reward-models/' created
pawankrd-cosmosrp-v71-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/pawankrd-cosmosrp-v71_reward
pawankrd-cosmosrp-v71-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/pawankrd-cosmosrp-v71_reward/config.json
pawankrd-cosmosrp-v71-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/pawankrd-cosmosrp-v71_reward/tokenizer_config.json
pawankrd-cosmosrp-v71-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/pawankrd-cosmosrp-v71_reward/special_tokens_map.json
pawankrd-cosmosrp-v71-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/pawankrd-cosmosrp-v71_reward/merges.txt
pawankrd-cosmosrp-v71-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/pawankrd-cosmosrp-v71_reward/vocab.json
pawankrd-cosmosrp-v71-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/pawankrd-cosmosrp-v71_reward/tokenizer.json
pawankrd-cosmosrp-v71-mkmlizer: cp /tmp/reward_cache/reward.tensors s3://guanaco-reward-models/pawankrd-cosmosrp-v71_reward/reward.tensors
Job pawankrd-cosmosrp-v71-mkmlizer completed after 102.97s with status: succeeded
Stopping job with name pawankrd-cosmosrp-v71-mkmlizer
Pipeline stage MKMLizer completed in 103.81s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.12s
Running pipeline stage ISVCDeployer
Creating inference service pawankrd-cosmosrp-v71
Waiting for inference service pawankrd-cosmosrp-v71 to be ready
Failed to get response for submission v000000-l3-8b-poppy-moon_1166_v: no entry with id "v000000-l3-8b-poppy-moon_1166_v" found on database!
Failed to get response for submission v000000-l3-8b-poppy-moon_1166_v: no entry with id "v000000-l3-8b-poppy-moon_1166_v" found on database!
Inference service pawankrd-cosmosrp-v71 ready after 40.244242906570435s
Pipeline stage ISVCDeployer completed in 47.18s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2899327278137207s
Received healthy response to inference request in 1.493473768234253s
Received healthy response to inference request in 1.5059740543365479s
Received healthy response to inference request in 1.4247121810913086s
Received healthy response to inference request in 1.413893699645996s
5 requests
0 failed requests
5th percentile: 1.4160573959350586
10th percentile: 1.4182210922241212
20th percentile: 1.422548484802246
30th percentile: 1.4384644985198975
40th percentile: 1.4659691333770752
50th percentile: 1.493473768234253
60th percentile: 1.498473882675171
70th percentile: 1.5034739971160889
80th percentile: 1.6627657890319825
90th percentile: 1.9763492584228517
95th percentile: 2.133140993118286
99th percentile: 2.258574380874634
mean time: 1.6255972862243653
Pipeline stage StressChecker completed in 9.35s
pawankrd-cosmosrp_v71 status is now deployed due to DeploymentManager action
pawankrd-cosmosrp_v71 status is now inactive due to auto deactivation removed underperforming models
admin requested tearing down of pawankrd-cosmosrp_v71
Running pipeline stage ISVCDeleter
Checking if service pawankrd-cosmosrp-v71 is running
Skipping teardown as no inference service was found
Pipeline stage ISVCDeleter completed in 4.73s
Running pipeline stage MKMLModelDeleter
Cleaning model data from S3
Cleaning model data from model cache
Deleting key pawankrd-cosmosrp-v71/config.json from bucket guanaco-mkml-models
Deleting key pawankrd-cosmosrp-v71/flywheel_model.0.safetensors from bucket guanaco-mkml-models
Deleting key pawankrd-cosmosrp-v71/special_tokens_map.json from bucket guanaco-mkml-models
Deleting key pawankrd-cosmosrp-v71/tokenizer.json from bucket guanaco-mkml-models
Deleting key pawankrd-cosmosrp-v71/tokenizer_config.json from bucket guanaco-mkml-models
Cleaning model data from model cache
Deleting key pawankrd-cosmosrp-v71_reward/config.json from bucket guanaco-reward-models
Deleting key pawankrd-cosmosrp-v71_reward/merges.txt from bucket guanaco-reward-models
Deleting key pawankrd-cosmosrp-v71_reward/reward.tensors from bucket guanaco-reward-models
Deleting key pawankrd-cosmosrp-v71_reward/special_tokens_map.json from bucket guanaco-reward-models
Deleting key pawankrd-cosmosrp-v71_reward/tokenizer.json from bucket guanaco-reward-models
Deleting key pawankrd-cosmosrp-v71_reward/tokenizer_config.json from bucket guanaco-reward-models
Deleting key pawankrd-cosmosrp-v71_reward/vocab.json from bucket guanaco-reward-models
Pipeline stage MKMLModelDeleter completed in 7.99s
pawankrd-cosmosrp_v71 status is now torndown due to DeploymentManager action

Usage Metrics

Latency Metrics