submission_id: hastagaras-cupang-12b-test-2_v4
developer_uid: Hastagaras
alignment_samples: 0
best_of: 4
celo_rating: 1208.81
display_name: hastagaras-cupang-12b-test-2_v1
formatter: {'memory_template': "<s>[INST] Act as {bot_name}\n\n{bot_name}'s Persona: {memory}\n\n", 'prompt_template': '{prompt} [/INST]\n', 'bot_template': '{bot_name}: {message}</s>\n', 'user_template': '[INST] {user_name}: {message} [/INST]\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 1.05, 'top_p': 0.95, 'min_p': 0.05, 'top_k': 100, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<'], 'max_input_tokens': 1024, 'best_of': 4, 'max_output_tokens': 64, 'reward_max_token_input': 256}
is_internal_developer: False
language_model: Hastagaras/Cupang-12B-Test-2
max_input_tokens: 1024
max_output_tokens: 64
model_architecture: MistralForCausalLM
model_group: Hastagaras/Cupang-12B-Te
model_name: hastagaras-cupang-12b-test-2_v1
model_num_parameters: 12772070400.0
model_repo: Hastagaras/Cupang-12B-Test-2
model_size: 13B
num_battles: 10565
num_wins: 5401
propriety_score: 0.6996699669966997
propriety_total_count: 909.0
ranking_group: single
reward_formatter: {'bot_template': '{bot_name}: {message}\n', 'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'response_template': '{bot_name}:', 'truncate_by_message': False, 'user_template': '{user_name}: {message}\n'}
reward_repo: ChaiML/gpt2_xl_pairwise_89m_step_347634
status: torndown
submission_type: basic
timestamp: 2024-07-31T05:23:17+00:00
us_pacific_date: 2024-07-30
win_ratio: 0.5112162801703739
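Note: the reported win_ratio appears consistent with num_wins divided by num_battles. The short check below is a sketch under that assumption; the variable names simply mirror the metadata keys above, and the record itself does not state how the platform computes the value.

```python
# Values copied from the submission metadata above.
num_battles = 10565
num_wins = 5401
reported_win_ratio = 0.5112162801703739

# Assumed definition: fraction of battles won.
win_ratio = num_wins / num_battles

print(f"win_ratio: {win_ratio:.6f}")
print(f"agrees with reported value: {abs(win_ratio - reported_win_ratio) < 1e-12}")
```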
Running pipeline stage MKMLizer
Starting job with name hastagaras-cupang-12b-test-2-v4-mkmlizer
Waiting for job on hastagaras-cupang-12b-test-2-v4-mkmlizer to finish
hastagaras-cupang-12b-test-2-v4-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
hastagaras-cupang-12b-test-2-v4-mkmlizer: ║ _____ __ __ ║
hastagaras-cupang-12b-test-2-v4-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
hastagaras-cupang-12b-test-2-v4-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
hastagaras-cupang-12b-test-2-v4-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
hastagaras-cupang-12b-test-2-v4-mkmlizer: ║ /___/ ║
hastagaras-cupang-12b-test-2-v4-mkmlizer: ║ ║
hastagaras-cupang-12b-test-2-v4-mkmlizer: ║ Version: 0.9.7 ║
hastagaras-cupang-12b-test-2-v4-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
hastagaras-cupang-12b-test-2-v4-mkmlizer: ║ https://mk1.ai ║
hastagaras-cupang-12b-test-2-v4-mkmlizer: ║ ║
hastagaras-cupang-12b-test-2-v4-mkmlizer: ║ The license key for the current software has been verified as ║
hastagaras-cupang-12b-test-2-v4-mkmlizer: ║ belonging to: ║
hastagaras-cupang-12b-test-2-v4-mkmlizer: ║ ║
hastagaras-cupang-12b-test-2-v4-mkmlizer: ║ Chai Research Corp. ║
hastagaras-cupang-12b-test-2-v4-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
hastagaras-cupang-12b-test-2-v4-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
hastagaras-cupang-12b-test-2-v4-mkmlizer: ║ ║
hastagaras-cupang-12b-test-2-v4-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
hastagaras-cupang-12b-test-2-v4-mkmlizer: Downloaded to shared memory in 29.341s
hastagaras-cupang-12b-test-2-v4-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp7i_m5mvq, device:0
hastagaras-cupang-12b-test-2-v4-mkmlizer: Saving flywheel model at /dev/shm/model_cache
hastagaras-cupang-12b-test-2-v4-mkmlizer: quantized model in 36.585s
hastagaras-cupang-12b-test-2-v4-mkmlizer: Processed model Hastagaras/Cupang-12B-Test-2 in 65.926s
hastagaras-cupang-12b-test-2-v4-mkmlizer: creating bucket guanaco-mkml-models
hastagaras-cupang-12b-test-2-v4-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
hastagaras-cupang-12b-test-2-v4-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/hastagaras-cupang-12b-test-2-v4
hastagaras-cupang-12b-test-2-v4-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/hastagaras-cupang-12b-test-2-v4/config.json
hastagaras-cupang-12b-test-2-v4-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/hastagaras-cupang-12b-test-2-v4/special_tokens_map.json
hastagaras-cupang-12b-test-2-v4-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/hastagaras-cupang-12b-test-2-v4/tokenizer_config.json
hastagaras-cupang-12b-test-2-v4-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/hastagaras-cupang-12b-test-2-v4/tokenizer.json
hastagaras-cupang-12b-test-2-v4-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/hastagaras-cupang-12b-test-2-v4/flywheel_model.0.safetensors
hastagaras-cupang-12b-test-2-v4-mkmlizer: loading reward model from ChaiML/gpt2_xl_pairwise_89m_step_347634
hastagaras-cupang-12b-test-2-v4-mkmlizer: Loading 0: 99%|█████████▉| 360/363 [00:15<00:00, 27.12it/s]
hastagaras-cupang-12b-test-2-v4-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py:957: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
hastagaras-cupang-12b-test-2-v4-mkmlizer: warnings.warn(
hastagaras-cupang-12b-test-2-v4-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py:785: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
hastagaras-cupang-12b-test-2-v4-mkmlizer: warnings.warn(
hastagaras-cupang-12b-test-2-v4-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:469: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
hastagaras-cupang-12b-test-2-v4-mkmlizer: warnings.warn(
hastagaras-cupang-12b-test-2-v4-mkmlizer: Saving duration: 1.369s
hastagaras-cupang-12b-test-2-v4-mkmlizer: Processed model ChaiML/gpt2_xl_pairwise_89m_step_347634 in 12.759s
hastagaras-cupang-12b-test-2-v4-mkmlizer: creating bucket guanaco-reward-models
hastagaras-cupang-12b-test-2-v4-mkmlizer: Bucket 's3://guanaco-reward-models/' created
hastagaras-cupang-12b-test-2-v4-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/hastagaras-cupang-12b-test-2-v4_reward
hastagaras-cupang-12b-test-2-v4-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/hastagaras-cupang-12b-test-2-v4_reward/config.json
hastagaras-cupang-12b-test-2-v4-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/hastagaras-cupang-12b-test-2-v4_reward/tokenizer_config.json
hastagaras-cupang-12b-test-2-v4-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/hastagaras-cupang-12b-test-2-v4_reward/special_tokens_map.json
hastagaras-cupang-12b-test-2-v4-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/hastagaras-cupang-12b-test-2-v4_reward/merges.txt
hastagaras-cupang-12b-test-2-v4-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/hastagaras-cupang-12b-test-2-v4_reward/vocab.json
hastagaras-cupang-12b-test-2-v4-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/hastagaras-cupang-12b-test-2-v4_reward/tokenizer.json
hastagaras-cupang-12b-test-2-v4-mkmlizer: cp /tmp/reward_cache/reward.tensors s3://guanaco-reward-models/hastagaras-cupang-12b-test-2-v4_reward/reward.tensors
Job hastagaras-cupang-12b-test-2-v4-mkmlizer completed after 118.29s with status: succeeded
Stopping job with name hastagaras-cupang-12b-test-2-v4-mkmlizer
Pipeline stage MKMLizer completed in 119.29s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.10s
Running pipeline stage ISVCDeployer
Creating inference service hastagaras-cupang-12b-test-2-v4
Waiting for inference service hastagaras-cupang-12b-test-2-v4 to be ready
Inference service hastagaras-cupang-12b-test-2-v4 ready after 131.27527785301208s
Pipeline stage ISVCDeployer completed in 133.07s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.539595603942871s
Received healthy response to inference request in 1.4935131072998047s
Received healthy response to inference request in 1.5847983360290527s
Received healthy response to inference request in 1.585867166519165s
Received healthy response to inference request in 1.5403821468353271s
5 requests
0 failed requests
5th percentile: 1.502886915206909
10th percentile: 1.5122607231140137
20th percentile: 1.5310083389282227
30th percentile: 1.5492653846740723
40th percentile: 1.5670318603515625
50th percentile: 1.5847983360290527
60th percentile: 1.5852258682250977
70th percentile: 1.5856534004211427
80th percentile: 1.7766128540039063
90th percentile: 2.158104228973389
95th percentile: 2.34884991645813
99th percentile: 2.501446466445923
mean time: 1.7488312721252441
Pipeline stage StressChecker completed in 9.66s
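Note: the percentile and mean figures reported by StressChecker can be reproduced from the five response times with linearly interpolated percentiles (the same scheme NumPy uses by default). The sketch below is an illustration under that assumption, not the checker's actual code; the `percentile` helper is a hypothetical name introduced here.

```python
import math

# Response times in seconds, copied from the five healthy responses above.
latencies = [
    2.539595603942871,
    1.4935131072998047,
    1.5847983360290527,
    1.585867166519165,
    1.5403821468353271,
]


def percentile(values, q):
    """Return the q-th percentile (q in [0, 100]) using linear interpolation."""
    ordered = sorted(values)
    # Fractional position of the requested percentile in the sorted list.
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_latency = sum(latencies) / len(latencies)

for q in (5, 50, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q):.6f}")
print(f"mean time: {mean_latency:.6f}")
```

With only five samples, the upper percentiles are dominated by the single slow first request (about 2.54 s), so they say little about steady-state latency.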
hastagaras-cupang-12b-test-2_v4 status is now deployed due to DeploymentManager action
hastagaras-cupang-12b-test-2_v4 status is now inactive due to auto deactivation removed underperforming models
admin requested tearing down of hastagaras-cupang-12b-test-2_v4
Running pipeline stage ISVCDeleter
Checking if service hastagaras-cupang-12b-test-2-v4 is running
Tearing down inference service hastagaras-cupang-12b-test-2-v4
Service hastagaras-cupang-12b-test-2-v4 has been torndown
Pipeline stage ISVCDeleter completed in 4.68s
Running pipeline stage MKMLModelDeleter
Cleaning model data from S3
Cleaning model data from model cache
Deleting key hastagaras-cupang-12b-test-2-v4/config.json from bucket guanaco-mkml-models
Deleting key hastagaras-cupang-12b-test-2-v4/flywheel_model.0.safetensors from bucket guanaco-mkml-models
Deleting key hastagaras-cupang-12b-test-2-v4/special_tokens_map.json from bucket guanaco-mkml-models
Deleting key hastagaras-cupang-12b-test-2-v4/tokenizer.json from bucket guanaco-mkml-models
Deleting key hastagaras-cupang-12b-test-2-v4/tokenizer_config.json from bucket guanaco-mkml-models
Cleaning model data from model cache
Deleting key hastagaras-cupang-12b-test-2-v4_reward/config.json from bucket guanaco-reward-models
Deleting key hastagaras-cupang-12b-test-2-v4_reward/merges.txt from bucket guanaco-reward-models
Deleting key hastagaras-cupang-12b-test-2-v4_reward/reward.tensors from bucket guanaco-reward-models
Deleting key hastagaras-cupang-12b-test-2-v4_reward/special_tokens_map.json from bucket guanaco-reward-models
Deleting key hastagaras-cupang-12b-test-2-v4_reward/tokenizer.json from bucket guanaco-reward-models
Deleting key hastagaras-cupang-12b-test-2-v4_reward/tokenizer_config.json from bucket guanaco-reward-models
Deleting key hastagaras-cupang-12b-test-2-v4_reward/vocab.json from bucket guanaco-reward-models
Pipeline stage MKMLModelDeleter completed in 6.36s
hastagaras-cupang-12b-test-2_v4 status is now torndown due to DeploymentManager action
