submission_id: chaiml-sao10k-l3-rp-v3-3_v1
developer_uid: dzhao1
status: inactive
model_repo: ChaiML/sao10k-l3-rp-v3-3
reward_repo: ChaiML/gpt2_medium_pairwise_60m_step_937500
generation_params: {'temperature': 0.95, 'top_p': 0.95, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<|end_header_id|>,', '<|eot_id|>,', '\n\n{user_name}'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n{bot_name}'s Persona: {memory}\n\n", 'prompt_template': '{prompt}<|eot_id|>', 'bot_template': '<|start_header_id|>assistant<|end_header_id|>\n\n{bot_name}: {message}<|eot_id|>', 'user_template': '<|start_header_id|>user<|end_header_id|>\n\n{user_name}: {message}<|eot_id|>', 'response_template': '<|start_header_id|>assistant<|end_header_id|>\n\n{bot_name}:', 'truncate_by_message': False}
reward_formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
timestamp: 2024-07-02T17:39:34+00:00
model_name: chaiml-sao10k-l3-rp-v3-3_v1
model_group: ChaiML/sao10k-l3-rp-v3-3
num_battles: 11815
num_wins: 6713
celo_rating: 1220.66
propriety_score: 0.7261946902654868
propriety_total_count: 5650.0
submission_type: basic
model_architecture: LlamaForCausalLM
model_num_parameters: 8030261248.0
best_of: 8
max_input_tokens: 1024
max_output_tokens: 64
display_name: chaiml-sao10k-l3-rp-v3-3_v1
ineligible_reason: None
language_model: ChaiML/sao10k-l3-rp-v3-3
model_size: 8B
reward_model: ChaiML/gpt2_medium_pairwise_60m_step_937500
us_pacific_date: 2024-07-02
win_ratio: 0.5681760473973763
Resubmit model
Running pipeline stage MKMLizer
Starting job with name chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer
Waiting for job on chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer to finish
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: ║ _____ __ __ ║
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: ║ /___/ ║
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: ║ ║
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: ║ Version: 0.8.14 ║
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: ║ https://mk1.ai ║
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: ║ ║
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: ║ belonging to: ║
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: ║ ║
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: ║ Chai Research Corp. ║
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: ║ Expiration: 2024-07-15 23:59:59 ║
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: ║ ║
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: Downloaded to shared memory in 28.785s
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: quantizing model to /dev/shm/model_cache
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: Loading 0: 0%| | 0/291 [00:00<?, ?it/s] Loading 0: 1%| | 2/291 [00:04<11:09, 2.32s/it] Loading 0: 5%|▍ | 14/291 [00:04<01:09, 4.00it/s] Loading 0: 10%|▉ | 29/291 [00:04<00:26, 9.97it/s] Loading 0: 14%|█▍ | 41/291 [00:04<00:15, 16.01it/s] Loading 0: 19%|█▉ | 56/291 [00:05<00:09, 25.78it/s] Loading 0: 24%|██▎ | 69/291 [00:05<00:07, 27.94it/s] Loading 0: 29%|██▉ | 85/291 [00:05<00:05, 40.52it/s] Loading 0: 33%|███▎ | 97/291 [00:05<00:03, 49.72it/s] Loading 0: 39%|███▉ | 113/291 [00:05<00:02, 64.38it/s] Loading 0: 45%|████▍ | 130/291 [00:05<00:02, 80.48it/s] Loading 0: 49%|████▉ | 144/291 [00:05<00:01, 90.16it/s] Loading 0: 54%|█████▍ | 158/291 [00:06<00:01, 98.91it/s] Loading 0: 59%|█████▉ | 171/291 [00:06<00:01, 68.18it/s] Loading 0: 64%|██████▎ | 185/291 [00:06<00:01, 79.26it/s] Loading 0: 69%|██████▉ | 202/291 [00:06<00:00, 94.18it/s] Loading 0: 74%|███████▍ | 215/291 [00:06<00:00, 100.37it/s] Loading 0: 79%|███████▊ | 229/291 [00:06<00:00, 106.24it/s] Loading 0: 83%|████████▎ | 242/291 [00:06<00:00, 109.95it/s] Loading 0: 88%|████████▊ | 257/291 [00:07<00:00, 115.21it/s] Loading 0: 93%|█████████▎| 270/291 [00:07<00:00, 74.09it/s] Loading 0: 98%|█████████▊| 284/291 [00:07<00:00, 84.52it/s] Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: quantized model in 23.373s
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: Processed model ChaiML/sao10k-l3-rp-v3-3 in 52.158s
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: creating bucket guanaco-mkml-models
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-sao10k-l3-rp-v3-3-v1
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-sao10k-l3-rp-v3-3-v1/special_tokens_map.json
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-sao10k-l3-rp-v3-3-v1/tokenizer_config.json
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-sao10k-l3-rp-v3-3-v1/config.json
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-sao10k-l3-rp-v3-3-v1/tokenizer.json
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-sao10k-l3-rp-v3-3-v1/flywheel_model.0.safetensors
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: loading reward model from ChaiML/gpt2_medium_pairwise_60m_step_937500
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py:919: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: warnings.warn(
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`.
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: warnings.warn(
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py:769: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: warnings.warn(
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:468: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: warnings.warn(
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: Saving model to /tmp/reward_cache/reward.tensors
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: Saving duration: 0.388s
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: Processed model ChaiML/gpt2_medium_pairwise_60m_step_937500 in 3.738s
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: creating bucket guanaco-reward-models
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: Bucket 's3://guanaco-reward-models/' created
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/chaiml-sao10k-l3-rp-v3-3-v1_reward
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/chaiml-sao10k-l3-rp-v3-3-v1_reward/config.json
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/chaiml-sao10k-l3-rp-v3-3-v1_reward/special_tokens_map.json
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/chaiml-sao10k-l3-rp-v3-3-v1_reward/tokenizer_config.json
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/chaiml-sao10k-l3-rp-v3-3-v1_reward/merges.txt
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/chaiml-sao10k-l3-rp-v3-3-v1_reward/vocab.json
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/chaiml-sao10k-l3-rp-v3-3-v1_reward/tokenizer.json
chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer: cp /tmp/reward_cache/reward.tensors s3://guanaco-reward-models/chaiml-sao10k-l3-rp-v3-3-v1_reward/reward.tensors
Job chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer completed after 84.05s with status: succeeded
Stopping job with name chaiml-sao10k-l3-rp-v3-3-v1-mkmlizer
Pipeline stage MKMLizer completed in 84.93s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.12s
Running pipeline stage ISVCDeployer
Creating inference service chaiml-sao10k-l3-rp-v3-3-v1
Waiting for inference service chaiml-sao10k-l3-rp-v3-3-v1 to be ready
Inference service chaiml-sao10k-l3-rp-v3-3-v1 ready after 50.272284746170044s
Pipeline stage ISVCDeployer completed in 57.22s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.0192453861236572s
Received healthy response to inference request in 1.277667760848999s
Received healthy response to inference request in 1.290809154510498s
Received healthy response to inference request in 1.2949192523956299s
Received healthy response to inference request in 1.253584623336792s
5 requests
0 failed requests
5th percentile: 1.2584012508392335
10th percentile: 1.2632178783416748
20th percentile: 1.2728511333465575
30th percentile: 1.2802960395812988
40th percentile: 1.2855525970458985
50th percentile: 1.290809154510498
60th percentile: 1.2924531936645507
70th percentile: 1.2940972328186036
80th percentile: 1.4397844791412355
90th percentile: 1.7295149326324464
95th percentile: 1.8743801593780516
99th percentile: 1.9902723407745362
mean time: 1.4272452354431153
Pipeline stage StressChecker completed in 7.87s
chaiml-sao10k-l3-rp-v3-3_v1 status is now deployed due to DeploymentManager action
chaiml-sao10k-l3-rp-v3-3_v1 status is now inactive due to auto deactivation removed underperforming models

Usage Metrics

Latency Metrics