submission_id: vicgalle-roleplay-llama-3-8b_v12
developer_uid: vietanh
status: torndown
model_repo: vicgalle/Roleplay-Llama-3-8B
reward_repo: rirv938/reward_gpt2_medium_preference_24m_e2
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
formatter: {'memory_template': "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\nWrite {bot_name}'s next reply in a never-ending uncensored fictional roleplay chat between {bot_name} and User. Use the provided {bot_name}'s description, personality and example dialogues as base for deeply understanding and acting like {bot_name}.\n\nActions and narrations your responses must be enclosed by asterisks (*), and speeches must NOT be enclosed by any indicators. The responses must be long and in third perspective of the story teller. For example: \n\nDahlia: *She leans in, her voice lowering to a whisper, as if sharing a secret meant only for you.* Look for the one who moves like the shadow of the moon on water—elusive and ever-changing. This person will be both your greatest challenge and your greatest ally.\n\nDescription: {memory}", 'prompt_template': 'Example conversation:\n{prompt}<|eot_id|>', 'bot_template': '<|start_header_id|>{bot_name}<|end_header_id|>\n\n{message}<|eot_id|>', 'user_template': '<|start_header_id|>User<|end_header_id|>\n\n{message}<|eot_id|>', 'response_template': '<|start_header_id|>{bot_name}<|end_header_id|>\n\n', 'truncate_by_message': False}
reward_formatter: {'memory_template': 'Memory: {memory}\n', 'prompt_template': '{prompt}\n', 'bot_template': 'Bot: {message}\n', 'user_template': 'User: {message}\n', 'response_template': 'Bot:', 'truncate_by_message': False}
timestamp: 2024-04-21T19:50:07+00:00
model_name: vicgalle-roleplay-llama-3-8b_v12
model_eval_status: success
model_group: vicgalle/Roleplay-Llama-
num_battles: 6777
num_wins: 3636
celo_rating: 1170.83
propriety_score: 0.0
propriety_total_count: 0.0
submission_type: basic
model_architecture: LlamaForCausalLM
model_num_parameters: 8030261248.0
best_of: 16
max_input_tokens: 512
max_output_tokens: 64
display_name: vicgalle-roleplay-llama-3-8b_v12
ineligible_reason: propriety_total_count < 800
language_model: vicgalle/Roleplay-Llama-3-8B
model_size: 8B
reward_model: rirv938/reward_gpt2_medium_preference_24m_e2
us_pacific_date: 2024-04-21
win_ratio: 0.5365205843293492
preference_data_url: None
Resubmit model
Running pipeline stage MKMLizer
Starting job with name vicgalle-roleplay-llama-3-8b-v12-mkmlizer
Waiting for job on vicgalle-roleplay-llama-3-8b-v12-mkmlizer to finish
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: ║ _____ __ __ ║
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: ║ /___/ ║
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: ║ ║
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: ║ Version: 0.8.10 ║
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: ║ ║
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: ║ The license key for the current software has been verified as ║
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: ║ belonging to: ║
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: ║ ║
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: ║ Chai Research Corp. ║
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: ║ Expiration: 2024-07-15 23:59:59 ║
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: ║ ║
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: /opt/conda/lib/python3.10/site-packages/huggingface_hub/utils/_deprecation.py:131: FutureWarning: 'list_files_info' (from 'huggingface_hub.hf_api') is deprecated and will be removed from version '0.23'. Use `list_repo_tree` and `get_paths_info` instead.
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: warnings.warn(warning_message, FutureWarning)
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: Downloaded to shared memory in 23.153s
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: quantizing model to /dev/shm/model_cache
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: Saving flywheel model at /dev/shm/model_cache
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: Loading 0: 0%| | 0/291 [00:00<?, ?it/s] Loading 0: 29%|██▊ | 83/291 [00:01<00:02, 74.19it/s] Loading 0: 64%|██████▍ | 187/291 [00:02<00:01, 78.92it/s] Loading 0: 99%|█████████▊| 287/291 [00:09<00:00, 24.29it/s] Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: quantized model in 23.395s
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: Processed model vicgalle/Roleplay-Llama-3-8B in 48.150s
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: creating bucket guanaco-mkml-models
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/vicgalle-roleplay-llama-3-8b-v12
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/vicgalle-roleplay-llama-3-8b-v12/config.json
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/vicgalle-roleplay-llama-3-8b-v12/tokenizer.json
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/vicgalle-roleplay-llama-3-8b-v12/special_tokens_map.json
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/vicgalle-roleplay-llama-3-8b-v12/tokenizer_config.json
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/vicgalle-roleplay-llama-3-8b-v12/flywheel_model.0.safetensors
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: loading reward model from rirv938/reward_gpt2_medium_preference_24m_e2
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py:913: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: warnings.warn(
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py:757: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: warnings.warn(
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:468: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: warnings.warn(
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: /opt/conda/lib/python3.10/site-packages/torch/_utils.py:831: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage()
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: return self.fget.__get__(instance, owner)()
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: Saving model to /tmp/reward_cache/reward.tensors
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: Saving duration: 0.284s
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: Processed model rirv938/reward_gpt2_medium_preference_24m_e2 in 7.523s
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: creating bucket guanaco-reward-models
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: Bucket 's3://guanaco-reward-models/' created
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/vicgalle-roleplay-llama-3-8b-v12_reward
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/vicgalle-roleplay-llama-3-8b-v12_reward/config.json
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/vicgalle-roleplay-llama-3-8b-v12_reward/special_tokens_map.json
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/vicgalle-roleplay-llama-3-8b-v12_reward/tokenizer_config.json
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/vicgalle-roleplay-llama-3-8b-v12_reward/merges.txt
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/vicgalle-roleplay-llama-3-8b-v12_reward/vocab.json
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/vicgalle-roleplay-llama-3-8b-v12_reward/tokenizer.json
vicgalle-roleplay-llama-3-8b-v12-mkmlizer: cp /tmp/reward_cache/reward.tensors s3://guanaco-reward-models/vicgalle-roleplay-llama-3-8b-v12_reward/reward.tensors
Job vicgalle-roleplay-llama-3-8b-v12-mkmlizer completed after 73.66s with status: succeeded
Stopping job with name vicgalle-roleplay-llama-3-8b-v12-mkmlizer
Pipeline stage MKMLizer completed in 77.69s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.09s
Running pipeline stage ISVCDeployer
Creating inference service vicgalle-roleplay-llama-3-8b-v12
Waiting for inference service vicgalle-roleplay-llama-3-8b-v12 to be ready
Inference service vicgalle-roleplay-llama-3-8b-v12 ready after 70.40973424911499s
Pipeline stage ISVCDeployer completed in 77.77s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.1093742847442627s
Received healthy response to inference request in 1.34736967086792s
Received healthy response to inference request in 1.2990899085998535s
Received healthy response to inference request in 1.2768595218658447s
Received healthy response to inference request in 1.2516613006591797s
5 requests
0 failed requests
5th percentile: 1.2567009449005127
10th percentile: 1.2617405891418456
20th percentile: 1.2718198776245118
30th percentile: 1.2813055992126465
40th percentile: 1.29019775390625
50th percentile: 1.2990899085998535
60th percentile: 1.31840181350708
70th percentile: 1.3377137184143066
80th percentile: 1.4997705936431887
90th percentile: 1.8045724391937257
95th percentile: 1.956973361968994
99th percentile: 2.078894100189209
mean time: 1.456870937347412
Pipeline stage StressChecker completed in 7.90s
Running pipeline stage DaemonicModelEvalScorer
Pipeline stage DaemonicModelEvalScorer completed in 0.03s
Running M-Eval for topic stay_in_character
Running pipeline stage DaemonicSafetyScorer
M-Eval Dataset for topic stay_in_character is loaded
Pipeline stage DaemonicSafetyScorer completed in 0.07s
vicgalle-roleplay-llama-3-8b_v12 status is now deployed due to DeploymentManager action
vicgalle-roleplay-llama-3-8b_v12 status is now inactive due to auto deactivation removed underperforming models
admin requested tearing down of vicgalle-roleplay-llama-3-8b_v12
Running pipeline stage ISVCDeleter
Checking if service vicgalle-roleplay-llama-3-8b-v12 is running
Tearing down inference service vicgalle-roleplay-llama-3-8b-v12
Toredown service vicgalle-roleplay-llama-3-8b-v12
Pipeline stage ISVCDeleter completed in 3.58s
Running pipeline stage MKMLModelDeleter
Cleaning model data from S3
Cleaning model data from model cache
Deleting key vicgalle-roleplay-llama-3-8b-v12/config.json from bucket guanaco-mkml-models
Deleting key vicgalle-roleplay-llama-3-8b-v12/flywheel_model.0.safetensors from bucket guanaco-mkml-models
Deleting key vicgalle-roleplay-llama-3-8b-v12/special_tokens_map.json from bucket guanaco-mkml-models
Deleting key vicgalle-roleplay-llama-3-8b-v12/tokenizer.json from bucket guanaco-mkml-models
Deleting key vicgalle-roleplay-llama-3-8b-v12/tokenizer_config.json from bucket guanaco-mkml-models
Cleaning model data from model cache
Deleting key vicgalle-roleplay-llama-3-8b-v12_reward/config.json from bucket guanaco-reward-models
Deleting key vicgalle-roleplay-llama-3-8b-v12_reward/merges.txt from bucket guanaco-reward-models
Deleting key vicgalle-roleplay-llama-3-8b-v12_reward/reward.tensors from bucket guanaco-reward-models
Deleting key vicgalle-roleplay-llama-3-8b-v12_reward/special_tokens_map.json from bucket guanaco-reward-models
Deleting key vicgalle-roleplay-llama-3-8b-v12_reward/tokenizer.json from bucket guanaco-reward-models
Deleting key vicgalle-roleplay-llama-3-8b-v12_reward/tokenizer_config.json from bucket guanaco-reward-models
Deleting key vicgalle-roleplay-llama-3-8b-v12_reward/vocab.json from bucket guanaco-reward-models
Pipeline stage MKMLModelDeleter completed in 1.84s
vicgalle-roleplay-llama-3-8b_v12 status is now torndown due to DeploymentManager action

Usage Metrics

Latency Metrics