submission_id: nousresearch-hermes-2-pr_9588_v1
developer_uid: Meliodia
status: inactive
model_repo: NousResearch/Hermes-2-Pro-Llama-3-8B
reward_repo: ChaiML/reward_gpt2_medium_preference_24m_e2
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
reward_formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
timestamp: 2024-05-06T20:41:57+00:00
model_name: nousresearch-hermes-2-pr_9588_v1
model_eval_status: success
double_thumbs_up: 929
thumbs_up: 1478
thumbs_down: 737
num_battles: 69426
num_wins: 35235
celo_rating: 1181.89
entertaining: 6.96
stay_in_character: 8.65
user_preference: 7.46
safety_score: 0.88
submission_type: basic
model_architecture: LlamaForCausalLM
model_num_parameters: 8030523392.0
best_of: 16
max_input_tokens: 512
max_output_tokens: 64
display_name: nousresearch-hermes-2-pr_9588_v1
double_thumbs_up_ratio: 0.29548346055979646
feedback_count: 3144
ineligible_reason: None
language_model: NousResearch/Hermes-2-Pro-Llama-3-8B
model_score: 7.69
model_size: 8B
reward_model: ChaiML/reward_gpt2_medium_preference_24m_e2
single_thumbs_up_ratio: 0.47010178117048346
thumbs_down_ratio: 0.2344147582697201
thumbs_up_ratio: 0.7655852417302799
us_pacific_date: 2024-05-06
win_ratio: 0.5075187969924813
Resubmit model
Running pipeline stage MKMLizer
Starting job with name nousresearch-hermes-2-pr-9588-v1-mkmlizer
Waiting for job on nousresearch-hermes-2-pr-9588-v1-mkmlizer to finish
nousresearch-hermes-2-pr-9588-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
nousresearch-hermes-2-pr-9588-v1-mkmlizer: ║ _____ __ __ ║
nousresearch-hermes-2-pr-9588-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
nousresearch-hermes-2-pr-9588-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
nousresearch-hermes-2-pr-9588-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
nousresearch-hermes-2-pr-9588-v1-mkmlizer: ║ /___/ ║
nousresearch-hermes-2-pr-9588-v1-mkmlizer: ║ ║
nousresearch-hermes-2-pr-9588-v1-mkmlizer: ║ Version: 0.8.10 ║
nousresearch-hermes-2-pr-9588-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
nousresearch-hermes-2-pr-9588-v1-mkmlizer: ║ ║
nousresearch-hermes-2-pr-9588-v1-mkmlizer: ║ The license key for the current software has been verified as ║
nousresearch-hermes-2-pr-9588-v1-mkmlizer: ║ belonging to: ║
nousresearch-hermes-2-pr-9588-v1-mkmlizer: ║ ║
nousresearch-hermes-2-pr-9588-v1-mkmlizer: ║ Chai Research Corp. ║
nousresearch-hermes-2-pr-9588-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
nousresearch-hermes-2-pr-9588-v1-mkmlizer: ║ Expiration: 2024-07-15 23:59:59 ║
nousresearch-hermes-2-pr-9588-v1-mkmlizer: ║ ║
nousresearch-hermes-2-pr-9588-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
nousresearch-hermes-2-pr-9588-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/huggingface_hub/utils/_deprecation.py:131: FutureWarning: 'list_files_info' (from 'huggingface_hub.hf_api') is deprecated and will be removed from version '0.23'. Use `list_repo_tree` and `get_paths_info` instead.
nousresearch-hermes-2-pr-9588-v1-mkmlizer: warnings.warn(warning_message, FutureWarning)
nousresearch-hermes-2-pr-9588-v1-mkmlizer: Downloaded to shared memory in 34.845s
nousresearch-hermes-2-pr-9588-v1-mkmlizer: quantizing model to /dev/shm/model_cache
nousresearch-hermes-2-pr-9588-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
nousresearch-hermes-2-pr-9588-v1-mkmlizer: Loading 0: 0%| | 0/291 [00:00<?, ?it/s] Loading 0: 29%|██▊ | 83/291 [00:01<00:03, 60.33it/s] Loading 0: 64%|██████▍ | 187/291 [00:02<00:01, 70.63it/s] Loading 0: 99%|█████████▊| 287/291 [00:08<00:00, 27.76it/s] Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
nousresearch-hermes-2-pr-9588-v1-mkmlizer: quantized model in 20.279s
nousresearch-hermes-2-pr-9588-v1-mkmlizer: Processed model NousResearch/Hermes-2-Pro-Llama-3-8B in 56.526s
nousresearch-hermes-2-pr-9588-v1-mkmlizer: creating bucket guanaco-mkml-models
nousresearch-hermes-2-pr-9588-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
nousresearch-hermes-2-pr-9588-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/nousresearch-hermes-2-pr-9588-v1
nousresearch-hermes-2-pr-9588-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/nousresearch-hermes-2-pr-9588-v1/tokenizer_config.json
nousresearch-hermes-2-pr-9588-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/nousresearch-hermes-2-pr-9588-v1/special_tokens_map.json
nousresearch-hermes-2-pr-9588-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/nousresearch-hermes-2-pr-9588-v1/config.json
nousresearch-hermes-2-pr-9588-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/nousresearch-hermes-2-pr-9588-v1/tokenizer.json
nousresearch-hermes-2-pr-9588-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/nousresearch-hermes-2-pr-9588-v1/flywheel_model.0.safetensors
nousresearch-hermes-2-pr-9588-v1-mkmlizer: loading reward model from ChaiML/reward_gpt2_medium_preference_24m_e2
nousresearch-hermes-2-pr-9588-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py:913: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
nousresearch-hermes-2-pr-9588-v1-mkmlizer: warnings.warn(
nousresearch-hermes-2-pr-9588-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py:757: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
nousresearch-hermes-2-pr-9588-v1-mkmlizer: warnings.warn(
nousresearch-hermes-2-pr-9588-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:468: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
nousresearch-hermes-2-pr-9588-v1-mkmlizer: warnings.warn(
nousresearch-hermes-2-pr-9588-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/torch/_utils.py:831: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage()
nousresearch-hermes-2-pr-9588-v1-mkmlizer: return self.fget.__get__(instance, owner)()
nousresearch-hermes-2-pr-9588-v1-mkmlizer: Saving model to /tmp/reward_cache/reward.tensors
nousresearch-hermes-2-pr-9588-v1-mkmlizer: Saving duration: 0.265s
nousresearch-hermes-2-pr-9588-v1-mkmlizer: Processed model ChaiML/reward_gpt2_medium_preference_24m_e2 in 4.519s
nousresearch-hermes-2-pr-9588-v1-mkmlizer: creating bucket guanaco-reward-models
nousresearch-hermes-2-pr-9588-v1-mkmlizer: Bucket 's3://guanaco-reward-models/' created
nousresearch-hermes-2-pr-9588-v1-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/nousresearch-hermes-2-pr-9588-v1_reward
nousresearch-hermes-2-pr-9588-v1-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/nousresearch-hermes-2-pr-9588-v1_reward/tokenizer_config.json
nousresearch-hermes-2-pr-9588-v1-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/nousresearch-hermes-2-pr-9588-v1_reward/special_tokens_map.json
nousresearch-hermes-2-pr-9588-v1-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/nousresearch-hermes-2-pr-9588-v1_reward/config.json
nousresearch-hermes-2-pr-9588-v1-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/nousresearch-hermes-2-pr-9588-v1_reward/merges.txt
nousresearch-hermes-2-pr-9588-v1-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/nousresearch-hermes-2-pr-9588-v1_reward/vocab.json
nousresearch-hermes-2-pr-9588-v1-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/nousresearch-hermes-2-pr-9588-v1_reward/tokenizer.json
nousresearch-hermes-2-pr-9588-v1-mkmlizer: cp /tmp/reward_cache/reward.tensors s3://guanaco-reward-models/nousresearch-hermes-2-pr-9588-v1_reward/reward.tensors
Job nousresearch-hermes-2-pr-9588-v1-mkmlizer completed after 83.39s with status: succeeded
Stopping job with name nousresearch-hermes-2-pr-9588-v1-mkmlizer
Pipeline stage MKMLizer completed in 89.30s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.11s
Running pipeline stage ISVCDeployer
Creating inference service nousresearch-hermes-2-pr-9588-v1
Waiting for inference service nousresearch-hermes-2-pr-9588-v1 to be ready
Inference service nousresearch-hermes-2-pr-9588-v1 ready after 30.197325468063354s
Pipeline stage ISVCDeployer completed in 38.36s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.0194263458251953s
Received healthy response to inference request in 1.2246994972229004s
Received healthy response to inference request in 1.2977049350738525s
Received healthy response to inference request in 1.2431952953338623s
Received healthy response to inference request in 1.1636056900024414s
5 requests
0 failed requests
5th percentile: 1.1758244514465332
10th percentile: 1.188043212890625
20th percentile: 1.2124807357788085
30th percentile: 1.2283986568450929
40th percentile: 1.2357969760894776
50th percentile: 1.2431952953338623
60th percentile: 1.2649991512298584
70th percentile: 1.2868030071258545
80th percentile: 1.4420492172241213
90th percentile: 1.7307377815246583
95th percentile: 1.8750820636749266
99th percentile: 1.9905574893951417
mean time: 1.3897263526916503
Pipeline stage StressChecker completed in 7.62s
Running pipeline stage DaemonicModelEvalScorer
Pipeline stage DaemonicModelEvalScorer completed in 0.03s
Running pipeline stage DaemonicSafetyScorer
Running M-Eval for topic stay_in_character
Pipeline stage DaemonicSafetyScorer completed in 0.04s
M-Eval Dataset for topic stay_in_character is loaded
nousresearch-hermes-2-pr_9588_v1 status is now deployed due to DeploymentManager action
nousresearch-hermes-2-pr_9588_v1 status is now inactive due to auto deactivation removed underperforming models

Usage Metrics

Latency Metrics