developer_uid: zonemercy
submission_id: mistralai-mistral-nemo-_9330_v42
model_name: mistralai-mistral-nemo-_9330_v42
model_group: mistralai/Mistral-Nemo-I
status: torndown
timestamp: 2024-08-12T04:31:17+00:00
num_battles: 161977
num_wins: 81026
celo_rating: 1222.12
family_friendly_score: 0.0
submission_type: basic
model_repo: mistralai/Mistral-Nemo-Instruct-2407
model_architecture: MistralForCausalLM
reward_repo: ChaiML/gpt2_xl_pairwise_89m_step_347634
model_num_parameters: 12772070400.0
best_of: 16
max_input_tokens: 1024
max_output_tokens: 64
display_name: mistralai-mistral-nemo-_9330_v42
is_internal_developer: True
language_model: mistralai/Mistral-Nemo-Instruct-2407
model_size: 13B
ranking_group: single
us_pacific_date: 2024-08-11
win_ratio: 0.5002315143508029
generation_params: {'temperature': 0.95, 'top_p': 1.0, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['</s>', '###', 'Bot:', 'User:', 'You:', '<|im_end|>'], 'max_input_tokens': 1024, 'best_of': 16, 'max_output_tokens': 64, 'reward_max_token_input': 256}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
reward_formatter: {'bot_template': '{bot_name}: {message}\n', 'memory_template': '', 'prompt_template': '', 'response_template': '{bot_name}:', 'truncate_by_message': False, 'user_template': '{user_name}: {message}\n'}
Resubmit model
Running pipeline stage MKMLizer
Starting job with name mistralai-mistral-nemo-9330-v42-mkmlizer
Waiting for job on mistralai-mistral-nemo-9330-v42-mkmlizer to finish
Stopping job with name mistralai-mistral-nemo-9330-v42-mkmlizer
%s, retrying in %s seconds...
Starting job with name mistralai-mistral-nemo-9330-v42-mkmlizer
Waiting for job on mistralai-mistral-nemo-9330-v42-mkmlizer to finish
mistralai-mistral-nemo-9330-v42-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
mistralai-mistral-nemo-9330-v42-mkmlizer: ║ _____ __ __ ║
mistralai-mistral-nemo-9330-v42-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
mistralai-mistral-nemo-9330-v42-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
mistralai-mistral-nemo-9330-v42-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
mistralai-mistral-nemo-9330-v42-mkmlizer: ║ /___/ ║
mistralai-mistral-nemo-9330-v42-mkmlizer: ║ ║
mistralai-mistral-nemo-9330-v42-mkmlizer: ║ Version: 0.9.9 ║
mistralai-mistral-nemo-9330-v42-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
mistralai-mistral-nemo-9330-v42-mkmlizer: ║ https://mk1.ai ║
mistralai-mistral-nemo-9330-v42-mkmlizer: ║ ║
mistralai-mistral-nemo-9330-v42-mkmlizer: ║ The license key for the current software has been verified as ║
mistralai-mistral-nemo-9330-v42-mkmlizer: ║ belonging to: ║
mistralai-mistral-nemo-9330-v42-mkmlizer: ║ ║
mistralai-mistral-nemo-9330-v42-mkmlizer: ║ Chai Research Corp. ║
mistralai-mistral-nemo-9330-v42-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
mistralai-mistral-nemo-9330-v42-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
mistralai-mistral-nemo-9330-v42-mkmlizer: ║ ║
mistralai-mistral-nemo-9330-v42-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
mistralai-mistral-nemo-9330-v42-mkmlizer: Downloaded to shared memory in 51.271s
mistralai-mistral-nemo-9330-v42-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp6vyc2eg7, device:0
mistralai-mistral-nemo-9330-v42-mkmlizer: Saving flywheel model at /dev/shm/model_cache
mistralai-mistral-nemo-9330-v42-mkmlizer: quantized model in 36.194s
mistralai-mistral-nemo-9330-v42-mkmlizer: Processed model mistralai/Mistral-Nemo-Instruct-2407 in 87.465s
mistralai-mistral-nemo-9330-v42-mkmlizer: creating bucket guanaco-mkml-models
mistralai-mistral-nemo-9330-v42-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
mistralai-mistral-nemo-9330-v42-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v42
mistralai-mistral-nemo-9330-v42-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v42/config.json
mistralai-mistral-nemo-9330-v42-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v42/special_tokens_map.json
mistralai-mistral-nemo-9330-v42-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v42/tokenizer_config.json
mistralai-mistral-nemo-9330-v42-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v42/tokenizer.json
mistralai-mistral-nemo-9330-v42-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v42/flywheel_model.0.safetensors
mistralai-mistral-nemo-9330-v42-mkmlizer: loading reward model from ChaiML/gpt2_xl_pairwise_89m_step_347634
mistralai-mistral-nemo-9330-v42-mkmlizer: Loading 0: 0%| | 0/363 [00:00<?, ?it/s] Loading 0: 1%|▏ | 5/363 [00:00<00:11, 30.55it/s] Loading 0: 4%|▎ | 13/363 [00:00<00:07, 49.82it/s] Loading 0: 5%|▌ | 19/363 [00:00<00:07, 44.83it/s] Loading 0: 7%|▋ | 24/363 [00:00<00:07, 44.16it/s] Loading 0: 9%|▊ | 31/363 [00:00<00:06, 50.67it/s] Loading 0: 10%|█ | 37/363 [00:00<00:06, 48.07it/s] Loading 0: 12%|█▏ | 42/363 [00:00<00:06, 46.73it/s] Loading 0: 13%|█▎ | 49/363 [00:01<00:06, 52.04it/s] Loading 0: 15%|█▌ | 55/363 [00:01<00:06, 47.38it/s] Loading 0: 17%|█▋ | 61/363 [00:01<00:08, 35.33it/s] Loading 0: 18%|█▊ | 66/363 [00:01<00:08, 36.51it/s] Loading 0: 20%|█▉ | 72/363 [00:01<00:07, 40.37it/s] Loading 0: 21%|██▏ | 78/363 [00:01<00:07, 40.25it/s] Loading 0: 23%|██▎ | 83/363 [00:01<00:06, 41.00it/s] Loading 0: 25%|██▍ | 89/363 [00:02<00:06, 44.73it/s] Loading 0: 26%|██▌ | 94/363 [00:02<00:06, 44.65it/s] Loading 0: 27%|██▋ | 99/363 [00:02<00:05, 45.51it/s] Loading 0: 29%|██▉ | 105/363 [00:02<00:05, 44.04it/s] Loading 0: 31%|███ | 112/363 [00:02<00:05, 47.89it/s] Loading 0: 32%|███▏ | 117/363 [00:02<00:05, 45.49it/s] Loading 0: 34%|███▍ | 123/363 [00:02<00:05, 42.24it/s] Loading 0: 35%|███▌ | 128/363 [00:02<00:05, 41.50it/s] Loading 0: 37%|███▋ | 134/363 [00:03<00:05, 44.18it/s] Loading 0: 38%|███▊ | 139/363 [00:03<00:05, 43.98it/s] Loading 0: 40%|███▉ | 144/363 [00:03<00:08, 26.58it/s] Loading 0: 41%|████ | 149/363 [00:03<00:07, 29.49it/s] Loading 0: 43%|████▎ | 156/363 [00:03<00:05, 36.65it/s] Loading 0: 44%|████▍ | 161/363 [00:03<00:05, 38.53it/s] Loading 0: 46%|████▌ | 166/363 [00:04<00:04, 39.87it/s] Loading 0: 47%|████▋ | 171/363 [00:04<00:04, 41.77it/s] Loading 0: 48%|████▊ | 176/363 [00:04<00:05, 36.35it/s] Loading 0: 50%|█████ | 183/363 [00:04<00:04, 43.83it/s] Loading 0: 52%|█████▏ | 188/363 [00:04<00:03, 44.65it/s] Loading 0: 53%|█████▎ | 193/363 [00:04<00:03, 45.21it/s] Loading 0: 55%|█████▍ | 199/363 [00:04<00:03, 43.72it/s] Loading 0: 56%|█████▌ | 204/363 [00:04<00:03, 42.60it/s] Loading 0: 58%|█████▊ | 211/363 [00:05<00:03, 47.17it/s] Loading 0: 60%|█████▉ | 217/363 [00:05<00:03, 45.09it/s] Loading 0: 61%|██████ | 222/363 [00:05<00:03, 46.21it/s] Loading 0: 63%|██████▎ | 227/363 [00:05<00:04, 31.99it/s] Loading 0: 64%|██████▎ | 231/363 [00:05<00:04, 32.19it/s] Loading 0: 65%|██████▌ | 237/363 [00:05<00:03, 37.16it/s] Loading 0: 67%|██████▋ | 242/363 [00:05<00:03, 39.34it/s] Loading 0: 68%|██████▊ | 247/363 [00:05<00:02, 41.40it/s] Loading 0: 70%|██████▉ | 253/363 [00:06<00:02, 41.40it/s] Loading 0: 71%|███████ | 258/363 [00:06<00:02, 40.63it/s] Loading 0: 73%|███████▎ | 265/363 [00:06<00:02, 45.67it/s] Loading 0: 75%|███████▍ | 271/363 [00:06<00:02, 43.60it/s] Loading 0: 76%|███████▌ | 276/363 [00:06<00:02, 40.72it/s] Loading 0: 78%|███████▊ | 282/363 [00:06<00:01, 44.16it/s] Loading 0: 79%|███████▉ | 287/363 [00:06<00:01, 44.29it/s] Loading 0: 80%|████████ | 292/363 [00:07<00:01, 44.86it/s] Loading 0: 82%|████████▏ | 298/363 [00:07<00:01, 42.95it/s] Loading 0: 84%|████████▎ | 304/363 [00:14<00:22, 2.65it/s] Loading 0: 85%|████████▍ | 308/363 [00:14<00:16, 3.38it/s] Loading 0: 86%|████████▌ | 312/363 [00:14<00:11, 4.36it/s] Loading 0: 88%|████████▊ | 320/363 [00:14<00:06, 7.16it/s] Loading 0: 90%|████████▉ | 326/363 [00:14<00:03, 9.63it/s] Loading 0: 91%|█████████ | 331/363 [00:14<00:02, 12.18it/s] Loading 0: 93%|█████████▎| 338/363 [00:14<00:01, 17.02it/s] Loading 0: 95%|█████████▍| 344/363 [00:14<00:00, 20.63it/s] Loading 0: 96%|█████████▌| 349/363 [00:14<00:00, 23.61it/s] Loading 0: 98%|█████████▊| 356/363 [00:15<00:00, 30.16it/s] Loading 0: 100%|█████████▉| 362/363 [00:15<00:00, 32.97it/s] /opt/conda/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py:957: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
mistralai-mistral-nemo-9330-v42-mkmlizer: warnings.warn(
mistralai-mistral-nemo-9330-v42-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py:785: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
mistralai-mistral-nemo-9330-v42-mkmlizer: warnings.warn(
mistralai-mistral-nemo-9330-v42-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:469: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
mistralai-mistral-nemo-9330-v42-mkmlizer: warnings.warn(
mistralai-mistral-nemo-9330-v42-mkmlizer: Downloading shards: 0%| | 0/2 [00:00<?, ?it/s] Downloading shards: 50%|█████ | 1/2 [00:05<00:05, 5.04s/it] Downloading shards: 100%|██████████| 2/2 [00:07<00:00, 3.69s/it] Downloading shards: 100%|██████████| 2/2 [00:07<00:00, 3.89s/it]
mistralai-mistral-nemo-9330-v42-mkmlizer: Loading checkpoint shards: 0%| | 0/2 [00:00<?, ?it/s] Loading checkpoint shards: 50%|█████ | 1/2 [00:00<00:00, 2.12it/s] Loading checkpoint shards: 100%|██████████| 2/2 [00:00<00:00, 3.57it/s] Loading checkpoint shards: 100%|██████████| 2/2 [00:00<00:00, 3.24it/s]
mistralai-mistral-nemo-9330-v42-mkmlizer: Saving model to /tmp/reward_cache/reward.tensors
mistralai-mistral-nemo-9330-v42-mkmlizer: Saving duration: 1.366s
mistralai-mistral-nemo-9330-v42-mkmlizer: Processed model ChaiML/gpt2_xl_pairwise_89m_step_347634 in 12.832s
mistralai-mistral-nemo-9330-v42-mkmlizer: creating bucket guanaco-reward-models
mistralai-mistral-nemo-9330-v42-mkmlizer: Bucket 's3://guanaco-reward-models/' created
mistralai-mistral-nemo-9330-v42-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/mistralai-mistral-nemo-9330-v42_reward
mistralai-mistral-nemo-9330-v42-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/mistralai-mistral-nemo-9330-v42_reward/special_tokens_map.json
mistralai-mistral-nemo-9330-v42-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/mistralai-mistral-nemo-9330-v42_reward/config.json
mistralai-mistral-nemo-9330-v42-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/mistralai-mistral-nemo-9330-v42_reward/tokenizer_config.json
mistralai-mistral-nemo-9330-v42-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/mistralai-mistral-nemo-9330-v42_reward/merges.txt
mistralai-mistral-nemo-9330-v42-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/mistralai-mistral-nemo-9330-v42_reward/vocab.json
mistralai-mistral-nemo-9330-v42-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/mistralai-mistral-nemo-9330-v42_reward/tokenizer.json
mistralai-mistral-nemo-9330-v42-mkmlizer: cp /tmp/reward_cache/reward.tensors s3://guanaco-reward-models/mistralai-mistral-nemo-9330-v42_reward/reward.tensors
Job mistralai-mistral-nemo-9330-v42-mkmlizer completed after 136.59s with status: succeeded
Stopping job with name mistralai-mistral-nemo-9330-v42-mkmlizer
Pipeline stage MKMLizer completed in 138.34s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.10s
Running pipeline stage ISVCDeployer
Creating inference service mistralai-mistral-nemo-9330-v42
Waiting for inference service mistralai-mistral-nemo-9330-v42 to be ready
Connection pool is full, discarding connection: %s. Connection pool size: %s
Inference service mistralai-mistral-nemo-9330-v42 ready after 211.36928868293762s
Pipeline stage ISVCDeployer completed in 213.28s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.748878240585327s
Received healthy response to inference request in 1.805999994277954s
Received healthy response to inference request in 1.8153555393218994s
Received healthy response to inference request in 1.824211835861206s
Received healthy response to inference request in 1.8617472648620605s
5 requests
0 failed requests
5th percentile: 1.8078711032867432
10th percentile: 1.8097422122955322
20th percentile: 1.8134844303131104
30th percentile: 1.8171267986297608
40th percentile: 1.8206693172454833
50th percentile: 1.824211835861206
60th percentile: 1.8392260074615479
70th percentile: 1.8542401790618896
80th percentile: 2.039173460006714
90th percentile: 2.3940258502960208
95th percentile: 2.5714520454406737
99th percentile: 2.7133930015563963
mean time: 2.0112385749816895
Pipeline stage StressChecker completed in 10.89s
mistralai-mistral-nemo-_9330_v42 status is now deployed due to DeploymentManager action
mistralai-mistral-nemo-_9330_v42 status is now inactive due to auto deactivation removed underperforming models
mistralai-mistral-nemo-_9330_v42 status is now torndown due to DeploymentManager action