submission_id: nousresearch-meta-llama_4941_v91
developer_uid: zonemercy
alignment_samples: 0
best_of: 1
celo_rating: 1106.24
display_name: nousresearch-meta-llama_4941_v91
formatter: {'memory_template': "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n{bot_name}'s Persona: {memory}\n\n", 'prompt_template': '{prompt}<|eot_id|>', 'bot_template': '<|start_header_id|>assistant<|end_header_id|>\n\n{bot_name}: {message}<|eot_id|>', 'user_template': '<|start_header_id|>user<|end_header_id|>\n\n{user_name}: {message}<|eot_id|>', 'response_template': '<|start_header_id|>assistant<|end_header_id|>\n\n{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.95, 'top_p': 1.0, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<|eot_id|>'], 'max_input_tokens': 512, 'best_of': 1, 'max_output_tokens': 64, 'reward_max_token_input': 256}
is_internal_developer: True
language_model: NousResearch/Meta-Llama-3-8B-Instruct
max_input_tokens: 512
max_output_tokens: 64
model_architecture: LlamaForCausalLM
model_group: NousResearch/Meta-Llama-
model_name: nousresearch-meta-llama_4941_v91
model_num_parameters: 8030261248.0
model_repo: NousResearch/Meta-Llama-3-8B-Instruct
model_size: 8B
num_battles: 8168
num_wins: 3086
propriety_score: 0.7473065621939275
propriety_total_count: 1021.0
ranking_group: single
reward_formatter: {'bot_template': '{bot_name}: {message}\n', 'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'response_template': '{bot_name}:', 'truncate_by_message': False, 'user_template': '{user_name}: {message}\n'}
reward_repo: ChaiML/gpt2_xl_pairwise_89m_step_347634
status: torndown
submission_type: basic
timestamp: 2024-07-27T19:44:08+00:00
us_pacific_date: 2024-07-27
win_ratio: 0.3778158667972576
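The `win_ratio` field above appears to be `num_wins` divided by `num_battles`. A minimal sketch to check that arithmetic, using only the values listed in the metadata (the variable names are illustrative, not part of any platform API):

```python
# Values copied from the metadata fields above.
num_battles = 8168
num_wins = 3086
reported_win_ratio = 0.3778158667972576

# Assumption: win_ratio is simply wins / battles.
computed_win_ratio = num_wins / num_battles

print(f"computed={computed_win_ratio:.6f} reported={reported_win_ratio:.6f}")
```

The two values agree to within floating-point tolerance, which supports reading `win_ratio` as a plain fraction of battles won.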
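The `formatter` entry in the metadata defines string templates for a Llama 3 style chat prompt. The sketch below shows one plausible way such templates could be assembled into a single prompt. The assembly order (memory, prompt, conversation turns, response prefix), the `build_prompt` helper, and the sample names and messages are all assumptions for illustration; the log does not show how the platform actually combines them.

```python
# Templates copied from the `formatter` metadata entry.
formatter = {
    "memory_template": "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n{bot_name}'s Persona: {memory}\n\n",
    "prompt_template": "{prompt}<|eot_id|>",
    "bot_template": "<|start_header_id|>assistant<|end_header_id|>\n\n{bot_name}: {message}<|eot_id|>",
    "user_template": "<|start_header_id|>user<|end_header_id|>\n\n{user_name}: {message}<|eot_id|>",
    "response_template": "<|start_header_id|>assistant<|end_header_id|>\n\n{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: concatenate the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "user" or "bot".
    """
    parts = [
        formatter["memory_template"].format(bot_name=bot_name, memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(formatter["bot_template"].format(bot_name=bot_name, message=message))
        else:
            parts.append(formatter["user_template"].format(user_name=user_name, message=message))
    # The response prefix leaves the assistant turn open for generation.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Made-up sample conversation, for illustration only.
example = build_prompt(
    bot_name="Ava",
    user_name="Sam",
    memory="A friendly guide.",
    prompt="Ava greets visitors.",
    turns=[("user", "Hi there"), ("bot", "Hello, Sam!")],
)
print(example)
```

Note that `generation_params` lists `'\n'` and `'<|eot_id|>'` as stopping words, so generation after the open `{bot_name}:` prefix would end at the first newline or end-of-turn token.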
Running pipeline stage MKMLizer
Starting job with name nousresearch-meta-llama-4941-v91-mkmlizer
Waiting for job on nousresearch-meta-llama-4941-v91-mkmlizer to finish
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
nousresearch-meta-llama-4941-v91-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
nousresearch-meta-llama-4941-v91-mkmlizer: ║ _____ __ __ ║
nousresearch-meta-llama-4941-v91-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
nousresearch-meta-llama-4941-v91-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
nousresearch-meta-llama-4941-v91-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
nousresearch-meta-llama-4941-v91-mkmlizer: ║ /___/ ║
nousresearch-meta-llama-4941-v91-mkmlizer: ║ ║
nousresearch-meta-llama-4941-v91-mkmlizer: ║ Version: 0.9.7 ║
nousresearch-meta-llama-4941-v91-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
nousresearch-meta-llama-4941-v91-mkmlizer: ║ https://mk1.ai ║
nousresearch-meta-llama-4941-v91-mkmlizer: ║ ║
nousresearch-meta-llama-4941-v91-mkmlizer: ║ The license key for the current software has been verified as ║
nousresearch-meta-llama-4941-v91-mkmlizer: ║ belonging to: ║
nousresearch-meta-llama-4941-v91-mkmlizer: ║ ║
nousresearch-meta-llama-4941-v91-mkmlizer: ║ Chai Research Corp. ║
nousresearch-meta-llama-4941-v91-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
nousresearch-meta-llama-4941-v91-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
nousresearch-meta-llama-4941-v91-mkmlizer: ║ ║
nousresearch-meta-llama-4941-v91-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
nousresearch-meta-llama-4941-v91-mkmlizer: Downloaded to shared memory in 20.605s
nousresearch-meta-llama-4941-v91-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpt617ol1p, device:0
nousresearch-meta-llama-4941-v91-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
nousresearch-meta-llama-4941-v91-mkmlizer: quantized model in 25.383s
nousresearch-meta-llama-4941-v91-mkmlizer: Processed model NousResearch/Meta-Llama-3-8B-Instruct in 45.989s
nousresearch-meta-llama-4941-v91-mkmlizer: creating bucket guanaco-mkml-models
Failed to get response for submission shuttleai-shuttle-2-5-1_5730_v10: ('http://shuttleai-shuttle-2-5-1-5730-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1_5730_v10: ('http://shuttleai-shuttle-2-5-1-5730-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
nousresearch-meta-llama-4941-v91-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
nousresearch-meta-llama-4941-v91-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/nousresearch-meta-llama-4941-v91
nousresearch-meta-llama-4941-v91-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/nousresearch-meta-llama-4941-v91/config.json
nousresearch-meta-llama-4941-v91-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/nousresearch-meta-llama-4941-v91/special_tokens_map.json
nousresearch-meta-llama-4941-v91-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/nousresearch-meta-llama-4941-v91/tokenizer_config.json
nousresearch-meta-llama-4941-v91-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/nousresearch-meta-llama-4941-v91/tokenizer.json
nousresearch-meta-llama-4941-v91-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/nousresearch-meta-llama-4941-v91/flywheel_model.0.safetensors
nousresearch-meta-llama-4941-v91-mkmlizer: loading reward model from ChaiML/gpt2_xl_pairwise_89m_step_347634
nousresearch-meta-llama-4941-v91-mkmlizer: Loading 0: 100%|█████████▉| 290/291 [00:11<00:00, 3.32it/s]
nousresearch-meta-llama-4941-v91-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py:957: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
nousresearch-meta-llama-4941-v91-mkmlizer: warnings.warn(
nousresearch-meta-llama-4941-v91-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py:785: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
nousresearch-meta-llama-4941-v91-mkmlizer: warnings.warn(
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
nousresearch-meta-llama-4941-v91-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:469: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
nousresearch-meta-llama-4941-v91-mkmlizer: warnings.warn(
nousresearch-meta-llama-4941-v91-mkmlizer: Downloading shards: 100%|██████████| 2/2 [00:08<00:00, 4.01s/it]
nousresearch-meta-llama-4941-v91-mkmlizer: Loading checkpoint shards: 100%|██████████| 2/2 [00:00<00:00, 3.66it/s]
nousresearch-meta-llama-4941-v91-mkmlizer: Saving model to /tmp/reward_cache/reward.tensors
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
nousresearch-meta-llama-4941-v91-mkmlizer: Saving duration: 1.298s
nousresearch-meta-llama-4941-v91-mkmlizer: Processed model ChaiML/gpt2_xl_pairwise_89m_step_347634 in 13.110s
nousresearch-meta-llama-4941-v91-mkmlizer: creating bucket guanaco-reward-models
nousresearch-meta-llama-4941-v91-mkmlizer: Bucket 's3://guanaco-reward-models/' created
nousresearch-meta-llama-4941-v91-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/nousresearch-meta-llama-4941-v91_reward
nousresearch-meta-llama-4941-v91-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/nousresearch-meta-llama-4941-v91_reward/special_tokens_map.json
nousresearch-meta-llama-4941-v91-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/nousresearch-meta-llama-4941-v91_reward/tokenizer_config.json
nousresearch-meta-llama-4941-v91-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/nousresearch-meta-llama-4941-v91_reward/config.json
nousresearch-meta-llama-4941-v91-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/nousresearch-meta-llama-4941-v91_reward/merges.txt
nousresearch-meta-llama-4941-v91-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/nousresearch-meta-llama-4941-v91_reward/vocab.json
nousresearch-meta-llama-4941-v91-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/nousresearch-meta-llama-4941-v91_reward/tokenizer.json
nousresearch-meta-llama-4941-v91-mkmlizer: cp /tmp/reward_cache/reward.tensors s3://guanaco-reward-models/nousresearch-meta-llama-4941-v91_reward/reward.tensors
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1_5730_v10: ('http://shuttleai-shuttle-2-5-1-5730-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1_5730_v10: ('http://shuttleai-shuttle-2-5-1-5730-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1_5730_v10: ('http://shuttleai-shuttle-2-5-1-5730-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1_5730_v10: ('http://shuttleai-shuttle-2-5-1-5730-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Job nousresearch-meta-llama-4941-v91-mkmlizer completed after 95.87s with status: succeeded
Stopping job with name nousresearch-meta-llama-4941-v91-mkmlizer
Pipeline stage MKMLizer completed in 97.16s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.15s
Running pipeline stage ISVCDeployer
Creating inference service nousresearch-meta-llama-4941-v91
Waiting for inference service nousresearch-meta-llama-4941-v91 to be ready
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1_5730_v10: ('http://shuttleai-shuttle-2-5-1-5730-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'activator request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1_5730_v10: ('http://shuttleai-shuttle-2-5-1-5730-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'activator request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1_5730_v10: ('http://shuttleai-shuttle-2-5-1-5730-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1_5730_v10: ('http://shuttleai-shuttle-2-5-1-5730-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1-_5730_v8: ('http://shuttleai-shuttle-2-5-1-5730-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Failed to get response for submission shuttleai-shuttle-2-5-1_5730_v10: ('http://shuttleai-shuttle-2-5-1-5730-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Inference service nousresearch-meta-llama-4941-v91 ready after 100.63878726959229s
Pipeline stage ISVCDeployer completed in 102.48s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.6786234378814697s
Received healthy response to inference request in 0.8741347789764404s
Failed to get response for submission shuttleai-shuttle-2-5-1_5730_v10: ('http://shuttleai-shuttle-2-5-1-5730-v10-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'request timeout')
Received healthy response to inference request in 1.0602402687072754s
Received healthy response to inference request in 1.0450785160064697s
Received healthy response to inference request in 1.0222837924957275s
5 requests
0 failed requests
5th percentile: 0.9037645816802978
10th percentile: 0.9333943843841552
20th percentile: 0.9926539897918701
30th percentile: 1.0268427371978759
40th percentile: 1.0359606266021728
50th percentile: 1.0450785160064697
60th percentile: 1.051143217086792
70th percentile: 1.0572079181671143
80th percentile: 1.1839169025421143
90th percentile: 1.431270170211792
95th percentile: 1.5549468040466308
99th percentile: 1.6538881111145018
mean time: 1.1360721588134766
Pipeline stage StressChecker completed in 6.53s
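The percentile and mean figures reported by the StressChecker stage can be reproduced from the five healthy response times above. This is a minimal sketch assuming linear interpolation between sorted samples (the default behaviour of NumPy's `percentile`); the log does not state which method the platform uses, and the `percentile` helper here is illustrative.

```python
# Response times (seconds) copied from the five healthy responses above.
latencies = [
    1.6786234378814697,
    0.8741347789764404,
    1.0602402687072754,
    1.0450785160064697,
    1.0222837924957275,
]


def percentile(samples, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(samples)
    if not ordered:
        raise ValueError("samples must not be empty")
    rank = (q / 100) * (len(ordered) - 1)  # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


mean_time = sum(latencies) / len(latencies)

for q in (5, 50, 90, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only five samples, the higher percentiles are dominated by the single slow first request (about 1.68 s), so they say little about steady-state latency.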
nousresearch-meta-llama_4941_v91 status is now deployed due to DeploymentManager action
nousresearch-meta-llama_4941_v91 status is now inactive due to auto deactivation removed underperforming models
admin requested tearing down of nousresearch-meta-llama_4941_v91
Running pipeline stage ISVCDeleter
Checking if service nousresearch-meta-llama-4941-v91 is running
Tearing down inference service nousresearch-meta-llama-4941-v91
Service nousresearch-meta-llama-4941-v91 has been torndown
Pipeline stage ISVCDeleter completed in 5.31s
Running pipeline stage MKMLModelDeleter
Cleaning model data from S3
Cleaning model data from model cache
Deleting key nousresearch-meta-llama-4941-v91/config.json from bucket guanaco-mkml-models
Deleting key nousresearch-meta-llama-4941-v91/flywheel_model.0.safetensors from bucket guanaco-mkml-models
Deleting key nousresearch-meta-llama-4941-v91/special_tokens_map.json from bucket guanaco-mkml-models
Deleting key nousresearch-meta-llama-4941-v91/tokenizer.json from bucket guanaco-mkml-models
Deleting key nousresearch-meta-llama-4941-v91/tokenizer_config.json from bucket guanaco-mkml-models
Cleaning model data from model cache
Deleting key nousresearch-meta-llama-4941-v91_reward/config.json from bucket guanaco-reward-models
Deleting key nousresearch-meta-llama-4941-v91_reward/merges.txt from bucket guanaco-reward-models
Deleting key nousresearch-meta-llama-4941-v91_reward/reward.tensors from bucket guanaco-reward-models
Deleting key nousresearch-meta-llama-4941-v91_reward/special_tokens_map.json from bucket guanaco-reward-models
Deleting key nousresearch-meta-llama-4941-v91_reward/tokenizer.json from bucket guanaco-reward-models
Deleting key nousresearch-meta-llama-4941-v91_reward/tokenizer_config.json from bucket guanaco-reward-models
Deleting key nousresearch-meta-llama-4941-v91_reward/vocab.json from bucket guanaco-reward-models
Pipeline stage MKMLModelDeleter completed in 5.90s
nousresearch-meta-llama_4941_v91 status is now torndown due to DeploymentManager action
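To see where the pipeline spent its time, the `Pipeline stage ... completed in ...s` lines can be extracted from the log. The sketch below is one way to do that; the regular expression and the `stage_durations` helper are assumptions based on the line format seen in this log, not a documented format.

```python
import re

# Assumed line format, inferred from this log.
STAGE_RE = re.compile(r"^Pipeline stage (\w+) completed in ([\d.]+)s$")


def stage_durations(lines):
    """Map each pipeline stage name to its reported duration in seconds."""
    durations = {}
    for line in lines:
        match = STAGE_RE.match(line.strip())
        if match:
            durations[match.group(1)] = float(match.group(2))
    return durations


# Stage-completion lines copied from the log above.
log_lines = [
    "Pipeline stage MKMLizer completed in 97.16s",
    "Pipeline stage MKMLKubeTemplater completed in 0.15s",
    "Pipeline stage ISVCDeployer completed in 102.48s",
    "Pipeline stage StressChecker completed in 6.53s",
    "Pipeline stage ISVCDeleter completed in 5.31s",
    "Pipeline stage MKMLModelDeleter completed in 5.90s",
]

durations = stage_durations(log_lines)
deploy_stages = ("MKMLizer", "MKMLKubeTemplater", "ISVCDeployer", "StressChecker")
deploy_total = sum(durations[name] for name in deploy_stages)

for name, seconds in durations.items():
    print(f"{name}: {seconds:.2f}s")
print(f"deployment total: {deploy_total:.2f}s")
```

In this run, quantization and upload (MKMLizer) and waiting for the inference service (ISVCDeployer) account for nearly all of the deployment time.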
