developer_uid: azuruce
submission_id: mistralai-mistral-nemo_9330_v180
model_name: baseline
model_group: mistralai/Mistral-Nemo-I
status: torndown
timestamp: 2024-11-01T07:55:49+00:00
num_battles: 11504
num_wins: 5818
celo_rating: 1255.38
family_friendly_score: 0.5836
family_friendly_standard_error: 0.0069715283833604235
submission_type: basic
model_repo: mistralai/Mistral-Nemo-Instruct-2407
model_architecture: MistralForCausalLM
model_num_parameters: 12772070400.0
best_of: 8
max_input_tokens: 1024
max_output_tokens: 64
display_name: baseline
is_internal_developer: True
language_model: mistralai/Mistral-Nemo-Instruct-2407
model_size: 13B
ranking_group: single
us_pacific_date: 2024-11-01
win_ratio: 0.5057371349095967
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<|eot_id|>', '<|end_of_text|>', 'You:'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
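Note on the formatter above: the sketch below shows one plausible way these templates combine into a model prompt. The assembly order (memory, prompt, conversation turns, then the response template) and the helper name `build_prompt` are assumptions for illustration, not taken from the pipeline. Only the template strings come from the `formatter` entry.

```python
# Template strings copied from the `formatter` metadata entry.
FORMATTER = {
    "memory_template": "",
    "prompt_template": "",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(formatter, bot_name, user_name, turns):
    """Hypothetical helper: join the templates into a single prompt string.

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The ordering used here is an assumption.
    """
    parts = [formatter["memory_template"], formatter["prompt_template"]]
    for speaker, message in turns:
        key = "bot_template" if speaker == "bot" else "user_template"
        parts.append(
            formatter[key].format(
                bot_name=bot_name, user_name=user_name, message=message
            )
        )
    # The response template leaves the bot's line open for the model to complete.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


prompt = build_prompt(FORMATTER, "Bot", "You", [("user", "Hi"), ("bot", "Hello")])
print(prompt)
```

Under this reading, each generated reply is a single line: the `stopping_words` list in `generation_params` includes '\n' and 'You:', so generation would end at the first newline or at the start of a new user turn.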
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name mistralai-mistral-nemo-9330-v180-mkmlizer
Waiting for job on mistralai-mistral-nemo-9330-v180-mkmlizer to finish
mistralai-mistral-nemo-9330-v180-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
mistralai-mistral-nemo-9330-v180-mkmlizer: ║ ║
mistralai-mistral-nemo-9330-v180-mkmlizer: ║ Version: 0.11.12 ║
mistralai-mistral-nemo-9330-v180-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
mistralai-mistral-nemo-9330-v180-mkmlizer: ║ https://mk1.ai ║
mistralai-mistral-nemo-9330-v180-mkmlizer: ║ ║
mistralai-mistral-nemo-9330-v180-mkmlizer: ║ The license key for the current software has been verified as ║
mistralai-mistral-nemo-9330-v180-mkmlizer: ║ belonging to: ║
mistralai-mistral-nemo-9330-v180-mkmlizer: ║ ║
mistralai-mistral-nemo-9330-v180-mkmlizer: ║ Chai Research Corp. ║
mistralai-mistral-nemo-9330-v180-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
mistralai-mistral-nemo-9330-v180-mkmlizer: ║ Expiration: 2025-01-15 23:59:59 ║
mistralai-mistral-nemo-9330-v180-mkmlizer: ║ ║
mistralai-mistral-nemo-9330-v180-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Connection pool is full, discarding connection: %s. Connection pool size: %s
mistralai-mistral-nemo-9330-v180-mkmlizer: Downloaded to shared memory in 53.046s
mistralai-mistral-nemo-9330-v180-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpfhsgoz9m, device:0
mistralai-mistral-nemo-9330-v180-mkmlizer: Saving flywheel model at /dev/shm/model_cache
mistralai-mistral-nemo-9330-v180-mkmlizer: quantized model in 37.565s
mistralai-mistral-nemo-9330-v180-mkmlizer: Processed model mistralai/Mistral-Nemo-Instruct-2407 in 90.612s
mistralai-mistral-nemo-9330-v180-mkmlizer: creating bucket guanaco-mkml-models
mistralai-mistral-nemo-9330-v180-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
mistralai-mistral-nemo-9330-v180-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v180
mistralai-mistral-nemo-9330-v180-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v180/config.json
mistralai-mistral-nemo-9330-v180-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v180/special_tokens_map.json
mistralai-mistral-nemo-9330-v180-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v180/tokenizer_config.json
mistralai-mistral-nemo-9330-v180-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v180/tokenizer.json
mistralai-mistral-nemo-9330-v180-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v180/flywheel_model.0.safetensors
mistralai-mistral-nemo-9330-v180-mkmlizer: Loading 0: 99%|█████████▉| 361/363 [00:16<00:00, 32.33it/s]
Job mistralai-mistral-nemo-9330-v180-mkmlizer completed after 114.72s with status: succeeded
Stopping job with name mistralai-mistral-nemo-9330-v180-mkmlizer
Pipeline stage MKMLizer completed in 115.32s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.18s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service mistralai-mistral-nemo-9330-v180
Waiting for inference service mistralai-mistral-nemo-9330-v180 to be ready
Inference service mistralai-mistral-nemo-9330-v180 ready after 130.64942836761475s
Pipeline stage MKMLDeployer completed in 131.32s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.8385214805603027s
Received healthy response to inference request in 1.6318750381469727s
Received healthy response to inference request in 1.739013910293579s
Received healthy response to inference request in 1.4445505142211914s
Received healthy response to inference request in 1.9054927825927734s
5 requests
0 failed requests
5th percentile: 1.4820154190063477
10th percentile: 1.519480323791504
20th percentile: 1.5944101333618164
30th percentile: 1.6533028125762939
40th percentile: 1.6961583614349365
50th percentile: 1.739013910293579
60th percentile: 1.7788169384002686
70th percentile: 1.818619966506958
80th percentile: 1.8519157409667968
90th percentile: 1.8787042617797851
95th percentile: 1.8920985221862794
99th percentile: 1.9028139305114746
mean time: 1.711890745162964
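Note on the StressChecker summary above: the reported percentiles and mean are consistent with linear interpolation between the sorted latencies of the five healthy responses. The sketch below reproduces them with the standard library only. The function name `percentile` is hypothetical, and the interpolation method is inferred from the numbers rather than confirmed by the pipeline.

```python
# Latencies (seconds) copied from the five "Received healthy response" lines.
latencies = sorted([
    1.8385214805603027,
    1.6318750381469727,
    1.739013910293579,
    1.4445505142211914,
    1.9054927825927734,
])


def percentile(sorted_values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    rank = (len(sorted_values) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = rank - lower
    low_value = sorted_values[lower]
    return low_value + (sorted_values[upper] - low_value) * fraction


p5 = percentile(latencies, 5)
p50 = percentile(latencies, 50)
p90 = percentile(latencies, 90)
mean_time = sum(latencies) / len(latencies)

for label, value in [("5th", p5), ("50th", p50), ("90th", p90), ("mean", mean_time)]:
    print(f"{label}: {value}")
```

With only five samples, the tail percentiles (95th, 99th) are interpolated between the two slowest requests, so they describe this small sample rather than production tail latency.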
Pipeline stage StressChecker completed in 10.01s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 3.43s
Shutdown handler de-registered
mistralai-mistral-nemo_9330_v180 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2532.01s
Shutdown handler de-registered
mistralai-mistral-nemo_9330_v180 status is now inactive due to auto deactivation removed underperforming models
mistralai-mistral-nemo_9330_v180 status is now torndown due to DeploymentManager action