developer_uid: azuruce
submission_id: chaiml-nemo-20241017-mod_6892_v2
model_name: chaiml-nemo-20241017-mod_6892_v2
model_group: ChaiML/nemo-20241017-mod
status: torndown
timestamp: 2024-10-18T15:56:20+00:00
num_battles: 6528
num_wins: 3063
celo_rating: 1230.95
family_friendly_score: 0.5915167913417158
family_friendly_standard_error: 0.006181908555161077
submission_type: basic
model_repo: ChaiML/nemo-20241017-model_stock-remerge_v2_5merge-albert
model_architecture: MistralForCausalLM
model_num_parameters: 12772070400.0
best_of: 4
max_input_tokens: 1024
max_output_tokens: 64
display_name: chaiml-nemo-20241017-mod_6892_v2
is_internal_developer: True
language_model: ChaiML/nemo-20241017-model_stock-remerge_v2_5merge-albert
model_size: 13B
ranking_group: single
us_pacific_date: 2024-10-18
win_ratio: 0.46920955882352944
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<|eot_id|>', '<|end_of_text|>', 'You:'], 'max_input_tokens': 1024, 'best_of': 4, 'max_output_tokens': 64}
formatter: {'memory_template': '', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
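The formatter fields above can be read as string templates. The following is a minimal sketch of how they might be combined into a prompt, assuming the parts are concatenated in the order memory, prompt, conversation turns, response. That order, the `build_prompt` helper, and the example names "Albert" and "You" are hypothetical and are not taken from the record; truncation to `max_input_tokens` is not modelled.

```python
# Template strings copied from the formatter entry in the record.
formatter = {
    "memory_template": "",
    "prompt_template": "",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(bot_name, user_name, turns):
    """Hypothetical helper: join the templates into one prompt string.

    `turns` is a list of (speaker, message) pairs, speaker being "bot" or "user".
    The concatenation order is an assumption, not confirmed by the record.
    """
    parts = [formatter["memory_template"], formatter["prompt_template"]]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response template leaves the bot's line open for the model to complete.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


prompt = build_prompt("Albert", "You", [("user", "Hi"), ("bot", "Hello!")])
```

With these templates each turn ends in a newline, which is consistent with '\n' and 'You:' appearing among the stopping words in generation_params.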
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
Running pipeline stage MKMLizer
Starting job with name chaiml-nemo-20241017-mod-6892-v2-mkmlizer
Waiting for job on chaiml-nemo-20241017-mod-6892-v2-mkmlizer to finish
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: ║ _____ __ __ ║
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: ║ /___/ ║
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: ║ ║
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: ║ Version: 0.11.12 ║
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: ║ https://mk1.ai ║
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: ║ ║
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: ║ belonging to: ║
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: ║ ║
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: ║ Chai Research Corp. ║
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: ║ Expiration: 2025-01-15 23:59:59 ║
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: ║ ║
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: Downloaded to shared memory in 33.993s
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmptcl_z01c, device:0
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Failed to get response for submission mistralai-mistral-nemo_9330_v174: ('http://mistralai-mistral-nemo-9330-v174-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'read tcp 127.0.0.1:36536->127.0.0.1:8080: read: connection reset by peer\n')
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: quantized model in 37.588s
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: Processed model ChaiML/nemo-20241017-model_stock-remerge_v2_5merge-albert in 71.581s
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: creating bucket guanaco-mkml-models
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-nemo-20241017-mod-6892-v2
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-nemo-20241017-mod-6892-v2/config.json
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-nemo-20241017-mod-6892-v2/special_tokens_map.json
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-nemo-20241017-mod-6892-v2/tokenizer_config.json
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-nemo-20241017-mod-6892-v2/tokenizer.json
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-nemo-20241017-mod-6892-v2/flywheel_model.0.safetensors
chaiml-nemo-20241017-mod-6892-v2-mkmlizer: Loading 0: 99%|█████████▉| 359/363 [00:16<00:00, 37.22it/s]
Job chaiml-nemo-20241017-mod-6892-v2-mkmlizer completed after 94.26s with status: succeeded
Stopping job with name chaiml-nemo-20241017-mod-6892-v2-mkmlizer
Pipeline stage MKMLizer completed in 94.83s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.19s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-nemo-20241017-mod-6892-v2
Waiting for inference service chaiml-nemo-20241017-mod-6892-v2 to be ready
Inference service chaiml-nemo-20241017-mod-6892-v2 ready after 170.93961143493652s
Pipeline stage MKMLDeployer completed in 171.53s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.960395336151123s
Received healthy response to inference request in 1.7182204723358154s
Received healthy response to inference request in 1.8386752605438232s
Received healthy response to inference request in 1.3654708862304688s
Received healthy response to inference request in 1.484816312789917s
5 requests
0 failed requests
5th percentile: 1.3893399715423584
10th percentile: 1.4132090568542481
20th percentile: 1.4609472274780273
30th percentile: 1.5314971446990966
40th percentile: 1.624858808517456
50th percentile: 1.7182204723358154
60th percentile: 1.7664023876190185
70th percentile: 1.8145843029022217
80th percentile: 1.8630192756652832
90th percentile: 1.911707305908203
95th percentile: 1.936051321029663
99th percentile: 1.955526533126831
mean time: 1.6735156536102296
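The percentile and mean figures above can be reproduced from the five response times. The sketch below assumes linear interpolation between the sorted samples, which agrees with the reported values; the log does not say how StressChecker actually computes them, and the `percentile` helper is illustrative only.

```python
# The five healthy response times reported by StressChecker, in seconds.
latencies = [
    1.960395336151123,
    1.7182204723358154,
    1.8386752605438232,
    1.3654708862304688,
    1.484816312789917,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation
    between the two nearest sorted samples."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into ordered
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


mean_time = sum(latencies) / len(latencies)
p5 = percentile(latencies, 5)
p50 = percentile(latencies, 50)
p99 = percentile(latencies, 99)
```

With only five samples, every percentile is an interpolation between neighbouring measurements, so the tail figures (95th, 99th) say little about real worst-case latency.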
Pipeline stage StressChecker completed in 10.12s
Shutdown handler de-registered
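The per-stage durations reported in this log can be pulled out and summed to see where the pipeline time went. This is a small sketch that parses the "Pipeline stage ... completed in ...s" lines copied from above; the regular expression and the `stage_durations` helper are illustrative and not part of the pipeline itself.

```python
import re

# Stage completion lines copied from the log.
log_lines = [
    "Pipeline stage MKMLizer completed in 94.83s",
    "Pipeline stage MKMLTemplater completed in 0.19s",
    "Pipeline stage MKMLDeployer completed in 171.53s",
    "Pipeline stage StressChecker completed in 10.12s",
]

STAGE_PATTERN = re.compile(r"^Pipeline stage (\w+) completed in ([\d.]+)s$")


def stage_durations(lines):
    """Map each stage name to its reported duration in seconds."""
    durations = {}
    for line in lines:
        match = STAGE_PATTERN.match(line)
        if match:
            durations[match.group(1)] = float(match.group(2))
    return durations


durations = stage_durations(log_lines)
total_seconds = sum(durations.values())
slowest_stage = max(durations, key=durations.get)
```

For this run the four stages add up to roughly 276.67 s, with MKMLDeployer (waiting for the inference service to become ready) taking the largest share.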
chaiml-nemo-20241017-mod_6892_v2 status is now deployed due to DeploymentManager action
chaiml-nemo-20241017-mod_6892_v2 status is now inactive due to auto deactivation removed underperforming models
chaiml-nemo-20241017-mod_6892_v2 status is now torndown due to DeploymentManager action