submission_id: arushimgupta-final-check_2833_v1
developer_uid: immaculate_possum_03470
best_of: 8
celo_rating: 1246.7
display_name: nemo_base_1
family_friendly_score: 0.5893607305936073
family_friendly_standard_error: 0.008114037993985425
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.95, 'top_p': 1.0, 'min_p': 0.075, 'top_k': 60, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<|eot_id|>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
ineligible_reason: num_battles<5000
is_internal_developer: False
language_model: arushimgupta/final_checkpoint_dpo5
max_input_tokens: 1024
max_output_tokens: 64
model_architecture: MistralForCausalLM
model_group: arushimgupta/final_check
model_name: nemo_base_1
model_num_parameters: 12772070400.0
model_repo: arushimgupta/final_checkpoint_dpo5
model_size: 13B
num_battles: 3663
num_wins: 1794
ranking_group: single
status: torndown
submission_type: basic
timestamp: 2024-09-28T00:59:17+00:00
us_pacific_date: 2024-09-27
win_ratio: 0.48976248976248976
Download Preference Data
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name arushimgupta-final-check-2833-v1-mkmlizer
Waiting for job on arushimgupta-final-check-2833-v1-mkmlizer to finish
arushimgupta-final-check-2833-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
arushimgupta-final-check-2833-v1-mkmlizer: ║ _____ __ __ ║
arushimgupta-final-check-2833-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
arushimgupta-final-check-2833-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
arushimgupta-final-check-2833-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
arushimgupta-final-check-2833-v1-mkmlizer: ║ /___/ ║
arushimgupta-final-check-2833-v1-mkmlizer: ║ ║
arushimgupta-final-check-2833-v1-mkmlizer: ║ Version: 0.11.12 ║
arushimgupta-final-check-2833-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
arushimgupta-final-check-2833-v1-mkmlizer: ║ https://mk1.ai ║
arushimgupta-final-check-2833-v1-mkmlizer: ║ ║
arushimgupta-final-check-2833-v1-mkmlizer: ║ The license key for the current software has been verified as ║
arushimgupta-final-check-2833-v1-mkmlizer: ║ belonging to: ║
arushimgupta-final-check-2833-v1-mkmlizer: ║ ║
arushimgupta-final-check-2833-v1-mkmlizer: ║ Chai Research Corp. ║
arushimgupta-final-check-2833-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
arushimgupta-final-check-2833-v1-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
arushimgupta-final-check-2833-v1-mkmlizer: ║ ║
arushimgupta-final-check-2833-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
arushimgupta-final-check-2833-v1-mkmlizer: Downloaded to shared memory in 51.291s
arushimgupta-final-check-2833-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp65ren9fi, device:0
arushimgupta-final-check-2833-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
arushimgupta-final-check-2833-v1-mkmlizer: quantized model in 35.367s
arushimgupta-final-check-2833-v1-mkmlizer: Processed model arushimgupta/final_checkpoint_dpo5 in 86.658s
arushimgupta-final-check-2833-v1-mkmlizer: creating bucket guanaco-mkml-models
arushimgupta-final-check-2833-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
arushimgupta-final-check-2833-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/arushimgupta-final-check-2833-v1
arushimgupta-final-check-2833-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/arushimgupta-final-check-2833-v1/config.json
arushimgupta-final-check-2833-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/arushimgupta-final-check-2833-v1/special_tokens_map.json
arushimgupta-final-check-2833-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/arushimgupta-final-check-2833-v1/tokenizer_config.json
arushimgupta-final-check-2833-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/arushimgupta-final-check-2833-v1/tokenizer.json
arushimgupta-final-check-2833-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/arushimgupta-final-check-2833-v1/flywheel_model.0.safetensors
arushimgupta-final-check-2833-v1-mkmlizer: Loading 0: 0%| | 0/363 [00:00<?, ?it/s] Loading 0: 1%|▏ | 5/363 [00:00<00:13, 27.18it/s] Loading 0: 4%|▎ | 13/363 [00:00<00:07, 47.65it/s] Loading 0: 5%|▌ | 19/363 [00:00<00:07, 44.84it/s] Loading 0: 7%|▋ | 24/363 [00:00<00:07, 43.50it/s] Loading 0: 9%|▊ | 31/363 [00:00<00:06, 50.29it/s] Loading 0: 10%|█ | 37/363 [00:00<00:06, 48.04it/s] Loading 0: 12%|█▏ | 42/363 [00:00<00:06, 47.50it/s] Loading 0: 13%|█▎ | 49/363 [00:01<00:05, 53.05it/s] Loading 0: 15%|█▌ | 55/363 [00:01<00:06, 48.86it/s] Loading 0: 17%|█▋ | 61/363 [00:01<00:08, 34.09it/s] Loading 0: 18%|█▊ | 66/363 [00:01<00:08, 34.38it/s] Loading 0: 20%|█▉ | 72/363 [00:01<00:07, 38.63it/s] Loading 0: 21%|██▏ | 78/363 [00:01<00:06, 40.80it/s] Loading 0: 23%|██▎ | 83/363 [00:01<00:06, 40.87it/s] Loading 0: 25%|██▍ | 90/363 [00:02<00:05, 46.66it/s] Loading 0: 26%|██▌ | 95/363 [00:02<00:05, 47.23it/s] Loading 0: 28%|██▊ | 100/363 [00:02<00:06, 39.64it/s] Loading 0: 30%|███ | 109/363 [00:02<00:04, 51.32it/s] Loading 0: 32%|███▏ | 115/363 [00:02<00:05, 48.16it/s] Loading 0: 33%|███▎ | 121/363 [00:02<00:05, 47.31it/s] Loading 0: 35%|███▍ | 127/363 [00:02<00:05, 40.70it/s] Loading 0: 37%|███▋ | 135/363 [00:03<00:04, 47.94it/s] Loading 0: 39%|███▉ | 141/363 [00:03<00:04, 47.71it/s] Loading 0: 40%|████ | 147/363 [00:03<00:06, 33.33it/s] Loading 0: 42%|████▏ | 152/363 [00:03<00:06, 34.47it/s] Loading 0: 43%|████▎ | 157/363 [00:03<00:05, 36.08it/s] Loading 0: 45%|████▍ | 162/363 [00:03<00:05, 38.85it/s] Loading 0: 46%|████▌ | 167/363 [00:04<00:05, 35.03it/s] Loading 0: 48%|████▊ | 175/363 [00:04<00:04, 44.25it/s] Loading 0: 50%|████▉ | 181/363 [00:04<00:04, 44.62it/s] Loading 0: 51%|█████ | 186/363 [00:04<00:04, 43.25it/s] Loading 0: 53%|█████▎ | 193/363 [00:04<00:03, 48.08it/s] Loading 0: 55%|█████▍ | 199/363 [00:04<00:03, 44.71it/s] Loading 0: 56%|█████▌ | 204/363 [00:04<00:03, 44.32it/s] Loading 0: 58%|█████▊ | 211/363 [00:04<00:03, 49.51it/s] Loading 0: 60%|█████▉ | 217/363 [00:05<00:03, 45.87it/s] Loading 0: 61%|██████ | 222/363 [00:05<00:03, 45.79it/s] Loading 0: 63%|██████▎ | 227/363 [00:05<00:04, 32.49it/s] Loading 0: 64%|██████▎ | 231/363 [00:05<00:04, 32.19it/s] Loading 0: 66%|██████▌ | 238/363 [00:05<00:03, 39.46it/s] Loading 0: 67%|██████▋ | 244/363 [00:05<00:02, 40.75it/s] Loading 0: 69%|██████▊ | 249/363 [00:05<00:02, 41.26it/s] Loading 0: 71%|███████ | 256/363 [00:06<00:02, 45.99it/s] Loading 0: 72%|███████▏ | 262/363 [00:06<00:02, 45.32it/s] Loading 0: 74%|███████▎ | 267/363 [00:06<00:02, 44.60it/s] Loading 0: 75%|███████▌ | 273/363 [00:06<00:01, 47.57it/s] Loading 0: 77%|███████▋ | 278/363 [00:06<00:01, 44.16it/s] Loading 0: 78%|███████▊ | 283/363 [00:06<00:01, 42.81it/s] Loading 0: 80%|███████▉ | 289/363 [00:06<00:01, 42.61it/s] Loading 0: 81%|████████ | 294/363 [00:06<00:01, 41.75it/s] Loading 0: 82%|████████▏ | 299/363 [00:07<00:01, 43.47it/s] Loading 0: 84%|████████▎ | 304/363 [00:13<00:23, 2.48it/s] Loading 0: 85%|████████▍ | 308/363 [00:13<00:17, 3.22it/s] Loading 0: 86%|████████▌ | 312/363 [00:13<00:12, 4.19it/s] Loading 0: 88%|████████▊ | 319/363 [00:14<00:06, 6.66it/s] Loading 0: 89%|████████▉ | 324/363 [00:14<00:04, 8.79it/s] Loading 0: 91%|█████████ | 329/363 [00:14<00:02, 11.49it/s] Loading 0: 92%|█████████▏| 335/363 [00:14<00:01, 15.32it/s] Loading 0: 94%|█████████▎| 340/363 [00:14<00:01, 18.74it/s] Loading 0: 96%|█████████▌| 347/363 [00:14<00:00, 25.42it/s] Loading 0: 97%|█████████▋| 353/363 [00:14<00:00, 29.21it/s] Loading 0: 99%|█████████▊| 358/363 [00:14<00:00, 31.73it/s]
Job arushimgupta-final-check-2833-v1-mkmlizer completed after 163.82s with status: succeeded
Stopping job with name arushimgupta-final-check-2833-v1-mkmlizer
Pipeline stage MKMLizer completed in 164.67s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.08s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service arushimgupta-final-check-2833-v1
Waiting for inference service arushimgupta-final-check-2833-v1 to be ready
Inference service arushimgupta-final-check-2833-v1 ready after 220.5996491909027s
Pipeline stage MKMLDeployer completed in 220.98s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.4158620834350586s
Received healthy response to inference request in 1.7673685550689697s
Received healthy response to inference request in 1.781648874282837s
Received healthy response to inference request in 1.8303470611572266s
Received healthy response to inference request in 2.4410533905029297s
5 requests
0 failed requests
5th percentile: 1.7702246189117432
10th percentile: 1.7730806827545167
20th percentile: 1.7787928104400634
30th percentile: 1.7913885116577148
40th percentile: 1.8108677864074707
50th percentile: 1.8303470611572266
60th percentile: 2.064553070068359
70th percentile: 2.2987590789794923
80th percentile: 2.420900344848633
90th percentile: 2.4309768676757812
95th percentile: 2.4360151290893555
99th percentile: 2.4400457382202148
mean time: 2.0472559928894043
Pipeline stage StressChecker completed in 10.82s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 3.36s
Shutdown handler de-registered
arushimgupta-final-check_2833_v1 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Skipping teardown as no inference service was successfully deployed
Pipeline stage MKMLProfilerDeleter completed in 0.11s
run pipeline stage %s
Running pipeline stage MKMLProfilerTemplater
Pipeline stage MKMLProfilerTemplater completed in 0.10s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeployer
Creating inference service arushimgupta-final-check-2833-v1-profiler
Waiting for inference service arushimgupta-final-check-2833-v1-profiler to be ready
Tearing down inference service arushimgupta-final-check-2833-v1-profiler
%s, retrying in %s seconds...
Creating inference service arushimgupta-final-check-2833-v1-profiler
Waiting for inference service arushimgupta-final-check-2833-v1-profiler to be ready
Tearing down inference service arushimgupta-final-check-2833-v1-profiler
%s, retrying in %s seconds...
Creating inference service arushimgupta-final-check-2833-v1-profiler
Waiting for inference service arushimgupta-final-check-2833-v1-profiler to be ready
Tearing down inference service arushimgupta-final-check-2833-v1-profiler
clean up pipeline due to error=%s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Skipping teardown as no inference service was successfully deployed
Pipeline stage MKMLProfilerDeleter completed in 0.14s
Shutdown handler de-registered
arushimgupta-final-check_2833_v1 status is now inactive due to auto deactivation removed underperforming models
admin requested tearing down of chaiml-nemo-community-2c_v1
Deleting key arushimgupta-final-check-3580-v3/special_tokens_map.json from bucket guanaco-mkml-models
Deleting key arushimgupta-final-check-3178-v2/tokenizer.json from bucket guanaco-mkml-models
Deleting key arushimgupta-lora-save-2-v1/config.json from bucket guanaco-mkml-models
Deleting key arushimgupta-lora-save-1-v1/flywheel_model.0.safetensors from bucket guanaco-mkml-models
Deleting key arushimgupta-final-check-3580-v1/tokenizer.json from bucket guanaco-mkml-models
Checking if service chaiml-nemo-chai-4bio-me-9462-v2 is running
Tearing down inference service chaiml-llama-8b-big-retu-8570-v2
Cleaning model data from S3
Running pipeline stage MKMLDeleter
Cleaning model data from S3
run pipeline %s
arushimgupta-final-check_2833_v1 status is now torndown due to DeploymentManager action
Shutdown handler not registered because Python interpreter is not running in the main thread
admin requested tearing down of chaiml-nemo-community-5_v1
Deleting key arushimgupta-final-check-3580-v3/tokenizer.json from bucket guanaco-mkml-models
arushimgupta-final-check_2833_v1 status is now torndown due to DeploymentManager action
Shutdown handler not registered because Python interpreter is not running in the main thread
Pipeline stage %s skipped, reason=%s
admin requested tearing down of arushimgupta-final-check_2833_v1
run pipeline %s
Pipeline stage MKMLDeleter completed in 0.36s
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline stage %s
admin requested tearing down of arushimgupta-final-check_3178_v1
run pipeline stage %s
run pipeline %s
Running pipeline stage MKMLDeleter
Shutdown handler not registered because Python interpreter is not running in the main thread
Running pipeline stage MKMLModelDeleter
run pipeline stage %s
admin requested tearing down of arushimgupta-final-check_3178_v2
Pipeline stage %s skipped, reason=%s
run pipeline %s
Pipeline stage %s skipped, reason=%s
Running pipeline stage MKMLDeleter
Shutdown handler not registered because Python interpreter is not running in the main thread
Pipeline stage MKMLDeleter completed in 0.89s
run pipeline stage %s
Pipeline stage MKMLModelDeleter completed in 0.95s
Pipeline stage %s skipped, reason=%s
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLDeleter
Shutdown handler de-registered
Pipeline stage MKMLDeleter completed in 0.81s
run pipeline stage %s
anthracite-org-magnum-v2_6820_v1 status is now torndown due to DeploymentManager action
run pipeline stage %s
Running pipeline stage MKMLDeleter
Pipeline stage %s skipped, reason=%s
Pipeline stage MKMLDeleter completed in 0.81s
Running pipeline stage MKMLModelDeleter
Pipeline stage %s skipped, reason=%s
Pipeline stage MKMLModelDeleter completed in 0.89s
run pipeline stage %s
Pipeline stage %s skipped, reason=%s
Pipeline stage MKMLDeleter completed in 0.72s
Shutdown handler de-registered
Running pipeline stage MKMLModelDeleter
Pipeline stage MKMLModelDeleter completed in 0.71s
run pipeline stage %s
arliai-mistral-nemo-12b-_9104_v4 status is now torndown due to DeploymentManager action
Pipeline stage %s skipped, reason=%s
Shutdown handler de-registered
Running pipeline stage MKMLModelDeleter
Pipeline stage MKMLModelDeleter completed in 0.73s
Pipeline stage MKMLModelDeleter completed in 0.73s
arushimgupta-final-check_2833_v1 status is now torndown due to DeploymentManager action
Pipeline stage %s skipped, reason=%s
arushimgupta-final-check_2833_v1 status is now torndown due to DeploymentManager action
arushimgupta-final-check_2833_v1 status is now torndown due to DeploymentManager action
arushimgupta-final-check_2833_v1 status is now torndown due to DeploymentManager action
Shutdown handler not registered because Python interpreter is not running in the main thread
Pipeline stage %s skipped, reason=%s
admin requested tearing down of arushimgupta-final-check_2833_v1
run pipeline %s
Pipeline stage MKMLDeleter completed in 0.46s
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline stage %s
admin requested tearing down of arushimgupta-final-check_3178_v1
run pipeline stage %s
run pipeline %s
Running pipeline stage MKMLDeleter
Shutdown handler not registered because Python interpreter is not running in the main thread
Running pipeline stage MKMLModelDeleter
run pipeline stage %s
Pipeline stage %s skipped, reason=%s
run pipeline %s
Pipeline stage %s skipped, reason=%s
Running pipeline stage MKMLDeleter
Pipeline stage MKMLDeleter completed in 0.77s
run pipeline stage %s
Pipeline stage MKMLModelDeleter completed in 0.70s
Pipeline stage %s skipped, reason=%s
run pipeline stage %s
Running pipeline stage MKMLDeleter
Shutdown handler de-registered
Pipeline stage MKMLDeleter completed in 0.76s
Pipeline stage %s skipped, reason=%s
anthracite-org-magnum-v2_6820_v1 status is now torndown due to DeploymentManager action
run pipeline stage %s
Pipeline stage %s skipped, reason=%s
Pipeline stage MKMLDeleter completed in 0.90s
run pipeline stage %s
Running pipeline stage MKMLModelDeleter
Pipeline stage MKMLModelDeleter completed in 0.92s
Running pipeline stage MKMLModelDeleter
Pipeline stage %s skipped, reason=%s
Shutdown handler de-registered
Pipeline stage %s skipped, reason=%s
arliai-mistral-nemo-12b-_9104_v4 status is now torndown due to DeploymentManager action
Pipeline stage MKMLModelDeleter completed in 0.56s
Shutdown handler de-registered
Shutdown handler de-registered
Shutdown handler de-registered
arushimgupta-final-check_2833_v1 status is now torndown due to DeploymentManager action
arushimgupta-final-check_3178_v1 status is now torndown due to DeploymentManager action
arushimgupta-final-check_2833_v1 status is now torndown due to DeploymentManager action
arushimgupta-final-check_3178_v1 status is now torndown due to DeploymentManager action
Shutdown handler not registered because Python interpreter is not running in the main thread
Pipeline stage %s skipped, reason=%s
admin requested tearing down of arushimgupta-final-check_2833_v1
run pipeline %s
Pipeline stage MKMLDeleter completed in 0.54s
Shutdown handler not registered because Python interpreter is not running in the main thread
admin requested tearing down of arushimgupta-final-check_3178_v1
run pipeline stage %s
run pipeline stage %s
run pipeline %s
Shutdown handler not registered because Python interpreter is not running in the main thread
Running pipeline stage MKMLDeleter
admin requested tearing down of arushimgupta-final-check_3178_v2
Running pipeline stage MKMLModelDeleter
run pipeline stage %s
run pipeline %s
Pipeline stage %s skipped, reason=%s
Pipeline stage %s skipped, reason=%s
Shutdown handler not registered because Python interpreter is not running in the main thread
Running pipeline stage MKMLDeleter
run pipeline stage %s
Pipeline stage MKMLDeleter completed in 1.11s
Pipeline stage MKMLModelDeleter completed in 0.91s
run pipeline %s
Pipeline stage %s skipped, reason=%s
Running pipeline stage MKMLDeleter
run pipeline stage %s
Shutdown handler de-registered
run pipeline stage %s
Pipeline stage MKMLDeleter completed in 1.12s
anthracite-org-magnum-v2_6820_v1 status is now torndown due to DeploymentManager action
Running pipeline stage MKMLDeleter
run pipeline stage %s
Pipeline stage MKMLDeleter completed in 1.22s
Pipeline stage %s skipped, reason=%s
Pipeline stage %s skipped, reason=%s
Running pipeline stage MKMLModelDeleter
run pipeline stage %s
Pipeline stage MKMLModelDeleter completed in 1.05s
Pipeline stage MKMLDeleter completed in 0.90s
Pipeline stage %s skipped, reason=%s
Running pipeline stage MKMLModelDeleter
Shutdown handler de-registered
run pipeline stage %s
Pipeline stage MKMLModelDeleter completed in 0.82s
Running pipeline stage MKMLModelDeleter
arliai-mistral-nemo-12b-_9104_v4 status is now torndown due to DeploymentManager action
Shutdown handler de-registered
Pipeline stage MKMLModelDeleter completed in 0.91s
Pipeline stage %s skipped, reason=%s
Shutdown handler de-registered
Pipeline stage %s skipped, reason=%s
Shutdown handler de-registered
arushimgupta-final-check_2833_v1 status is now torndown due to DeploymentManager action
arushimgupta-final-check_2833_v1 status is now torndown due to DeploymentManager action
Shutdown handler de-registered
arushimgupta-final-check_2833_v1 status is now torndown due to DeploymentManager action
arushimgupta-final-check_2833_v1 status is now torndown due to DeploymentManager action