submission_id: chaiml-elo-alignment-run-3_v56
developer_uid: chai_backend_admin
best_of: 16
celo_rating: 1266.87
display_name: chaiml-elo-alignment-run-3_v56
family_friendly_score: 0.5786
family_friendly_standard_error: 0.006983151723971061
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.95, 'top_p': 1.0, 'min_p': 0.08, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '<|eot_id|>'], 'max_input_tokens': 512, 'best_of': 16, 'max_output_tokens': 64}
gpu_counts: {'NVIDIA RTX A5000': 1}
is_internal_developer: True
language_model: ChaiML/elo_alignment_run_3
latencies: [{'batch_size': 1, 'throughput': 0.8654063219832688, 'latency_mean': 1.1554662656784058, 'latency_p50': 1.1573816537857056, 'latency_p90': 1.299802565574646}, {'batch_size': 3, 'throughput': 1.5342539742157824, 'latency_mean': 1.9507559251785278, 'latency_p50': 1.950265884399414, 'latency_p90': 2.1823355197906493}, {'batch_size': 5, 'throughput': 1.7601473316987348, 'latency_mean': 2.8271560680866243, 'latency_p50': 2.8335787057876587, 'latency_p90': 3.1621808290481566}, {'batch_size': 6, 'throughput': 1.8096852490576678, 'latency_mean': 3.297809703350067, 'latency_p50': 3.2991846799850464, 'latency_p90': 3.6873255252838133}, {'batch_size': 8, 'throughput': 1.885363300537769, 'latency_mean': 4.213445301055908, 'latency_p50': 4.213890433311462, 'latency_p90': 4.738788843154907}, {'batch_size': 10, 'throughput': 1.8901611047941689, 'latency_mean': 5.232948710918427, 'latency_p50': 5.192382216453552, 'latency_p90': 6.080938005447388}]
max_input_tokens: 512
max_output_tokens: 64
model_architecture: LlamaForCausalLM
model_group: ChaiML/elo_alignment_run
model_name: chaiml-elo-alignment-run-3_v56
model_num_parameters: 8030261248.0
model_repo: ChaiML/elo_alignment_run_3
model_size: 8B
num_battles: 14601
num_wins: 8017
ranking_group: single
status: inactive
submission_type: basic
throughput_3p7s: 1.86
timestamp: 2024-11-11T18:07:24+00:00
us_pacific_date: 2024-11-11
win_ratio: 0.549071981371139
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-elo-alignment-run-3-v56-mkmlizer
Waiting for job on chaiml-elo-alignment-run-3-v56-mkmlizer to finish
chaiml-elo-alignment-run-3-v56-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-elo-alignment-run-3-v56-mkmlizer: ║ _____ __ __ ║
chaiml-elo-alignment-run-3-v56-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-elo-alignment-run-3-v56-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-elo-alignment-run-3-v56-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-elo-alignment-run-3-v56-mkmlizer: ║ /___/ ║
chaiml-elo-alignment-run-3-v56-mkmlizer: ║ ║
chaiml-elo-alignment-run-3-v56-mkmlizer: ║ Version: 0.11.33 ║
chaiml-elo-alignment-run-3-v56-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-elo-alignment-run-3-v56-mkmlizer: ║ https://mk1.ai ║
chaiml-elo-alignment-run-3-v56-mkmlizer: ║ ║
chaiml-elo-alignment-run-3-v56-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-elo-alignment-run-3-v56-mkmlizer: ║ belonging to: ║
chaiml-elo-alignment-run-3-v56-mkmlizer: ║ ║
chaiml-elo-alignment-run-3-v56-mkmlizer: ║ Chai Research Corp. ║
chaiml-elo-alignment-run-3-v56-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-elo-alignment-run-3-v56-mkmlizer: ║ Expiration: 2025-01-15 23:59:59 ║
chaiml-elo-alignment-run-3-v56-mkmlizer: ║ ║
chaiml-elo-alignment-run-3-v56-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
chaiml-elo-alignment-run-3-v56-mkmlizer: Downloaded to shared memory in 96.155s
chaiml-elo-alignment-run-3-v56-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpj0e8gd14, device:0
chaiml-elo-alignment-run-3-v56-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-elo-alignment-run-3-v56-mkmlizer: quantized model in 31.263s
chaiml-elo-alignment-run-3-v56-mkmlizer: Processed model ChaiML/elo_alignment_run_3 in 127.419s
chaiml-elo-alignment-run-3-v56-mkmlizer: creating bucket guanaco-mkml-models
chaiml-elo-alignment-run-3-v56-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-elo-alignment-run-3-v56-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-elo-alignment-run-3-v56
chaiml-elo-alignment-run-3-v56-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-elo-alignment-run-3-v56/config.json
chaiml-elo-alignment-run-3-v56-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-elo-alignment-run-3-v56/special_tokens_map.json
chaiml-elo-alignment-run-3-v56-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-elo-alignment-run-3-v56/tokenizer_config.json
chaiml-elo-alignment-run-3-v56-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-elo-alignment-run-3-v56/tokenizer.json
chaiml-elo-alignment-run-3-v56-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-elo-alignment-run-3-v56/flywheel_model.0.safetensors
chaiml-elo-alignment-run-3-v56-mkmlizer: Loading 0: 0%| | 0/291 [00:00<?, ?it/s] Loading 0: 2%|▏ | 5/291 [00:00<00:11, 24.96it/s] Loading 0: 4%|▍ | 12/291 [00:00<00:07, 36.06it/s] Loading 0: 5%|▌ | 16/291 [00:00<00:08, 32.85it/s] Loading 0: 7%|▋ | 21/291 [00:00<00:07, 35.22it/s] Loading 0: 9%|▊ | 25/291 [00:00<00:08, 32.39it/s] Loading 0: 11%|█ | 31/291 [00:00<00:06, 39.61it/s] Loading 0: 12%|█▏ | 36/291 [00:01<00:10, 23.82it/s] Loading 0: 14%|█▍ | 41/291 [00:01<00:09, 25.20it/s] Loading 0: 16%|█▋ | 48/291 [00:01<00:07, 32.05it/s] Loading 0: 18%|█▊ | 52/291 [00:01<00:07, 31.84it/s] Loading 0: 20%|█▉ | 57/291 [00:01<00:07, 33.08it/s] Loading 0: 21%|██ | 61/291 [00:01<00:07, 31.23it/s] Loading 0: 23%|██▎ | 66/291 [00:02<00:06, 32.68it/s] Loading 0: 24%|██▍ | 70/291 [00:02<00:07, 30.77it/s] Loading 0: 25%|██▌ | 74/291 [00:02<00:07, 29.65it/s] Loading 0: 27%|██▋ | 78/291 [00:02<00:07, 29.22it/s] Loading 0: 28%|██▊ | 81/291 [00:02<00:10, 20.32it/s] Loading 0: 29%|██▉ | 84/291 [00:02<00:10, 20.31it/s] Loading 0: 31%|███ | 90/291 [00:03<00:07, 26.73it/s] Loading 0: 32%|███▏ | 94/291 [00:03<00:07, 27.05it/s] Loading 0: 34%|███▍ | 99/291 [00:03<00:06, 30.13it/s] Loading 0: 35%|███▌ | 103/291 [00:03<00:06, 30.37it/s] Loading 0: 37%|███▋ | 108/291 [00:03<00:05, 33.69it/s] Loading 0: 38%|███▊ | 112/291 [00:03<00:05, 32.72it/s] Loading 0: 40%|███▉ | 116/291 [00:03<00:05, 33.12it/s] Loading 0: 42%|████▏ | 122/291 [00:04<00:04, 36.52it/s] Loading 0: 44%|████▎ | 127/291 [00:04<00:04, 33.55it/s] Loading 0: 46%|████▌ | 133/291 [00:04<00:05, 27.15it/s] Loading 0: 47%|████▋ | 137/291 [00:04<00:05, 26.83it/s] Loading 0: 48%|████▊ | 140/291 [00:04<00:06, 22.84it/s] Loading 0: 50%|████▉ | 145/291 [00:04<00:05, 26.86it/s] Loading 0: 51%|█████ | 149/291 [00:05<00:05, 24.95it/s] Loading 0: 54%|█████▎ | 156/291 [00:05<00:04, 31.29it/s] Loading 0: 55%|█████▍ | 160/291 [00:05<00:04, 29.78it/s] Loading 0: 57%|█████▋ | 165/291 [00:05<00:03, 31.78it/s] Loading 0: 58%|█████▊ | 169/291 [00:05<00:04, 30.16it/s] Loading 0: 60%|█████▉ | 174/291 [00:05<00:03, 32.49it/s] Loading 0: 61%|██████ | 178/291 [00:06<00:03, 30.75it/s] Loading 0: 63%|██████▎ | 182/291 [00:06<00:03, 32.84it/s] Loading 0: 64%|██████▍ | 186/291 [00:06<00:04, 23.25it/s] Loading 0: 65%|██████▍ | 189/291 [00:06<00:04, 21.01it/s] Loading 0: 67%|██████▋ | 194/291 [00:06<00:04, 22.87it/s] Loading 0: 69%|██████▉ | 201/291 [00:06<00:03, 28.46it/s] Loading 0: 70%|███████ | 205/291 [00:07<00:03, 27.95it/s] Loading 0: 72%|███████▏ | 210/291 [00:07<00:02, 30.29it/s] Loading 0: 74%|███████▎ | 214/291 [00:07<00:02, 28.95it/s] Loading 0: 75%|███████▌ | 219/291 [00:07<00:02, 31.16it/s] Loading 0: 77%|███████▋ | 223/291 [00:07<00:02, 29.40it/s] Loading 0: 78%|███████▊ | 227/291 [00:07<00:02, 29.08it/s] Loading 0: 79%|███████▉ | 230/291 [00:07<00:02, 27.39it/s] Loading 0: 80%|████████ | 233/291 [00:08<00:02, 19.59it/s] Loading 0: 82%|████████▏ | 238/291 [00:08<00:02, 24.21it/s] Loading 0: 83%|████████▎ | 241/291 [00:08<00:02, 24.31it/s] Loading 0: 85%|████████▍ | 246/291 [00:08<00:01, 27.72it/s] Loading 0: 86%|████████▌ | 249/291 [00:08<00:01, 24.92it/s] Loading 0: 87%|████████▋ | 253/291 [00:08<00:01, 28.02it/s] Loading 0: 88%|████████▊ | 257/291 [00:09<00:01, 25.12it/s] Loading 0: 90%|█████████ | 262/291 [00:09<00:00, 30.29it/s] Loading 0: 91%|█████████▏| 266/291 [00:09<00:00, 26.83it/s] Loading 0: 94%|█████████▍| 273/291 [00:09<00:00, 33.42it/s] Loading 0: 95%|█████████▌| 277/291 [00:09<00:00, 31.14it/s] Loading 0: 97%|█████████▋| 281/291 [00:09<00:00, 30.56it/s] Loading 0: 98%|█████████▊| 286/291 [00:15<00:02, 2.46it/s] Loading 0: 99%|█████████▉| 289/291 [00:15<00:00, 3.05it/s]
Connection pool is full, discarding connection: %s. Connection pool size: %s
Job chaiml-elo-alignment-run-3-v56-mkmlizer completed after 155.51s with status: succeeded
Stopping job with name chaiml-elo-alignment-run-3-v56-mkmlizer
Pipeline stage MKMLizer completed in 156.03s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.17s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-elo-alignment-run-3-v56
Waiting for inference service chaiml-elo-alignment-run-3-v56 to be ready
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Failed to get response for submission jic062-dpo-v3-0-nemo_v1: ('http://jic062-dpo-v3-0-nemo-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Inference service chaiml-elo-alignment-run-3-v56 ready after 181.03204655647278s
Pipeline stage MKMLDeployer completed in 181.59s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.916252613067627s
Received healthy response to inference request in 1.612868070602417s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 1.3444201946258545s
Received healthy response to inference request in 1.6200740337371826s
Received healthy response to inference request in 1.5782313346862793s
5 requests
0 failed requests
5th percentile: 1.3911824226379395
10th percentile: 1.4379446506500244
20th percentile: 1.5314691066741943
30th percentile: 1.585158681869507
40th percentile: 1.599013376235962
50th percentile: 1.612868070602417
60th percentile: 1.6157504558563232
70th percentile: 1.6186328411102295
80th percentile: 1.6793097496032716
90th percentile: 1.7977811813354492
95th percentile: 1.857016897201538
99th percentile: 1.904405469894409
mean time: 1.6143692493438722
Pipeline stage StressChecker completed in 9.56s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.83s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 2.07s
Shutdown handler de-registered
chaiml-elo-alignment-run-3_v56 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2900.24s
Shutdown handler de-registered
chaiml-elo-alignment-run-3_v56 status is now inactive due to auto deactivation removed underperforming models