developer_uid: zonemercy
submission_id: chaiml-lexical-viral-v6a_2464_v6
model_name: tempv1-2
model_group: ChaiML/Lexical-Viral-v6a
status: inactive
timestamp: 2024-12-11T15:40:57+00:00
num_battles: 12306
num_wins: 6307
celo_rating: 1273.57
family_friendly_score: 0.6078
family_friendly_standard_error: 0.006904768786860281
submission_type: basic
model_repo: ChaiML/Lexical-Viral-v6ava-12b01e5r256
model_architecture: MistralForCausalLM
model_num_parameters: 12772070400.0
best_of: 8
max_input_tokens: 1024
max_output_tokens: 64
latencies: [{'batch_size': 1, 'throughput': 0.6139596188766926, 'latency_mean': 1.6287116515636444, 'latency_p50': 1.6182618141174316, 'latency_p90': 1.811811590194702}, {'batch_size': 3, 'throughput': 1.1185258181389666, 'latency_mean': 2.6715712094306947, 'latency_p50': 2.6773011684417725, 'latency_p90': 2.920884943008423}, {'batch_size': 5, 'throughput': 1.3584968680989529, 'latency_mean': 3.6603198146820066, 'latency_p50': 3.6548603773117065, 'latency_p90': 4.011550092697143}, {'batch_size': 6, 'throughput': 1.4281302885393992, 'latency_mean': 4.181557954549789, 'latency_p50': 4.205661177635193, 'latency_p90': 4.6759352684021}, {'batch_size': 8, 'throughput': 1.5062801439793145, 'latency_mean': 5.283517218828202, 'latency_p50': 5.252646803855896, 'latency_p90': 6.0156331062316895}, {'batch_size': 10, 'throughput': 1.5361012519865163, 'latency_mean': 6.460727610588074, 'latency_p50': 6.416983485221863, 'latency_p90': 7.244562029838562}]
gpu_counts: {'NVIDIA RTX A5000': 1}
display_name: tempv1-2
is_internal_developer: True
language_model: ChaiML/Lexical-Viral-v6ava-12b01e5r256
model_size: 13B
ranking_group: single
throughput_3p7s: 1.37
us_pacific_date: 2024-12-11
win_ratio: 0.512514220705347
generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>', '####', 'Bot:', 'User:', 'You:', '<|im_end|>', '<|eot_id|>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
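The formatter above suggests how a conversation might be flattened into a single prompt string. The following is a minimal sketch, assuming each turn is rendered with its template in order and the response template is appended last. The function `build_prompt` and the `(speaker, message)` turn format are hypothetical, and the real pipeline's assembly and truncation logic is not shown in this log.

```python
# Template strings copied from the formatter entry above; the empty
# memory_template and prompt_template contribute nothing and are omitted.
formatter = {
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(turns, bot_name, user_name):
    """Render (speaker, message) turns into one prompt string.

    Hypothetical helper: speaker is "bot" or "user". Truncation to
    max_input_tokens is intentionally not modelled here.
    """
    parts = []
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response template leaves the bot's line open for the model to complete.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt([("user", "Hi"), ("bot", "Hello")], "Bot", "User")
print(repr(example))
```

Since every turn ends in a newline, the `'\n'` entry in `stopping_words` would presumably cut generation at the end of the bot's single line.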
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-lexical-viral-v6a-2464-v6-mkmlizer
Waiting for job on chaiml-lexical-viral-v6a-2464-v6-mkmlizer to finish
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: ║ flywheel ║
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: ║ ║
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: ║ Version: 0.11.12 ║
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: ║ https://mk1.ai ║
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: ║ ║
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: ║ belonging to: ║
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: ║ ║
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: ║ Chai Research Corp. ║
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: ║ Expiration: 2025-01-15 23:59:59 ║
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: ║ ║
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: Downloaded to shared memory in 38.829s
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpztxinp3v, device:0
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Connection pool is full, discarding connection: %s. Connection pool size: %s
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: quantized model in 38.908s
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: Processed model ChaiML/Lexical-Viral-v6ava-12b01e5r256 in 77.738s
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: creating bucket guanaco-mkml-models
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-lexical-viral-v6a-2464-v6
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-lexical-viral-v6a-2464-v6/config.json
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-lexical-viral-v6a-2464-v6/special_tokens_map.json
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-lexical-viral-v6a-2464-v6/tokenizer_config.json
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-lexical-viral-v6a-2464-v6/tokenizer.json
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-lexical-viral-v6a-2464-v6/flywheel_model.0.safetensors
chaiml-lexical-viral-v6a-2464-v6-mkmlizer: Loading 0: 99%|█████████▉| 361/363 [00:17<00:00, 33.98it/s]
Job chaiml-lexical-viral-v6a-2464-v6-mkmlizer completed after 104.4s with status: succeeded
Stopping job with name chaiml-lexical-viral-v6a-2464-v6-mkmlizer
Pipeline stage MKMLizer completed in 104.93s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-lexical-viral-v6a-2464-v6
Waiting for inference service chaiml-lexical-viral-v6a-2464-v6 to be ready
Inference service chaiml-lexical-viral-v6a-2464-v6 ready after 180.65349674224854s
Pipeline stage MKMLDeployer completed in 181.29s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.9747648239135742s
Received healthy response to inference request in 1.4620575904846191s
Received healthy response to inference request in 1.7364509105682373s
Received healthy response to inference request in 1.8659546375274658s
Received healthy response to inference request in 1.4365785121917725s
5 requests
0 failed requests
5th percentile: 1.4416743278503419
10th percentile: 1.446770143508911
20th percentile: 1.4569617748260497
30th percentile: 1.5169362545013427
40th percentile: 1.6266935825347901
50th percentile: 1.7364509105682373
60th percentile: 1.7882524013519288
70th percentile: 1.8400538921356202
80th percentile: 1.8877166748046874
90th percentile: 1.9312407493591308
95th percentile: 1.9530027866363526
99th percentile: 1.9704124164581298
mean time: 1.6951612949371337
Pipeline stage StressChecker completed in 9.94s
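The StressChecker percentiles above are consistent with linear interpolation between the sorted response times. The sketch below recomputes them from the five logged latencies. The `percentile` helper is illustrative and is an assumption about the method, not the pipeline's actual code.

```python
# The five healthy-response latencies (seconds) logged by StressChecker.
latencies = sorted([
    1.9747648239135742,
    1.4620575904846191,
    1.7364509105682373,
    1.8659546375274658,
    1.4365785121917725,
])


def percentile(sorted_vals, p):
    """Percentile by linear interpolation between the two closest ranks."""
    rank = (len(sorted_vals) - 1) * p / 100.0
    lo = int(rank)                           # index of the lower neighbour
    hi = min(lo + 1, len(sorted_vals) - 1)   # index of the upper neighbour
    frac = rank - lo                         # position between the neighbours
    return sorted_vals[lo] + (sorted_vals[hi] - sorted_vals[lo]) * frac


mean_time = sum(latencies) / len(latencies)

for p in (5, 50, 90, 99):
    print(f"{p}th percentile: {percentile(latencies, p)}")
print(f"mean time: {mean_time}")
```

With only five samples, every percentile other than the 0th, 25th, 50th, 75th and 100th is an interpolated value, so the tail figures (95th, 99th) say little more than the slowest single request does.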
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.48s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 2.37s
Shutdown handler de-registered
chaiml-lexical-viral-v6a_2464_v6 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2636.95s
Shutdown handler de-registered
chaiml-lexical-viral-v6a_2464_v6 status is now inactive due to auto deactivation removed underperforming models