developer_uid: chai_backend_admin
submission_id: zonemercy-lexical-viral-_5364_v2
model_name: tempv1-4
model_group: zonemercy/Lexical-Viral-
status: inactive
timestamp: 2024-11-19T20:58:22+00:00
num_battles: 12106
num_wins: 6379
celo_rating: 1272.89
family_friendly_score: 0.5986
family_friendly_standard_error: 0.006932215230357464
submission_type: basic
model_repo: zonemercy/Lexical-Viral-v6ava-22b11e5r256la32
model_architecture: MistralForCausalLM
model_num_parameters: 22247282688.0
best_of: 4
max_input_tokens: 1024
max_output_tokens: 64
latencies: [{'batch_size': 1, 'throughput': 0.3879264797642067, 'latency_mean': 2.577747809886932, 'latency_p50': 2.58116614818573, 'latency_p90': 2.8463711500167848}, {'batch_size': 3, 'throughput': 0.8123810684730951, 'latency_mean': 3.6856809210777284, 'latency_p50': 3.6808247566223145, 'latency_p90': 4.059659504890441}, {'batch_size': 5, 'throughput': 1.083014388723131, 'latency_mean': 4.597458537817001, 'latency_p50': 4.61540412902832, 'latency_p90': 5.15342571735382}, {'batch_size': 6, 'throughput': 1.1586480131190333, 'latency_mean': 5.133719094991684, 'latency_p50': 5.137701511383057, 'latency_p90': 5.73266122341156}, {'batch_size': 10, 'throughput': 1.3771176500334972, 'latency_mean': 7.206994714736939, 'latency_p50': 7.227896332740784, 'latency_p90': 8.111801314353942}]
gpu_counts: {'NVIDIA RTX A6000': 1}
display_name: tempv1-4
is_internal_developer: True
language_model: zonemercy/Lexical-Viral-v6ava-22b11e5r256la32
model_size: 22B
ranking_group: single
throughput_3p7s: 0.82
us_pacific_date: 2024-11-19
win_ratio: 0.5269287956385263
generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>', '####', 'Bot:', 'User:', 'You:', '<|im_end|>', '<|eot_id|>'], 'max_input_tokens': 1024, 'best_of': 4, 'max_output_tokens': 64}
formatter: {'memory_template': '', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name zonemercy-lexical-viral-5364-v2-mkmlizer
Waiting for job on zonemercy-lexical-viral-5364-v2-mkmlizer to finish
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
zonemercy-lexical-viral-5364-v2-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
zonemercy-lexical-viral-5364-v2-mkmlizer: ║ _____ __ __ ║
zonemercy-lexical-viral-5364-v2-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
zonemercy-lexical-viral-5364-v2-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
zonemercy-lexical-viral-5364-v2-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
zonemercy-lexical-viral-5364-v2-mkmlizer: ║ /___/ ║
zonemercy-lexical-viral-5364-v2-mkmlizer: ║ ║
zonemercy-lexical-viral-5364-v2-mkmlizer: ║ Version: 0.11.12 ║
zonemercy-lexical-viral-5364-v2-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
zonemercy-lexical-viral-5364-v2-mkmlizer: ║ https://mk1.ai ║
zonemercy-lexical-viral-5364-v2-mkmlizer: ║ ║
zonemercy-lexical-viral-5364-v2-mkmlizer: ║ The license key for the current software has been verified as ║
zonemercy-lexical-viral-5364-v2-mkmlizer: ║ belonging to: ║
zonemercy-lexical-viral-5364-v2-mkmlizer: ║ ║
zonemercy-lexical-viral-5364-v2-mkmlizer: ║ Chai Research Corp. ║
zonemercy-lexical-viral-5364-v2-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
zonemercy-lexical-viral-5364-v2-mkmlizer: ║ Expiration: 2025-01-15 23:59:59 ║
zonemercy-lexical-viral-5364-v2-mkmlizer: ║ ║
zonemercy-lexical-viral-5364-v2-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Failed to get response for submission chaiml-virgo-edit-v1-1e5_v9: ('http://chaiml-virgo-edit-v1-1e5-v9-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'read tcp 127.0.0.1:48066->127.0.0.1:8080: read: connection reset by peer\n')
zonemercy-lexical-viral-5364-v2-mkmlizer: Downloaded to shared memory in 67.047s
zonemercy-lexical-viral-5364-v2-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpshvhb_2a, device:0
zonemercy-lexical-viral-5364-v2-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Connection pool is full, discarding connection: %s. Connection pool size: %s
zonemercy-lexical-viral-5364-v2-mkmlizer: quantized model in 46.849s
zonemercy-lexical-viral-5364-v2-mkmlizer: Processed model zonemercy/Lexical-Viral-v6ava-22b11e5r256la32 in 113.897s
zonemercy-lexical-viral-5364-v2-mkmlizer: creating bucket guanaco-mkml-models
zonemercy-lexical-viral-5364-v2-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
zonemercy-lexical-viral-5364-v2-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/zonemercy-lexical-viral-5364-v2
zonemercy-lexical-viral-5364-v2-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/zonemercy-lexical-viral-5364-v2/config.json
zonemercy-lexical-viral-5364-v2-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/zonemercy-lexical-viral-5364-v2/special_tokens_map.json
zonemercy-lexical-viral-5364-v2-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/zonemercy-lexical-viral-5364-v2/tokenizer_config.json
zonemercy-lexical-viral-5364-v2-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/zonemercy-lexical-viral-5364-v2/tokenizer.json
zonemercy-lexical-viral-5364-v2-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/zonemercy-lexical-viral-5364-v2/flywheel_model.1.safetensors
zonemercy-lexical-viral-5364-v2-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/zonemercy-lexical-viral-5364-v2/flywheel_model.0.safetensors
zonemercy-lexical-viral-5364-v2-mkmlizer: Loading 0: 0%| | 0/507 [00:00<?, ?it/s] Loading 0: 1%| | 5/507 [00:00<00:24, 20.66it/s] Loading 0: 2%|▏ | 10/507 [00:00<00:16, 29.91it/s] Loading 0: 3%|▎ | 14/507 [00:00<00:19, 24.83it/s] Loading 0: 4%|▎ | 19/507 [00:00<00:16, 30.23it/s] Loading 0: 5%|▍ | 23/507 [00:00<00:19, 25.47it/s] Loading 0: 6%|▌ | 28/507 [00:01<00:16, 29.86it/s] Loading 0: 6%|▋ | 32/507 [00:01<00:17, 27.83it/s] Loading 0: 7%|▋ | 37/507 [00:01<00:14, 31.44it/s] Loading 0: 8%|▊ | 41/507 [00:01<00:16, 27.84it/s] Loading 0: 9%|▉ | 46/507 [00:01<00:14, 32.08it/s] Loading 0: 10%|▉ | 50/507 [00:01<00:15, 29.56it/s] Loading 0: 11%|█ | 54/507 [00:02<00:22, 20.18it/s] Loading 0: 11%|█ | 57/507 [00:02<00:22, 19.86it/s] Loading 0: 12%|█▏ | 61/507 [00:02<00:19, 22.83it/s] Loading 0: 13%|█▎ | 65/507 [00:02<00:20, 21.92it/s] Loading 0: 14%|█▍ | 70/507 [00:02<00:16, 26.86it/s] Loading 0: 15%|█▍ | 75/507 [00:02<00:14, 30.36it/s] Loading 0: 16%|█▌ | 80/507 [00:02<00:13, 30.77it/s] Loading 0: 17%|█▋ | 85/507 [00:03<00:12, 34.60it/s] Loading 0: 18%|█▊ | 89/507 [00:03<00:14, 29.35it/s] Loading 0: 19%|█▉ | 96/507 [00:03<00:11, 35.14it/s] Loading 0: 20%|█▉ | 100/507 [00:03<00:12, 33.54it/s] Loading 0: 21%|██ | 104/507 [00:03<00:11, 35.00it/s] Loading 0: 21%|██▏ | 108/507 [00:03<00:13, 30.23it/s] Loading 0: 22%|██▏ | 112/507 [00:03<00:12, 31.97it/s] Loading 0: 23%|██▎ | 116/507 [00:04<00:19, 20.47it/s] Loading 0: 24%|██▍ | 122/507 [00:04<00:16, 23.24it/s] Loading 0: 25%|██▌ | 129/507 [00:04<00:12, 29.33it/s] Loading 0: 26%|██▌ | 133/507 [00:04<00:12, 29.43it/s] Loading 0: 27%|██▋ | 138/507 [00:04<00:11, 32.15it/s] Loading 0: 28%|██▊ | 142/507 [00:05<00:11, 31.58it/s] Loading 0: 29%|██▉ | 147/507 [00:05<00:10, 34.05it/s] Loading 0: 30%|██▉ | 151/507 [00:05<00:10, 32.59it/s] Loading 0: 31%|███ | 156/507 [00:05<00:10, 34.67it/s] Loading 0: 32%|███▏ | 160/507 [00:05<00:10, 32.67it/s] Loading 0: 32%|███▏ | 164/507 [00:05<00:10, 32.63it/s] Loading 0: 33%|███▎ | 169/507 [00:05<00:12, 26.21it/s] Loading 0: 34%|███▍ | 172/507 [00:06<00:13, 24.95it/s] Loading 0: 35%|███▍ | 176/507 [00:06<00:13, 24.57it/s] Loading 0: 36%|███▌ | 181/507 [00:06<00:10, 29.77it/s] Loading 0: 36%|███▋ | 185/507 [00:06<00:11, 28.60it/s] Loading 0: 37%|███▋ | 190/507 [00:06<00:09, 33.18it/s] Loading 0: 38%|███▊ | 194/507 [00:06<00:10, 30.37it/s] Loading 0: 40%|███▉ | 201/507 [00:06<00:08, 37.15it/s] Loading 0: 40%|████ | 205/507 [00:07<00:08, 36.12it/s] Loading 0: 41%|████▏ | 210/507 [00:07<00:07, 37.29it/s] Loading 0: 42%|████▏ | 214/507 [00:07<00:08, 35.29it/s] Loading 0: 43%|████▎ | 218/507 [00:07<00:08, 34.07it/s] Loading 0: 44%|████▍ | 222/507 [00:07<00:08, 32.55it/s] Loading 0: 45%|████▍ | 226/507 [00:07<00:11, 24.68it/s] Loading 0: 45%|████▌ | 230/507 [00:07<00:11, 24.03it/s] Loading 0: 46%|████▋ | 235/507 [00:08<00:09, 29.01it/s] Loading 0: 47%|████▋ | 239/507 [00:08<00:09, 27.11it/s] Loading 0: 49%|████▊ | 246/507 [00:08<00:07, 33.66it/s] Loading 0: 49%|████▉ | 250/507 [00:08<00:07, 32.38it/s] Loading 0: 50%|█████ | 255/507 [00:08<00:07, 33.77it/s] Loading 0: 51%|█████ | 259/507 [00:08<00:07, 32.42it/s] Loading 0: 52%|█████▏ | 264/507 [00:08<00:07, 34.39it/s] Loading 0: 53%|█████▎ | 268/507 [00:09<00:07, 31.70it/s] Loading 0: 54%|█████▍ | 273/507 [00:09<00:06, 34.05it/s] Loading 0: 55%|█████▍ | 277/507 [00:09<00:07, 31.78it/s] Loading 0: 55%|█████▌ | 281/507 [00:09<00:06, 33.69it/s] Loading 0: 56%|█████▌ | 285/507 [00:09<00:09, 24.47it/s] Loading 0: 57%|█████▋ | 288/507 [00:09<00:09, 23.62it/s] Loading 0: 58%|█████▊ | 293/507 [00:10<00:08, 24.52it/s] Loading 0: 59%|█████▉ | 298/507 [00:10<00:07, 29.53it/s] Loading 0: 59%|█████▉ | 299/507 [00:24<00:07, 29.53it/s] Loading 0: 59%|█████▉ | 300/507 [00:24<04:14, 1.23s/it] Loading 0: 60%|█████▉ | 302/507 [00:25<03:27, 1.01s/it] Loading 0: 61%|██████ | 307/507 [00:25<02:04, 1.61it/s] Loading 0: 61%|██████ | 310/507 [00:25<01:33, 2.11it/s] Loading 0: 62%|██████▏ | 313/507 [00:25<01:09, 2.80it/s] Loading 0: 62%|██████▏ | 316/507 [00:25<00:51, 3.73it/s] Loading 0: 63%|██████▎ | 320/507 [00:25<00:35, 5.20it/s] Loading 0: 64%|██████▍ | 325/507 [00:26<00:23, 7.86it/s] Loading 0: 65%|██████▍ | 329/507 [00:26<00:18, 9.80it/s] Loading 0: 66%|██████▌ | 335/507 [00:26<00:12, 14.17it/s] Loading 0: 67%|██████▋ | 339/507 [00:26<00:09, 17.20it/s] Loading 0: 68%|██████▊ | 343/507 [00:26<00:10, 15.55it/s] Loading 0: 68%|██████▊ | 347/507 [00:26<00:09, 16.94it/s] Loading 0: 70%|██████▉ | 354/507 [00:27<00:06, 23.66it/s] Loading 0: 71%|███████ | 358/507 [00:27<00:06, 24.72it/s] Loading 0: 71%|███████▏ | 362/507 [00:27<00:05, 26.86it/s] Loading 0: 72%|███████▏ | 366/507 [00:27<00:05, 25.78it/s] Loading 0: 73%|███████▎ | 370/507 [00:27<00:04, 28.53it/s] Loading 0: 74%|███████▍ | 374/507 [00:27<00:05, 26.46it/s] Loading 0: 75%|███████▌ | 381/507 [00:27<00:03, 33.53it/s] Loading 0: 76%|███████▌ | 385/507 [00:28<00:03, 32.96it/s] Loading 0: 77%|███████▋ | 389/507 [00:28<00:03, 32.95it/s] Loading 0: 78%|███████▊ | 393/507 [00:28<00:03, 32.07it/s] Loading 0: 78%|███████▊ | 397/507 [00:28<00:04, 23.60it/s] Loading 0: 79%|███████▉ | 401/507 [00:28<00:04, 22.99it/s] Loading 0: 80%|████████ | 406/507 [00:28<00:03, 27.90it/s] Loading 0: 81%|████████ | 410/507 [00:29<00:03, 26.63it/s] Loading 0: 82%|████████▏ | 415/507 [00:29<00:02, 31.40it/s] Loading 0: 83%|████████▎ | 419/507 [00:29<00:03, 29.00it/s] Loading 0: 84%|████████▎ | 424/507 [00:29<00:02, 33.58it/s] Loading 0: 84%|████████▍ | 428/507 [00:29<00:02, 29.88it/s] Loading 0: 85%|████████▌ | 433/507 [00:29<00:02, 34.37it/s] Loading 0: 86%|████████▌ | 437/507 [00:29<00:02, 30.80it/s] Loading 0: 88%|████████▊ | 444/507 [00:29<00:01, 38.40it/s] Loading 0: 89%|████████▊ | 449/507 [00:30<00:01, 36.47it/s] Loading 0: 90%|████████▉ | 454/507 [00:30<00:01, 35.04it/s] Loading 0: 90%|█████████ | 458/507 [00:32<00:08, 5.89it/s] Loading 0: 91%|█████████ | 461/507 [00:32<00:06, 7.00it/s] Loading 0: 92%|█████████▏| 465/507 [00:32<00:04, 8.82it/s] Loading 0: 93%|█████████▎| 472/507 [00:33<00:02, 13.45it/s] Loading 0: 94%|█████████▍| 476/507 [00:33<00:01, 15.61it/s] Loading 0: 95%|█████████▍| 481/507 [00:33<00:01, 19.29it/s] Loading 0: 96%|█████████▌| 485/507 [00:33<00:01, 21.59it/s] Loading 0: 97%|█████████▋| 490/507 [00:33<00:00, 24.94it/s] Loading 0: 97%|█████████▋| 494/507 [00:33<00:00, 26.00it/s] Loading 0: 98%|█████████▊| 498/507 [00:33<00:00, 28.78it/s] Loading 0: 99%|█████████▉| 502/507 [00:33<00:00, 26.76it/s] Loading 0: 100%|█████████▉| 506/507 [00:34<00:00, 28.41it/s]
Job zonemercy-lexical-viral-5364-v2-mkmlizer completed after 157.14s with status: succeeded
Stopping job with name zonemercy-lexical-viral-5364-v2-mkmlizer
Pipeline stage MKMLizer completed in 157.71s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.18s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service zonemercy-lexical-viral-5364-v2
Waiting for inference service zonemercy-lexical-viral-5364-v2 to be ready
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Inference service zonemercy-lexical-viral-5364-v2 ready after 221.77384757995605s
Pipeline stage MKMLDeployer completed in 222.45s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 3.009092092514038s
Received healthy response to inference request in 2.4410033226013184s
Received healthy response to inference request in 2.3810276985168457s
Received healthy response to inference request in 2.525416851043701s
5 requests
1 failed requests
5th percentile: 2.39302282333374
10th percentile: 2.405017948150635
20th percentile: 2.429008197784424
30th percentile: 2.4578860282897947
40th percentile: 2.491651439666748
50th percentile: 2.525416851043701
60th percentile: 2.718886947631836
70th percentile: 2.9123570442199704
80th percentile: 6.448871850967411
90th percentile: 13.328431367874147
95th percentile: 16.76821112632751
99th percentile: 19.52003493309021
mean time: 6.112906169891358
%s, retrying in %s seconds...
Received healthy response to inference request in 2.6836447715759277s
Received healthy response to inference request in 2.7500510215759277s
Received healthy response to inference request in 2.302591323852539s
Received healthy response to inference request in 2.683135747909546s
Received healthy response to inference request in 2.22613787651062s
5 requests
0 failed requests
5th percentile: 2.241428565979004
10th percentile: 2.2567192554473876
20th percentile: 2.287300634384155
30th percentile: 2.3787002086639406
40th percentile: 2.5309179782867433
50th percentile: 2.683135747909546
60th percentile: 2.6833393573760986
70th percentile: 2.6835429668426514
80th percentile: 2.6969260215759276
90th percentile: 2.723488521575928
95th percentile: 2.736769771575928
99th percentile: 2.747394771575928
mean time: 2.529112148284912
Pipeline stage StressChecker completed in 46.57s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.79s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 3.80s
Shutdown handler de-registered
zonemercy-lexical-viral-_5364_v2 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3135.15s
Shutdown handler de-registered
zonemercy-lexical-viral-_5364_v2 status is now inactive due to auto deactivation removed underperforming models