developer_uid: chai_backend_admin
submission_id: zonemercy-lexical-viral-_3982_v7
model_name: tempv1-1
model_group: zonemercy/Lexical-Viral-
status: inactive
timestamp: 2024-11-20T10:35:43+00:00
num_battles: 11464
num_wins: 6038
celo_rating: 1272.45
family_friendly_score: 0.5944
family_friendly_standard_error: 0.006943898616771417
submission_type: basic
model_repo: zonemercy/Lexical-Viral-v6ava-22b11e5r256
model_architecture: MistralForCausalLM
model_num_parameters: 22247282688.0
best_of: 4
max_input_tokens: 1024
max_output_tokens: 64
latencies: [{'batch_size': 1, 'throughput': 0.3891163986403314, 'latency_mean': 2.5698282611370087, 'latency_p50': 2.5416650772094727, 'latency_p90': 2.8400869369506836}, {'batch_size': 3, 'throughput': 0.8264381030923006, 'latency_mean': 3.6186043667793273, 'latency_p50': 3.600284218788147, 'latency_p90': 3.989972710609436}, {'batch_size': 5, 'throughput': 1.0839619682628965, 'latency_mean': 4.579213995933532, 'latency_p50': 4.601201891899109, 'latency_p90': 5.136701321601867}, {'batch_size': 6, 'throughput': 1.174699320860571, 'latency_mean': 5.090132170915604, 'latency_p50': 5.123173475265503, 'latency_p90': 5.680956268310547}, {'batch_size': 10, 'throughput': 1.382593238923772, 'latency_mean': 7.169349364042282, 'latency_p50': 7.205963492393494, 'latency_p90': 8.012424564361572}]
gpu_counts: {'NVIDIA RTX A6000': 1}
display_name: tempv1-1
is_internal_developer: True
language_model: zonemercy/Lexical-Viral-v6ava-22b11e5r256
model_size: 22B
ranking_group: single
throughput_3p7s: 0.86
us_pacific_date: 2024-11-20
win_ratio: 0.526692254012561
generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>', '####', 'Bot:', 'User:', 'You:', '<|im_end|>', '<|eot_id|>'], 'max_input_tokens': 1024, 'best_of': 4, 'max_output_tokens': 64}
formatter: {'memory_template': '', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name zonemercy-lexical-viral-3982-v7-mkmlizer
Waiting for job on zonemercy-lexical-viral-3982-v7-mkmlizer to finish
zonemercy-lexical-viral-3982-v7-mkmlizer: Downloaded to shared memory in 74.476s
zonemercy-lexical-viral-3982-v7-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpyefhelwb, device:0
zonemercy-lexical-viral-3982-v7-mkmlizer: Saving flywheel model at /dev/shm/model_cache
zonemercy-lexical-viral-3982-v7-mkmlizer: quantized model in 45.738s
zonemercy-lexical-viral-3982-v7-mkmlizer: Processed model zonemercy/Lexical-Viral-v6ava-22b11e5r256 in 120.214s
zonemercy-lexical-viral-3982-v7-mkmlizer: creating bucket guanaco-mkml-models
zonemercy-lexical-viral-3982-v7-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
zonemercy-lexical-viral-3982-v7-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/zonemercy-lexical-viral-3982-v7
zonemercy-lexical-viral-3982-v7-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/zonemercy-lexical-viral-3982-v7/config.json
zonemercy-lexical-viral-3982-v7-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/zonemercy-lexical-viral-3982-v7/special_tokens_map.json
zonemercy-lexical-viral-3982-v7-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/zonemercy-lexical-viral-3982-v7/tokenizer_config.json
zonemercy-lexical-viral-3982-v7-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/zonemercy-lexical-viral-3982-v7/tokenizer.json
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
zonemercy-lexical-viral-3982-v7-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/zonemercy-lexical-viral-3982-v7/flywheel_model.1.safetensors
zonemercy-lexical-viral-3982-v7-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/zonemercy-lexical-viral-3982-v7/flywheel_model.0.safetensors
zonemercy-lexical-viral-3982-v7-mkmlizer: Loading 0: 0%| | 0/507 [00:00<?, ?it/s] Loading 0: 1%| | 5/507 [00:00<00:19, 25.72it/s] Loading 0: 2%|▏ | 12/507 [00:00<00:12, 40.17it/s] Loading 0: 3%|▎ | 17/507 [00:00<00:13, 37.07it/s] Loading 0: 4%|▍ | 21/507 [00:00<00:13, 37.10it/s] Loading 0: 5%|▍ | 25/507 [00:00<00:13, 35.07it/s] Loading 0: 6%|▌ | 30/507 [00:00<00:12, 37.91it/s] Loading 0: 7%|▋ | 34/507 [00:00<00:13, 35.34it/s] Loading 0: 8%|▊ | 39/507 [00:01<00:12, 36.75it/s] Loading 0: 8%|▊ | 43/507 [00:01<00:13, 33.76it/s] Loading 0: 9%|▉ | 47/507 [00:01<00:14, 31.44it/s] Loading 0: 10%|█ | 51/507 [00:01<00:15, 28.95it/s] Loading 0: 11%|█ | 54/507 [00:01<00:24, 18.14it/s] Loading 0: 11%|█ | 57/507 [00:02<00:23, 18.83it/s] Loading 0: 12%|█▏ | 61/507 [00:02<00:20, 21.98it/s] Loading 0: 13%|█▎ | 65/507 [00:02<00:20, 21.74it/s] Loading 0: 14%|█▍ | 70/507 [00:02<00:16, 26.57it/s] Loading 0: 15%|█▍ | 74/507 [00:02<00:15, 28.68it/s] Loading 0: 16%|█▌ | 80/507 [00:02<00:15, 28.30it/s] Loading 0: 17%|█▋ | 85/507 [00:02<00:13, 30.80it/s] Loading 0: 18%|█▊ | 89/507 [00:03<00:15, 27.82it/s] Loading 0: 19%|█▊ | 94/507 [00:03<00:13, 30.65it/s] Loading 0: 19%|█▉ | 98/507 [00:03<00:15, 27.22it/s] Loading 0: 20%|██ | 103/507 [00:03<00:13, 30.13it/s] Loading 0: 21%|██ | 107/507 [00:03<00:14, 26.74it/s] Loading 0: 22%|██▏ | 112/507 [00:03<00:13, 29.41it/s] Loading 0: 23%|██▎ | 116/507 [00:04<00:20, 19.52it/s] Loading 0: 24%|██▍ | 122/507 [00:04<00:17, 22.55it/s] Loading 0: 25%|██▌ | 127/507 [00:04<00:14, 26.20it/s] Loading 0: 26%|██▌ | 131/507 [00:04<00:15, 25.02it/s] Loading 0: 27%|██▋ | 136/507 [00:04<00:13, 28.42it/s] Loading 0: 28%|██▊ | 140/507 [00:05<00:13, 26.51it/s] Loading 0: 29%|██▊ | 145/507 [00:05<00:12, 29.41it/s] Loading 0: 29%|██▉ | 149/507 [00:05<00:13, 26.88it/s] Loading 0: 30%|███ | 154/507 [00:05<00:11, 30.12it/s] Loading 0: 31%|███ | 158/507 [00:05<00:12, 27.35it/s] Loading 0: 32%|███▏ | 163/507 [00:05<00:11, 31.26it/s] Loading 0: 33%|███▎ | 168/507 [00:05<00:10, 32.09it/s] Loading 0: 34%|███▍ | 172/507 [00:06<00:15, 21.98it/s] Loading 0: 35%|███▍ | 176/507 [00:06<00:14, 22.22it/s] Loading 0: 36%|███▌ | 181/507 [00:06<00:12, 26.93it/s] Loading 0: 36%|███▋ | 185/507 [00:06<00:11, 27.17it/s] Loading 0: 38%|███▊ | 192/507 [00:06<00:08, 35.11it/s] Loading 0: 39%|███▉ | 197/507 [00:06<00:08, 35.88it/s] Loading 0: 40%|███▉ | 202/507 [00:07<00:08, 37.16it/s] Loading 0: 41%|████ | 207/507 [00:07<00:07, 39.72it/s] Loading 0: 42%|████▏ | 212/507 [00:07<00:08, 33.13it/s] Loading 0: 43%|████▎ | 218/507 [00:07<00:07, 37.92it/s] Loading 0: 44%|████▍ | 223/507 [00:07<00:08, 34.31it/s] Loading 0: 45%|████▍ | 227/507 [00:07<00:09, 28.96it/s] Loading 0: 46%|████▌ | 231/507 [00:08<00:09, 28.22it/s] Loading 0: 47%|████▋ | 237/507 [00:08<00:08, 32.60it/s] Loading 0: 48%|████▊ | 241/507 [00:08<00:08, 31.39it/s] Loading 0: 49%|████▊ | 246/507 [00:08<00:07, 33.33it/s] Loading 0: 49%|████▉ | 250/507 [00:08<00:08, 31.12it/s] Loading 0: 50%|█████ | 255/507 [00:08<00:07, 33.47it/s] Loading 0: 51%|█████ | 259/507 [00:08<00:07, 33.17it/s] Loading 0: 52%|█████▏ | 264/507 [00:08<00:06, 35.65it/s] Loading 0: 53%|█████▎ | 268/507 [00:09<00:06, 34.57it/s] Loading 0: 54%|█████▍ | 273/507 [00:09<00:06, 37.09it/s] Loading 0: 55%|█████▍ | 277/507 [00:09<00:06, 35.46it/s] Loading 0: 56%|█████▌ | 283/507 [00:09<00:06, 36.25it/s] Loading 0: 57%|█████▋ | 287/507 [00:09<00:09, 24.25it/s] Loading 0: 58%|█████▊ | 293/507 [00:10<00:08, 25.62it/s] Loading 0: 59%|█████▉ | 298/507 [00:10<00:07, 29.34it/s] Loading 0: 60%|█████▉ | 302/507 [00:24<03:18, 1.03it/s] Loading 0: 61%|██████ | 307/507 [00:25<02:15, 1.48it/s] Loading 0: 61%|██████▏ | 311/507 [00:25<01:40, 1.95it/s] Loading 0: 63%|██████▎ | 317/507 [00:25<01:03, 2.99it/s] Loading 0: 63%|██████▎ | 321/507 [00:25<00:48, 3.85it/s] Loading 0: 64%|██████▍ | 327/507 [00:25<00:31, 5.71it/s] Loading 0: 65%|██████▌ | 331/507 [00:25<00:24, 7.18it/s] Loading 0: 66%|██████▌ | 335/507 [00:26<00:19, 9.05it/s] Loading 0: 67%|██████▋ | 340/507 [00:26<00:15, 10.91it/s] Loading 0: 68%|██████▊ | 344/507 [00:26<00:12, 13.22it/s] Loading 0: 69%|██████▊ | 348/507 [00:26<00:10, 15.19it/s] Loading 0: 70%|██████▉ | 354/507 [00:26<00:07, 20.32it/s] Loading 0: 71%|███████ | 358/507 [00:26<00:06, 22.35it/s] Loading 0: 72%|███████▏ | 363/507 [00:26<00:05, 26.22it/s] Loading 0: 72%|███████▏ | 367/507 [00:27<00:05, 27.04it/s] Loading 0: 73%|███████▎ | 372/507 [00:27<00:04, 30.57it/s] Loading 0: 74%|███████▍ | 376/507 [00:27<00:04, 30.98it/s] Loading 0: 75%|███████▌ | 381/507 [00:27<00:03, 33.87it/s] Loading 0: 76%|███████▌ | 385/507 [00:27<00:03, 33.04it/s] Loading 0: 77%|███████▋ | 389/507 [00:27<00:03, 33.16it/s] Loading 0: 78%|███████▊ | 393/507 [00:27<00:03, 32.67it/s] Loading 0: 78%|███████▊ | 397/507 [00:28<00:04, 24.46it/s] Loading 0: 79%|███████▉ | 401/507 [00:28<00:04, 24.45it/s] Loading 0: 80%|████████ | 406/507 [00:28<00:03, 29.48it/s] Loading 0: 81%|████████ | 410/507 [00:28<00:03, 27.73it/s] Loading 0: 82%|████████▏ | 415/507 [00:28<00:02, 32.54it/s] Loading 0: 83%|████████▎ | 419/507 [00:28<00:02, 30.01it/s] Loading 0: 84%|████████▍ | 426/507 [00:28<00:02, 36.78it/s] Loading 0: 85%|████████▍ | 430/507 [00:29<00:02, 34.73it/s] Loading 0: 86%|████████▌ | 435/507 [00:29<00:01, 36.60it/s] Loading 0: 87%|████████▋ | 439/507 [00:29<00:01, 34.82it/s] Loading 0: 88%|████████▊ | 444/507 [00:29<00:01, 36.74it/s] Loading 0: 88%|████████▊ | 448/507 [00:29<00:01, 35.21it/s] Loading 0: 89%|████████▉ | 453/507 [00:29<00:01, 38.48it/s] Loading 0: 90%|█████████ | 457/507 [00:31<00:08, 5.87it/s] Loading 0: 91%|█████████ | 460/507 [00:32<00:06, 7.02it/s] Loading 0: 92%|█████████▏| 465/507 [00:32<00:04, 9.49it/s] Loading 0: 93%|█████████▎| 470/507 [00:32<00:02, 12.87it/s] Loading 0: 93%|█████████▎| 474/507 [00:32<00:02, 14.72it/s] Loading 0: 94%|█████████▍| 479/507 [00:32<00:01, 18.90it/s] Loading 0: 95%|█████████▌| 483/507 [00:32<00:01, 20.31it/s] Loading 0: 96%|█████████▋| 488/507 [00:32<00:00, 24.89it/s] Loading 0: 97%|█████████▋| 492/507 [00:33<00:00, 24.50it/s] Loading 0: 98%|█████████▊| 497/507 [00:33<00:00, 28.81it/s] Loading 0: 99%|█████████▉| 501/507 [00:33<00:00, 27.29it/s] Loading 0: 100%|█████████▉| 506/507 [00:33<00:00, 31.54it/s]
Job zonemercy-lexical-viral-3982-v7-mkmlizer completed after 154.48s with status: succeeded
Stopping job with name zonemercy-lexical-viral-3982-v7-mkmlizer
Pipeline stage MKMLizer completed in 155.01s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.21s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service zonemercy-lexical-viral-3982-v7
Waiting for inference service zonemercy-lexical-viral-3982-v7 to be ready
Inference service zonemercy-lexical-viral-3982-v7 ready after 220.962468624115s
Pipeline stage MKMLDeployer completed in 221.57s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.712014675140381s
Received healthy response to inference request in 2.208693504333496s
Received healthy response to inference request in 2.401475191116333s
Received healthy response to inference request in 2.011545419692993s
5 requests
1 failed requests
5th percentile: 2.050975036621094
10th percentile: 2.0904046535491942
20th percentile: 2.1692638874053953
30th percentile: 2.2472498416900635
40th percentile: 2.3243625164031982
50th percentile: 2.401475191116333
60th percentile: 2.5256909847259523
70th percentile: 2.649906778335571
80th percentile: 6.200817298889163
90th percentile: 13.17842254638672
95th percentile: 16.667225170135495
99th percentile: 19.45826726913452
mean time: 5.897951316833496
%s, retrying in %s seconds...
Received healthy response to inference request in 2.4960391521453857s
Received healthy response to inference request in 2.3896889686584473s
Received healthy response to inference request in 2.415201187133789s
Received healthy response to inference request in 2.464207410812378s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 2.2304627895355225s
5 requests
0 failed requests
5th percentile: 2.2623080253601073
10th percentile: 2.294153261184692
20th percentile: 2.3578437328338624
30th percentile: 2.3947914123535154
40th percentile: 2.4049962997436523
50th percentile: 2.415201187133789
60th percentile: 2.434803676605225
70th percentile: 2.45440616607666
80th percentile: 2.4705737590789796
90th percentile: 2.4833064556121824
95th percentile: 2.489672803878784
99th percentile: 2.4947658824920653
mean time: 2.3991199016571043
Pipeline stage StressChecker completed in 44.44s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.76s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 3.30s
Shutdown handler de-registered
zonemercy-lexical-viral-_3982_v7 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3189.48s
Shutdown handler de-registered
zonemercy-lexical-viral-_3982_v7 status is now inactive due to auto deactivation removed underperforming models