developer_uid: chai_backend_admin
submission_id: zonemercy-lexical-viral-_2744_v1
model_name: tempv1-2
model_group: zonemercy/Lexical-Viral-
status: inactive
timestamp: 2024-11-19T14:22:21+00:00
num_battles: 11816
num_wins: 6272
celo_rating: 1271.98
family_friendly_score: 0.5933999999999999
family_friendly_standard_error: 0.006946602622865367
submission_type: basic
model_repo: zonemercy/Lexical-Viral-v6ava-22b11e5r256b005
model_architecture: MistralForCausalLM
model_num_parameters: 22247282688.0
best_of: 4
max_input_tokens: 1024
max_output_tokens: 64
latencies: [{'batch_size': 1, 'throughput': 0.3880090106831347, 'latency_mean': 2.577188345193863, 'latency_p50': 2.570794105529785, 'latency_p90': 2.8363982677459716}, {'batch_size': 3, 'throughput': 0.8177747495832137, 'latency_mean': 3.660635055303574, 'latency_p50': 3.667447328567505, 'latency_p90': 4.003407287597656}, {'batch_size': 5, 'throughput': 1.088784237068102, 'latency_mean': 4.560286523103714, 'latency_p50': 4.5873624086380005, 'latency_p90': 5.083187794685363}, {'batch_size': 6, 'throughput': 1.1720990601078418, 'latency_mean': 5.09754754781723, 'latency_p50': 5.092857003211975, 'latency_p90': 5.772873044013977}, {'batch_size': 10, 'throughput': 1.3752253670019974, 'latency_mean': 7.18801526427269, 'latency_p50': 7.127569913864136, 'latency_p90': 8.104216384887696}]
gpu_counts: {'NVIDIA RTX A6000': 1}
display_name: tempv1-2
is_internal_developer: True
language_model: zonemercy/Lexical-Viral-v6ava-22b11e5r256b005
model_size: 22B
ranking_group: single
throughput_3p7s: 0.83
us_pacific_date: 2024-11-19
win_ratio: 0.5308056872037915
generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>', '####', 'Bot:', 'User:', 'You:', '<|im_end|>', '<|eot_id|>'], 'max_input_tokens': 1024, 'best_of': 4, 'max_output_tokens': 64}
formatter: {'memory_template': '', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
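The `win_ratio` field above appears to be `num_wins` divided by `num_battles`. The following is a minimal sketch of that check, assuming this relationship holds; the variable names simply mirror the metadata keys and are not taken from any pipeline code.

```python
# Values copied from the metadata fields above.
num_battles = 11816
num_wins = 6272

# Assumption: win_ratio is the plain fraction of battles won.
win_ratio = num_wins / num_battles

print(f"win_ratio: {win_ratio}")
```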
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name zonemercy-lexical-viral-2744-v1-mkmlizer
Waiting for job on zonemercy-lexical-viral-2744-v1-mkmlizer to finish
Failed to get response for submission bbchicago-nana-nemo-12b-v1-0_v8: ('http://bbchicago-nana-nemo-12b-v1-0-v8-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Connection pool is full, discarding connection: %s. Connection pool size: %s
zonemercy-lexical-viral-2744-v1-mkmlizer: Downloaded to shared memory in 96.106s
zonemercy-lexical-viral-2744-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpwanddmi_, device:0
zonemercy-lexical-viral-2744-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
zonemercy-lexical-viral-2744-v1-mkmlizer: quantized model in 47.994s
zonemercy-lexical-viral-2744-v1-mkmlizer: Processed model zonemercy/Lexical-Viral-v6ava-22b11e5r256b005 in 144.100s
zonemercy-lexical-viral-2744-v1-mkmlizer: creating bucket guanaco-mkml-models
zonemercy-lexical-viral-2744-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
zonemercy-lexical-viral-2744-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/zonemercy-lexical-viral-2744-v1
zonemercy-lexical-viral-2744-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/zonemercy-lexical-viral-2744-v1/special_tokens_map.json
zonemercy-lexical-viral-2744-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/zonemercy-lexical-viral-2744-v1/tokenizer_config.json
zonemercy-lexical-viral-2744-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/zonemercy-lexical-viral-2744-v1/tokenizer.json
zonemercy-lexical-viral-2744-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/zonemercy-lexical-viral-2744-v1/flywheel_model.1.safetensors
zonemercy-lexical-viral-2744-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/zonemercy-lexical-viral-2744-v1/flywheel_model.0.safetensors
zonemercy-lexical-viral-2744-v1-mkmlizer: Loading 0: 100%|█████████▉| 506/507 [00:34<00:00, 30.95it/s]
Job zonemercy-lexical-viral-2744-v1-mkmlizer completed after 185.0s with status: succeeded
Stopping job with name zonemercy-lexical-viral-2744-v1-mkmlizer
Pipeline stage MKMLizer completed in 185.68s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.20s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service zonemercy-lexical-viral-2744-v1
Waiting for inference service zonemercy-lexical-viral-2744-v1 to be ready
Connection pool is full, discarding connection: %s. Connection pool size: %s
Inference service zonemercy-lexical-viral-2744-v1 ready after 211.34659957885742s
Pipeline stage MKMLDeployer completed in 211.93s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.8306477069854736s
Received healthy response to inference request in 2.970393419265747s
Received healthy response to inference request in 2.545407295227051s
Received healthy response to inference request in 2.652012586593628s
Received healthy response to inference request in 2.4715325832366943s
5 requests
0 failed requests
5th percentile: 2.4863075256347655
10th percentile: 2.5010824680328367
20th percentile: 2.5306323528289796
30th percentile: 2.566728353500366
40th percentile: 2.609370470046997
50th percentile: 2.652012586593628
60th percentile: 2.723466634750366
70th percentile: 2.7949206829071045
80th percentile: 2.858596849441528
90th percentile: 2.9144951343536376
95th percentile: 2.9424442768096926
99th percentile: 2.9648035907745363
mean time: 2.6939987182617187
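The percentile and mean figures above are consistent with linear interpolation between the sorted samples, which is also the default method of `numpy.percentile`. The following is a minimal standard-library sketch that recomputes them from the five logged response times. The helper name `percentile` is hypothetical, and whether StressChecker computes its summary this way is an assumption.

```python
import math

def percentile(samples, q):
    """Percentile q (0-100) using linear interpolation between sorted samples."""
    ordered = sorted(samples)
    # Fractional position of the requested percentile in the sorted list.
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac

# Response times (seconds) copied from the five healthy responses logged above.
latencies = [
    2.8306477069854736,
    2.970393419265747,
    2.545407295227051,
    2.652012586593628,
    2.4715325832366943,
]

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")

mean_time = sum(latencies) / len(latencies)
print(f"mean time: {mean_time}")
```

With only five samples, the tail percentiles (95th, 99th) are interpolated between the two slowest requests, so they say little about real tail latency.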
Pipeline stage StressChecker completed in 14.70s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.27s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 2.14s
Shutdown handler de-registered
zonemercy-lexical-viral-_2744_v1 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3171.39s
Shutdown handler de-registered
zonemercy-lexical-viral-_2744_v1 status is now inactive due to auto deactivation removed underperforming models