developer_uid: rirv938
submission_id: rirv938-anthropic-beta-_22448_v1
model_name: rirv938-anthropic-beta-_22448_v1
model_group: rirv938/anthropic_beta_2
status: torndown
timestamp: 2024-12-29T17:34:25+00:00
num_battles: 28558
num_wins: 14839
celo_rating: 1283.05
family_friendly_score: 0.6004
family_friendly_standard_error: 0.006927046123709586
submission_type: basic
model_repo: rirv938/anthropic_beta_2_40k_624_bo8_v2
model_architecture: MistralForCausalLM
model_num_parameters: 12772070400.0
best_of: 4
max_input_tokens: 512
max_output_tokens: 64
latencies: [{'batch_size': 1, 'throughput': 0.7195131944047823, 'latency_mean': 1.3897669899463654, 'latency_p50': 1.3864517211914062, 'latency_p90': 1.5382816314697265}, {'batch_size': 5, 'throughput': 2.1590684479176283, 'latency_mean': 2.305686801671982, 'latency_p50': 2.3150484561920166, 'latency_p90': 2.568450284004211}, {'batch_size': 10, 'throughput': 2.865704482861323, 'latency_mean': 3.4555197739601136, 'latency_p50': 3.4781490564346313, 'latency_p90': 3.885119414329529}, {'batch_size': 15, 'throughput': 3.0940819514879943, 'latency_mean': 4.777771104574203, 'latency_p50': 4.773493051528931, 'latency_p90': 5.4196448802948}, {'batch_size': 20, 'throughput': 3.1959471560932156, 'latency_mean': 6.165191299915314, 'latency_p50': 6.167546272277832, 'latency_p90': 6.982821202278137}]
gpu_counts: {'NVIDIA RTX A5000': 1}
display_name: rirv938-anthropic-beta-_22448_v1
is_internal_developer: True
language_model: rirv938/anthropic_beta_2_40k_624_bo8_v2
model_size: 13B
ranking_group: single
throughput_3p7s: 2.95
us_pacific_date: 2024-12-29
win_ratio: 0.5196092163316759
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 100, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>', '####', 'Bot:', 'User:', 'You:', '<|im_end|>', '<|eot_id|>'], 'max_input_tokens': 512, 'best_of': 4, 'max_output_tokens': 64}
formatter: {'memory_template': '', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name rirv938-anthropic-beta-22448-v1-mkmlizer
Waiting for job on rirv938-anthropic-beta-22448-v1-mkmlizer to finish
rirv938-anthropic-beta-22448-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
rirv938-anthropic-beta-22448-v1-mkmlizer: ║ _____ __ __ ║
rirv938-anthropic-beta-22448-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
rirv938-anthropic-beta-22448-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
rirv938-anthropic-beta-22448-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
rirv938-anthropic-beta-22448-v1-mkmlizer: ║ /___/ ║
rirv938-anthropic-beta-22448-v1-mkmlizer: ║ ║
rirv938-anthropic-beta-22448-v1-mkmlizer: ║ Version: 0.11.12 ║
rirv938-anthropic-beta-22448-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
rirv938-anthropic-beta-22448-v1-mkmlizer: ║ https://mk1.ai ║
rirv938-anthropic-beta-22448-v1-mkmlizer: ║ ║
rirv938-anthropic-beta-22448-v1-mkmlizer: ║ The license key for the current software has been verified as ║
rirv938-anthropic-beta-22448-v1-mkmlizer: ║ belonging to: ║
rirv938-anthropic-beta-22448-v1-mkmlizer: ║ ║
rirv938-anthropic-beta-22448-v1-mkmlizer: ║ Chai Research Corp. ║
rirv938-anthropic-beta-22448-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
rirv938-anthropic-beta-22448-v1-mkmlizer: ║ Expiration: 2025-01-15 23:59:59 ║
rirv938-anthropic-beta-22448-v1-mkmlizer: ║ ║
rirv938-anthropic-beta-22448-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Connection pool is full, discarding connection: %s. Connection pool size: %s
rirv938-anthropic-beta-22448-v1-mkmlizer: Downloaded to shared memory in 93.073s
rirv938-anthropic-beta-22448-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp96xt7aq2, device:0
rirv938-anthropic-beta-22448-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
rirv938-anthropic-beta-22448-v1-mkmlizer: quantized model in 40.747s
rirv938-anthropic-beta-22448-v1-mkmlizer: Processed model rirv938/anthropic_beta_2_40k_624_bo8_v2 in 133.820s
rirv938-anthropic-beta-22448-v1-mkmlizer: creating bucket guanaco-mkml-models
rirv938-anthropic-beta-22448-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
rirv938-anthropic-beta-22448-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/rirv938-anthropic-beta-22448-v1
rirv938-anthropic-beta-22448-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/rirv938-anthropic-beta-22448-v1/config.json
rirv938-anthropic-beta-22448-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/rirv938-anthropic-beta-22448-v1/special_tokens_map.json
rirv938-anthropic-beta-22448-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/rirv938-anthropic-beta-22448-v1/tokenizer_config.json
rirv938-anthropic-beta-22448-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/rirv938-anthropic-beta-22448-v1/tokenizer.json
rirv938-anthropic-beta-22448-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/rirv938-anthropic-beta-22448-v1/flywheel_model.0.safetensors
rirv938-anthropic-beta-22448-v1-mkmlizer: Loading 0: 0%| | 0/363 [00:00<?, ?it/s] Loading 0: 1%|▏ | 5/363 [00:00<00:15, 23.55it/s] Loading 0: 3%|▎ | 10/363 [00:00<00:12, 28.75it/s] Loading 0: 4%|▍ | 14/363 [00:00<00:13, 25.67it/s] Loading 0: 6%|▌ | 21/363 [00:00<00:09, 36.90it/s] Loading 0: 7%|▋ | 26/363 [00:01<00:14, 22.85it/s] Loading 0: 9%|▊ | 31/363 [00:01<00:11, 27.76it/s] Loading 0: 10%|▉ | 35/363 [00:01<00:11, 29.27it/s] Loading 0: 11%|█ | 39/363 [00:01<00:10, 30.45it/s] Loading 0: 12%|█▏ | 43/363 [00:01<00:10, 29.61it/s] Loading 0: 13%|█▎ | 48/363 [00:01<00:09, 32.52it/s] Loading 0: 14%|█▍ | 52/363 [00:01<00:10, 30.31it/s] Loading 0: 15%|█▌ | 56/363 [00:01<00:10, 30.38it/s] Loading 0: 17%|█▋ | 60/363 [00:02<00:09, 32.53it/s] Loading 0: 18%|█▊ | 64/363 [00:02<00:13, 21.87it/s] Loading 0: 20%|█▉ | 71/363 [00:02<00:10, 28.95it/s] Loading 0: 21%|██ | 75/363 [00:02<00:09, 28.88it/s] Loading 0: 22%|██▏ | 79/363 [00:02<00:10, 28.09it/s] Loading 0: 23%|██▎ | 84/363 [00:02<00:09, 30.79it/s] Loading 0: 24%|██▍ | 88/363 [00:03<00:09, 29.64it/s] Loading 0: 26%|██▌ | 93/363 [00:03<00:08, 31.62it/s] Loading 0: 27%|██▋ | 97/363 [00:03<00:08, 29.75it/s] Loading 0: 28%|██▊ | 101/363 [00:03<00:10, 24.92it/s] Loading 0: 29%|██▊ | 104/363 [00:03<00:11, 21.91it/s] Loading 0: 31%|███ | 111/363 [00:03<00:08, 28.90it/s] Loading 0: 32%|███▏ | 115/363 [00:04<00:08, 27.77it/s] Loading 0: 33%|███▎ | 120/363 [00:04<00:08, 30.21it/s] Loading 0: 34%|███▍ | 124/363 [00:04<00:08, 28.91it/s] Loading 0: 36%|███▌ | 129/363 [00:04<00:07, 31.44it/s] Loading 0: 37%|███▋ | 133/363 [00:04<00:07, 29.91it/s] Loading 0: 38%|███▊ | 137/363 [00:04<00:07, 29.93it/s] Loading 0: 39%|███▉ | 142/363 [00:05<00:08, 26.33it/s] Loading 0: 40%|███▉ | 145/363 [00:05<00:08, 24.58it/s] Loading 0: 41%|████ | 148/363 [00:05<00:08, 25.62it/s] Loading 0: 42%|████▏ | 151/363 [00:05<00:08, 25.85it/s] Loading 0: 43%|████▎ | 156/363 [00:05<00:06, 29.70it/s] Loading 0: 44%|████▍ | 160/363 [00:05<00:07, 28.49it/s] Loading 0: 45%|████▌ | 165/363 [00:05<00:06, 31.51it/s] Loading 0: 47%|████▋ | 169/363 [00:05<00:06, 29.84it/s] Loading 0: 48%|████▊ | 174/363 [00:06<00:05, 32.02it/s] Loading 0: 49%|████▉ | 178/363 [00:06<00:06, 29.87it/s] Loading 0: 50%|█████ | 182/363 [00:06<00:07, 24.56it/s] Loading 0: 51%|█████ | 185/363 [00:06<00:08, 21.48it/s] Loading 0: 53%|█████▎ | 192/363 [00:06<00:05, 28.54it/s] Loading 0: 54%|█████▍ | 196/363 [00:06<00:06, 27.53it/s] Loading 0: 55%|█████▌ | 201/363 [00:07<00:05, 30.52it/s] Loading 0: 56%|█████▋ | 205/363 [00:07<00:05, 29.66it/s] Loading 0: 58%|█████▊ | 210/363 [00:07<00:04, 31.96it/s] Loading 0: 59%|█████▉ | 214/363 [00:07<00:04, 30.64it/s] Loading 0: 60%|██████ | 218/363 [00:07<00:04, 31.31it/s] Loading 0: 61%|██████▏ | 223/363 [00:07<00:05, 27.56it/s] Loading 0: 62%|██████▏ | 226/363 [00:08<00:05, 25.41it/s] Loading 0: 63%|██████▎ | 230/363 [00:08<00:05, 24.27it/s] Loading 0: 65%|██████▌ | 237/363 [00:08<00:04, 31.21it/s] Loading 0: 66%|██████▋ | 241/363 [00:08<00:04, 30.09it/s] Loading 0: 68%|██████▊ | 246/363 [00:08<00:03, 32.69it/s] Loading 0: 69%|██████▉ | 250/363 [00:08<00:03, 30.68it/s] Loading 0: 70%|███████ | 255/363 [00:08<00:03, 33.06it/s] Loading 0: 71%|███████▏ | 259/363 [00:09<00:03, 31.23it/s] Loading 0: 72%|███████▏ | 263/363 [00:09<00:03, 26.23it/s] Loading 0: 73%|███████▎ | 266/363 [00:09<00:04, 22.84it/s] Loading 0: 75%|███████▌ | 273/363 [00:09<00:03, 29.84it/s] Loading 0: 76%|███████▋ | 277/363 [00:09<00:02, 29.26it/s] Loading 0: 78%|███████▊ | 282/363 [00:09<00:02, 32.30it/s] Loading 0: 79%|███████▉ | 286/363 [00:09<00:02, 31.10it/s] Loading 0: 80%|████████ | 291/363 [00:10<00:02, 33.42it/s] Loading 0: 81%|████████▏ | 295/363 [00:10<00:02, 30.63it/s] Loading 0: 82%|████████▏ | 299/363 [00:10<00:02, 31.12it/s] Loading 0: 84%|████████▎ | 304/363 [00:10<00:02, 27.97it/s] Loading 0: 85%|████████▍ | 307/363 [00:10<00:02, 25.39it/s] Loading 0: 86%|████████▌ | 311/363 [00:10<00:02, 23.70it/s] Loading 0: 88%|████████▊ | 318/363 [00:11<00:01, 30.48it/s] Loading 0: 89%|████████▊ | 322/363 [00:11<00:01, 29.12it/s] Loading 0: 90%|█████████ | 327/363 [00:11<00:01, 31.11it/s] Loading 0: 91%|█████████ | 331/363 [00:11<00:01, 29.28it/s] Loading 0: 93%|█████████▎| 336/363 [00:11<00:00, 31.47it/s] Loading 0: 94%|█████████▎| 340/363 [00:11<00:00, 30.07it/s] Loading 0: 95%|█████████▍| 344/363 [00:18<00:09, 1.97it/s] Loading 0: 96%|█████████▌| 348/363 [00:18<00:05, 2.65it/s] Loading 0: 97%|█████████▋| 353/363 [00:19<00:02, 3.84it/s] Loading 0: 98%|█████████▊| 357/363 [00:19<00:01, 4.98it/s]
Job rirv938-anthropic-beta-22448-v1-mkmlizer completed after 155.89s with status: succeeded
Stopping job with name rirv938-anthropic-beta-22448-v1-mkmlizer
Pipeline stage MKMLizer completed in 156.46s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.17s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service rirv938-anthropic-beta-22448-v1
Waiting for inference service rirv938-anthropic-beta-22448-v1 to be ready
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Inference service rirv938-anthropic-beta-22448-v1 ready after 291.36616921424866s
Pipeline stage MKMLDeployer completed in 291.95s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.056039333343506s
Received healthy response to inference request in 2.54774808883667s
Received healthy response to inference request in 1.509065866470337s
Received healthy response to inference request in 2.191537857055664s
Received healthy response to inference request in 3.9912655353546143s
5 requests
0 failed requests
5th percentile: 1.6184605598449706
10th percentile: 1.7278552532196045
20th percentile: 1.9466446399688722
30th percentile: 2.0831390380859376
40th percentile: 2.1373384475708006
50th percentile: 2.191537857055664
60th percentile: 2.3340219497680663
70th percentile: 2.4765060424804686
80th percentile: 2.836451578140259
90th percentile: 3.413858556747437
95th percentile: 3.702562046051025
99th percentile: 3.9335248374938963
mean time: 2.4591313362121583
Pipeline stage StressChecker completed in 13.65s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.81s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.81s
Shutdown handler de-registered
rirv938-anthropic-beta-_22448_v1 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2523.14s
Shutdown handler de-registered
rirv938-anthropic-beta-_22448_v1 status is now inactive due to auto deactivation removed underperforming models
rirv938-anthropic-beta-_22448_v1 status is now torndown due to DeploymentManager action
rirv938-anthropic-beta-_22448_v1 status is now torndown due to DeploymentManager action
rirv938-anthropic-beta-_22448_v1 status is now torndown due to DeploymentManager action
ChatRequest
Generation Params
Prompt Formatter
Chat History
ChatMessage 1