developer_uid: zmeeks
submission_id: zmeeks-capitanito-54-2800_v4
model_name: capitanito_t10__z54-2800__ppmmp
model_group: zmeeks/capitanito__54-28
status: torndown
timestamp: 2025-07-18T03:03:16+00:00
num_battles: 7381
num_wins: 3634
celo_rating: 1269.25
family_friendly_score: 0.5774
family_friendly_standard_error: 0.006985831947592212
submission_type: basic
model_repo: zmeeks/capitanito__54-2800
model_architecture: MistralForCausalLM
model_num_parameters: 12772070400.0
best_of: 8
max_input_tokens: 3072
max_output_tokens: 64
reward_model: default
latencies: [{'batch_size': 1, 'throughput': 0.40408494177421705, 'latency_mean': 2.4745689749717714, 'latency_p50': 2.4746837615966797, 'latency_p90': 2.6545534849166867}, {'batch_size': 2, 'throughput': 0.5248625742769213, 'latency_mean': 3.806627243757248, 'latency_p50': 3.8027158975601196, 'latency_p90': 4.134437036514282}, {'batch_size': 3, 'throughput': 0.5878741711444445, 'latency_mean': 5.090508906841278, 'latency_p50': 5.0633968114852905, 'latency_p90': 5.766021394729614}, {'batch_size': 5, 'throughput': 0.6403265801370153, 'latency_mean': 7.776892054080963, 'latency_p50': 7.769336938858032, 'latency_p90': 8.7059987783432}]
gpu_counts: {'NVIDIA RTX A5000': 1}
display_name: capitanito_t10__z54-2800__ppmmp
is_internal_developer: False
language_model: zmeeks/capitanito__54-2800
model_size: 13B
ranking_group: single
throughput_3p7s: 0.52
us_pacific_date: 2025-07-17
win_ratio: 0.49234521067606013
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 25, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 3072, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name zmeeks-capitanito-54-2800-v4-mkmlizer
Waiting for job on zmeeks-capitanito-54-2800-v4-mkmlizer to finish
zmeeks-capitanito-54-2800-v4-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ Version: 0.29.15 ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ https://mk1.ai ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ The license key for the current software has been verified as ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ belonging to: ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ Chai Research Corp. ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ║ ║
zmeeks-capitanito-54-2800-v4-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
zmeeks-capitanito-54-2800-v4-mkmlizer: Downloaded to shared memory in 39.661s
zmeeks-capitanito-54-2800-v4-mkmlizer: Checking if zmeeks/capitanito__54-2800 already exists in ChaiML
zmeeks-capitanito-54-2800-v4-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpa0t_kj4y, device:0
zmeeks-capitanito-54-2800-v4-mkmlizer: Saving flywheel model at /dev/shm/model_cache
zmeeks-capitanito-54-2800-v4-mkmlizer: quantized model in 39.451s
zmeeks-capitanito-54-2800-v4-mkmlizer: Processed model zmeeks/capitanito__54-2800 in 79.197s
zmeeks-capitanito-54-2800-v4-mkmlizer: creating bucket guanaco-mkml-models
zmeeks-capitanito-54-2800-v4-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
zmeeks-capitanito-54-2800-v4-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/zmeeks-capitanito-54-2800-v4/nvidia
zmeeks-capitanito-54-2800-v4-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/zmeeks-capitanito-54-2800-v4/nvidia/config.json
zmeeks-capitanito-54-2800-v4-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/zmeeks-capitanito-54-2800-v4/nvidia/special_tokens_map.json
zmeeks-capitanito-54-2800-v4-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/zmeeks-capitanito-54-2800-v4/nvidia/tokenizer_config.json
zmeeks-capitanito-54-2800-v4-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/zmeeks-capitanito-54-2800-v4/nvidia/tokenizer.json
zmeeks-capitanito-54-2800-v4-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/zmeeks-capitanito-54-2800-v4/nvidia/flywheel_model.0.safetensors
zmeeks-capitanito-54-2800-v4-mkmlizer: Loading 0: 0%| | 0/363 [00:00<?, ?it/s] Loading 0: 1%|▏ | 5/363 [00:00<00:14, 25.33it/s] Loading 0: 3%|▎ | 12/363 [00:00<00:08, 40.65it/s] Loading 0: 5%|▍ | 17/363 [00:00<00:09, 38.42it/s] Loading 0: 6%|▌ | 22/363 [00:00<00:09, 37.38it/s] Loading 0: 7%|▋ | 26/363 [00:00<00:08, 37.56it/s] Loading 0: 8%|▊ | 30/363 [00:00<00:08, 37.68it/s] Loading 0: 9%|▉ | 34/363 [00:00<00:09, 35.69it/s] Loading 0: 11%|█ | 39/363 [00:01<00:08, 37.46it/s] Loading 0: 12%|█▏ | 43/363 [00:01<00:08, 36.02it/s] Loading 0: 13%|█▎ | 48/363 [00:01<00:08, 38.40it/s] Loading 0: 14%|█▍ | 52/363 [00:01<00:08, 35.64it/s] Loading 0: 15%|█▌ | 56/363 [00:01<00:08, 35.30it/s] Loading 0: 17%|█▋ | 60/363 [00:01<00:08, 36.36it/s] Loading 0: 18%|█▊ | 64/363 [00:02<00:14, 20.53it/s] Loading 0: 20%|█▉ | 71/363 [00:02<00:10, 28.37it/s] Loading 0: 21%|██ | 75/363 [00:02<00:09, 28.95it/s] Loading 0: 22%|██▏ | 79/363 [00:02<00:09, 28.63it/s] Loading 0: 23%|██▎ | 83/363 [00:02<00:10, 27.51it/s] Loading 0: 25%|██▍ | 89/363 [00:02<00:08, 33.15it/s] Loading 0: 26%|██▌ | 93/363 [00:02<00:08, 31.95it/s] Loading 0: 27%|██▋ | 98/363 [00:02<00:07, 34.00it/s] Loading 0: 28%|██▊ | 102/363 [00:03<00:08, 32.48it/s] Loading 0: 29%|██▉ | 106/363 [00:03<00:07, 32.45it/s] Loading 0: 30%|███ | 110/363 [00:03<00:07, 33.10it/s] Loading 0: 31%|███▏ | 114/363 [00:03<00:08, 30.70it/s] Loading 0: 33%|███▎ | 118/363 [00:03<00:08, 29.17it/s] Loading 0: 34%|███▍ | 123/363 [00:03<00:07, 33.97it/s] Loading 0: 35%|███▍ | 127/363 [00:03<00:07, 30.81it/s] Loading 0: 36%|███▋ | 132/363 [00:04<00:06, 35.33it/s] Loading 0: 37%|███▋ | 136/363 [00:04<00:07, 31.27it/s] Loading 0: 39%|███▉ | 141/363 [00:04<00:06, 35.65it/s] Loading 0: 40%|███▉ | 145/363 [00:04<00:09, 22.90it/s] Loading 0: 41%|████ | 149/363 [00:04<00:09, 23.38it/s] Loading 0: 42%|████▏ | 154/363 [00:04<00:07, 28.27it/s] Loading 0: 44%|████▎ | 158/363 [00:05<00:07, 27.05it/s] Loading 0: 45%|████▍ | 163/363 [00:05<00:06, 30.44it/s] Loading 0: 46%|████▌ | 167/363 [00:05<00:06, 28.35it/s] Loading 0: 47%|████▋ | 172/363 [00:05<00:05, 32.14it/s] Loading 0: 48%|████▊ | 176/363 [00:05<00:06, 28.61it/s] Loading 0: 50%|████▉ | 181/363 [00:05<00:05, 32.94it/s] Loading 0: 51%|█████ | 185/363 [00:05<00:05, 31.03it/s] Loading 0: 53%|█████▎ | 192/363 [00:06<00:04, 37.27it/s] Loading 0: 54%|█████▍ | 196/363 [00:06<00:04, 34.96it/s] Loading 0: 55%|█████▌ | 201/363 [00:06<00:04, 36.61it/s] Loading 0: 56%|█████▋ | 205/363 [00:06<00:04, 33.83it/s] Loading 0: 58%|█████▊ | 210/363 [00:06<00:04, 35.16it/s] Loading 0: 59%|█████▉ | 214/363 [00:06<00:04, 33.21it/s] Loading 0: 60%|██████ | 218/363 [00:06<00:04, 32.62it/s] Loading 0: 61%|██████ | 222/363 [00:06<00:04, 33.96it/s] Loading 0: 62%|██████▏ | 226/363 [00:07<00:06, 20.01it/s] Loading 0: 63%|██████▎ | 230/363 [00:07<00:06, 20.57it/s] Loading 0: 65%|██████▍ | 235/363 [00:07<00:05, 25.00it/s] Loading 0: 66%|██████▌ | 239/363 [00:07<00:05, 24.07it/s] Loading 0: 67%|██████▋ | 244/363 [00:07<00:04, 28.06it/s] Loading 0: 68%|██████▊ | 248/363 [00:08<00:04, 26.26it/s] Loading 0: 70%|██████▉ | 253/363 [00:08<00:03, 30.24it/s] Loading 0: 71%|███████ | 257/363 [00:08<00:03, 27.47it/s] Loading 0: 72%|███████▏ | 262/363 [00:08<00:03, 30.83it/s] Loading 0: 73%|███████▎ | 266/363 [00:08<00:03, 27.87it/s] Loading 0: 75%|███████▍ | 271/363 [00:08<00:02, 32.24it/s] Loading 0: 76%|███████▌ | 275/363 [00:08<00:03, 28.86it/s] Loading 0: 77%|███████▋ | 280/363 [00:09<00:02, 33.36it/s] Loading 0: 78%|███████▊ | 284/363 [00:09<00:02, 30.91it/s] Loading 0: 80%|███████▉ | 289/363 [00:09<00:02, 34.86it/s] Loading 0: 81%|████████ | 293/363 [00:09<00:02, 30.41it/s] Loading 0: 82%|████████▏ | 298/363 [00:09<00:01, 33.53it/s] Loading 0: 83%|████████▎ | 303/363 [00:09<00:01, 34.91it/s] Loading 0: 85%|████████▍ | 307/363 [00:10<00:03, 16.71it/s] Loading 0: 86%|████████▌ | 312/363 [00:10<00:02, 19.14it/s] Loading 0: 87%|████████▋ | 317/363 [00:10<00:01, 23.73it/s] Loading 0: 88%|████████▊ | 321/363 [00:10<00:01, 23.47it/s] Loading 0: 90%|████████▉ | 326/363 [00:10<00:01, 27.93it/s] Loading 0: 91%|█████████ | 330/363 [00:11<00:01, 27.02it/s] Loading 0: 93%|█████████▎| 337/363 [00:11<00:00, 33.85it/s] Loading 0: 94%|█████████▍| 341/363 [00:11<00:00, 32.07it/s] Loading 0: 95%|█████████▌| 346/363 [00:11<00:00, 34.23it/s] Loading 0: 96%|█████████▋| 350/363 [00:11<00:00, 33.36it/s] Loading 0: 98%|█████████▊| 355/363 [00:11<00:00, 35.64it/s] Loading 0: 99%|█████████▉| 359/363 [00:11<00:00, 34.35it/s]
Job zmeeks-capitanito-54-2800-v4-mkmlizer completed after 106.56s with status: succeeded
Stopping job with name zmeeks-capitanito-54-2800-v4-mkmlizer
Pipeline stage MKMLizer completed in 107.18s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.19s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service zmeeks-capitanito-54-2800-v4
Waiting for inference service zmeeks-capitanito-54-2800-v4 to be ready
Inference service zmeeks-capitanito-54-2800-v4 ready after 312.2807996273041s
Pipeline stage MKMLDeployer completed in 313.40s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.073779582977295s
Received healthy response to inference request in 1.5243945121765137s
Received healthy response to inference request in 0.9245271682739258s
Received healthy response to inference request in 1.5070579051971436s
Received healthy response to inference request in 1.8852157592773438s
5 requests
0 failed requests
5th percentile: 1.0410333156585694
10th percentile: 1.157539463043213
20th percentile: 1.3905517578125
30th percentile: 1.5105252265930176
40th percentile: 1.5174598693847656
50th percentile: 1.5243945121765137
60th percentile: 1.6687230110168456
70th percentile: 1.8130515098571776
80th percentile: 1.922928524017334
90th percentile: 1.9983540534973145
95th percentile: 2.0360668182373045
99th percentile: 2.066237030029297
mean time: 1.5829949855804444
Pipeline stage StressChecker completed in 9.27s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.71s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.69s
Shutdown handler de-registered
zmeeks-capitanito-54-2800_v4 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 5378.96s
Shutdown handler de-registered
zmeeks-capitanito-54-2800_v4 status is now inactive due to auto deactivation removed underperforming models
zmeeks-capitanito-54-2800_v4 status is now torndown due to DeploymentManager action
zmeeks-capitanito-54-2800_v4 status is now torndown due to DeploymentManager action