developer_uid: zmeeks
submission_id: zmeeks-capitanito-50-1600_v2
model_name: capitanito_t11__50-1600__ppmmp
model_group: zmeeks/capitanito__50-16
status: torndown
timestamp: 2025-07-16T14:47:43+00:00
num_battles: 11844
num_wins: 5297
celo_rating: 1243.9
family_friendly_score: 0.5688
family_friendly_standard_error: 0.00700380696478708
submission_type: basic
model_repo: zmeeks/capitanito__50-1600
model_architecture: MistralForCausalLM
model_num_parameters: 12772070400.0
best_of: 8
max_input_tokens: 1024
max_output_tokens: 64
reward_model: default
latencies: [{'batch_size': 1, 'throughput': 0.6036018609714221, 'latency_mean': 1.656606252193451, 'latency_p50': 1.65065336227417, 'latency_p90': 1.8046510457992553}, {'batch_size': 3, 'throughput': 1.0880092397610845, 'latency_mean': 2.749240812063217, 'latency_p50': 2.75131893157959, 'latency_p90': 3.0300966024398805}, {'batch_size': 5, 'throughput': 1.3200982086917326, 'latency_mean': 3.7630553340911863, 'latency_p50': 3.782181143760681, 'latency_p90': 4.223428678512573}, {'batch_size': 6, 'throughput': 1.3777535053643999, 'latency_mean': 4.330460793972016, 'latency_p50': 4.357667446136475, 'latency_p90': 4.839846968650818}, {'batch_size': 8, 'throughput': 1.4512111154493013, 'latency_mean': 5.488387362957001, 'latency_p50': 5.4718122482299805, 'latency_p90': 6.1715384244918825}, {'batch_size': 10, 'throughput': 1.4886740911455114, 'latency_mean': 6.6626342689991, 'latency_p50': 6.699276447296143, 'latency_p90': 7.499529957771301}]
gpu_counts: {'NVIDIA RTX A5000': 1}
display_name: capitanito_t11__50-1600__ppmmp
is_internal_developer: False
language_model: zmeeks/capitanito__50-1600
model_size: 13B
ranking_group: single
throughput_3p7s: 1.32
us_pacific_date: 2025-07-16
win_ratio: 0.4472306653157717
generation_params: {'temperature': 1.1, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 25, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name zmeeks-capitanito-50-1600-v2-mkmlizer
Waiting for job on zmeeks-capitanito-50-1600-v2-mkmlizer to finish
zmeeks-capitanito-50-1600-v2-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ Version: 0.29.15 ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ https://mk1.ai ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ The license key for the current software has been verified as ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ belonging to: ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ Chai Research Corp. ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ║ ║
zmeeks-capitanito-50-1600-v2-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
zmeeks-capitanito-50-1600-v2-mkmlizer: Downloaded to shared memory in 34.682s
zmeeks-capitanito-50-1600-v2-mkmlizer: Checking if zmeeks/capitanito__50-1600 already exists in ChaiML
zmeeks-capitanito-50-1600-v2-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpd9ygvv5s, device:0
zmeeks-capitanito-50-1600-v2-mkmlizer: Saving flywheel model at /dev/shm/model_cache
zmeeks-capitanito-50-1600-v2-mkmlizer: quantized model in 32.107s
zmeeks-capitanito-50-1600-v2-mkmlizer: Processed model zmeeks/capitanito__50-1600 in 66.882s
zmeeks-capitanito-50-1600-v2-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
zmeeks-capitanito-50-1600-v2-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/zmeeks-capitanito-50-1600-v2/nvidia
zmeeks-capitanito-50-1600-v2-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/zmeeks-capitanito-50-1600-v2/nvidia/special_tokens_map.json
zmeeks-capitanito-50-1600-v2-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/zmeeks-capitanito-50-1600-v2/nvidia/config.json
zmeeks-capitanito-50-1600-v2-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/zmeeks-capitanito-50-1600-v2/nvidia/tokenizer_config.json
zmeeks-capitanito-50-1600-v2-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/zmeeks-capitanito-50-1600-v2/nvidia/tokenizer.json
zmeeks-capitanito-50-1600-v2-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/zmeeks-capitanito-50-1600-v2/nvidia/flywheel_model.0.safetensors
zmeeks-capitanito-50-1600-v2-mkmlizer: Loading 0: 0%| | 0/363 [00:00<?, ?it/s] Loading 0: 1%|▏ | 5/363 [00:00<00:11, 30.37it/s] Loading 0: 4%|▎ | 13/363 [00:00<00:07, 47.65it/s] Loading 0: 5%|▍ | 18/363 [00:00<00:07, 43.42it/s] Loading 0: 6%|▋ | 23/363 [00:00<00:10, 32.85it/s] Loading 0: 8%|▊ | 30/363 [00:00<00:07, 41.90it/s] Loading 0: 10%|▉ | 35/363 [00:00<00:08, 40.64it/s] Loading 0: 11%|█ | 40/363 [00:00<00:07, 41.20it/s] Loading 0: 12%|█▏ | 45/363 [00:01<00:07, 43.21it/s] Loading 0: 14%|█▍ | 50/363 [00:01<00:08, 35.91it/s] Loading 0: 15%|█▌ | 56/363 [00:01<00:07, 41.30it/s] Loading 0: 17%|█▋ | 61/363 [00:01<00:10, 29.76it/s] Loading 0: 18%|█▊ | 65/363 [00:01<00:10, 29.11it/s] Loading 0: 20%|█▉ | 71/363 [00:01<00:08, 34.27it/s] Loading 0: 21%|██ | 75/363 [00:02<00:08, 33.13it/s] Loading 0: 22%|██▏ | 80/363 [00:02<00:08, 35.26it/s] Loading 0: 23%|██▎ | 84/363 [00:02<00:08, 33.83it/s] Loading 0: 25%|██▍ | 89/363 [00:02<00:07, 35.94it/s] Loading 0: 26%|██▌ | 93/363 [00:02<00:08, 33.68it/s] Loading 0: 27%|██▋ | 98/363 [00:02<00:07, 36.83it/s] Loading 0: 28%|██▊ | 102/363 [00:02<00:07, 34.84it/s] Loading 0: 29%|██▉ | 106/363 [00:02<00:07, 34.52it/s] Loading 0: 31%|███ | 112/363 [00:03<00:06, 38.46it/s] Loading 0: 32%|███▏ | 116/363 [00:03<00:06, 36.48it/s] Loading 0: 33%|███▎ | 120/363 [00:03<00:06, 34.86it/s] Loading 0: 34%|███▍ | 125/363 [00:03<00:06, 36.66it/s] Loading 0: 36%|███▌ | 129/363 [00:03<00:06, 34.26it/s] Loading 0: 37%|███▋ | 133/363 [00:03<00:06, 35.38it/s] Loading 0: 38%|███▊ | 137/363 [00:03<00:07, 31.69it/s] Loading 0: 39%|███▉ | 142/363 [00:04<00:08, 24.61it/s] Loading 0: 40%|███▉ | 145/363 [00:04<00:08, 24.96it/s] Loading 0: 41%|████ | 149/363 [00:04<00:08, 25.56it/s] Loading 0: 43%|████▎ | 156/363 [00:04<00:05, 34.53it/s] Loading 0: 44%|████▍ | 160/363 [00:04<00:05, 34.65it/s] Loading 0: 45%|████▌ | 165/363 [00:04<00:05, 38.22it/s] Loading 0: 47%|████▋ | 170/363 [00:04<00:04, 39.85it/s] Loading 0: 48%|████▊ | 175/363 [00:04<00:04, 41.74it/s] Loading 0: 50%|████▉ | 181/363 [00:05<00:04, 40.62it/s] Loading 0: 51%|█████ | 186/363 [00:05<00:04, 40.36it/s] Loading 0: 53%|█████▎ | 192/363 [00:05<00:03, 44.21it/s] Loading 0: 54%|█████▍ | 197/363 [00:05<00:03, 42.45it/s] Loading 0: 56%|█████▌ | 202/363 [00:05<00:03, 41.81it/s] Loading 0: 57%|█████▋ | 207/363 [00:05<00:03, 42.67it/s] Loading 0: 58%|█████▊ | 212/363 [00:05<00:04, 34.68it/s] Loading 0: 60%|██████ | 218/363 [00:06<00:03, 38.21it/s] Loading 0: 61%|██████▏ | 223/363 [00:06<00:04, 28.82it/s] Loading 0: 63%|██████▎ | 227/363 [00:06<00:04, 29.17it/s] Loading 0: 64%|██████▎ | 231/363 [00:06<00:04, 27.97it/s] Loading 0: 65%|██████▌ | 237/363 [00:06<00:03, 32.60it/s] Loading 0: 66%|██████▋ | 241/363 [00:06<00:03, 31.93it/s] Loading 0: 68%|██████▊ | 246/363 [00:06<00:03, 34.26it/s] Loading 0: 69%|██████▉ | 250/363 [00:07<00:03, 32.59it/s] Loading 0: 70%|██████▉ | 254/363 [00:07<00:03, 34.21it/s] Loading 0: 71%|███████ | 258/363 [00:07<00:03, 29.88it/s] Loading 0: 72%|███████▏ | 262/363 [00:07<00:03, 31.53it/s] Loading 0: 73%|███████▎ | 266/363 [00:07<00:03, 28.90it/s] Loading 0: 75%|███████▍ | 271/363 [00:07<00:02, 33.23it/s] Loading 0: 76%|███████▌ | 275/363 [00:07<00:02, 30.42it/s] Loading 0: 78%|███████▊ | 282/363 [00:08<00:02, 37.39it/s] Loading 0: 79%|███████▉ | 286/363 [00:08<00:02, 34.86it/s] Loading 0: 80%|████████ | 291/363 [00:08<00:01, 36.08it/s] Loading 0: 81%|████████▏ | 295/363 [00:08<00:01, 34.21it/s] Loading 0: 82%|████████▏ | 299/363 [00:08<00:01, 33.29it/s] Loading 0: 84%|████████▎ | 304/363 [00:09<00:03, 18.75it/s] Loading 0: 85%|████████▍ | 307/363 [00:09<00:02, 19.61it/s] Loading 0: 86%|████████▌ | 312/363 [00:09<00:02, 21.53it/s] Loading 0: 88%|████████▊ | 319/363 [00:09<00:01, 29.32it/s] Loading 0: 89%|████████▉ | 323/363 [00:09<00:01, 30.40it/s] Loading 0: 91%|█████████ | 329/363 [00:09<00:00, 35.65it/s] Loading 0: 92%|█████████▏| 334/363 [00:09<00:00, 36.71it/s] Loading 0: 93%|█████████▎| 339/363 [00:10<00:00, 33.18it/s] Loading 0: 96%|█████████▌| 347/363 [00:10<00:00, 41.75it/s] Loading 0: 97%|█████████▋| 353/363 [00:10<00:00, 41.17it/s] Loading 0: 99%|█████████▊| 358/363 [00:10<00:00, 39.86it/s]
Failed to get response for submission junhua024-chai-02-full-062_v2: HTTPConnectionPool(host='junhua024-chai-02-full-062-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Job zmeeks-capitanito-50-1600-v2-mkmlizer completed after 95.7s with status: succeeded
Stopping job with name zmeeks-capitanito-50-1600-v2-mkmlizer
Pipeline stage MKMLizer completed in 96.19s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.53s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service zmeeks-capitanito-50-1600-v2
Waiting for inference service zmeeks-capitanito-50-1600-v2 to be ready
Failed to get response for submission zmeeks-capitanito-50-2000_v1: HTTPConnectionPool(host='zmeeks-capitanito-50-2000-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Inference service zmeeks-capitanito-50-1600-v2 ready after 271.5399236679077s
Pipeline stage MKMLDeployer completed in 272.40s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.4335577487945557s
Received healthy response to inference request in 5.699070692062378s
Received healthy response to inference request in 1.8250095844268799s
Received healthy response to inference request in 1.6511425971984863s
5 requests
1 failed requests
5th percentile: 1.685915994644165
10th percentile: 1.7206893920898438
20th percentile: 1.7902361869812011
30th percentile: 1.946719217300415
40th percentile: 2.1901384830474853
50th percentile: 2.4335577487945557
60th percentile: 3.739762926101684
70th percentile: 5.045968103408812
80th percentile: 8.583821678161623
90th percentile: 14.353323650360108
95th percentile: 17.23807463645935
99th percentile: 19.545875425338746
mean time: 6.346321249008179
%s, retrying in %s seconds...
Received healthy response to inference request in 1.4352068901062012s
Received healthy response to inference request in 1.0230448246002197s
Received healthy response to inference request in 2.1243011951446533s
Received healthy response to inference request in 2.2457337379455566s
Received healthy response to inference request in 1.4854779243469238s
5 requests
0 failed requests
5th percentile: 1.105477237701416
10th percentile: 1.1879096508026123
20th percentile: 1.352774477005005
30th percentile: 1.4452610969543458
40th percentile: 1.4653695106506348
50th percentile: 1.4854779243469238
60th percentile: 1.7410072326660155
70th percentile: 1.9965365409851072
80th percentile: 2.148587703704834
90th percentile: 2.1971607208251953
95th percentile: 2.221447229385376
99th percentile: 2.2408764362335205
mean time: 1.662752914428711
Pipeline stage StressChecker completed in 43.33s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.73s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.75s
Shutdown handler de-registered
zmeeks-capitanito-50-1600_v2 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Skipping teardown as no inference service was successfully deployed
Pipeline stage MKMLProfilerDeleter completed in 0.13s
run pipeline stage %s
Running pipeline stage MKMLProfilerTemplater
Pipeline stage MKMLProfilerTemplater completed in 0.11s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeployer
Creating inference service zmeeks-capitanito-50-1600-v2-profiler
Waiting for inference service zmeeks-capitanito-50-1600-v2-profiler to be ready
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
clean up pipeline due to error=DeploymentChecksError('None: None')
Shutdown handler de-registered
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3265.99s
Shutdown handler de-registered
zmeeks-capitanito-50-1600_v2 status is now inactive due to auto deactivation removed underperforming models
zmeeks-capitanito-50-1600_v2 status is now torndown due to DeploymentManager action
zmeeks-capitanito-50-1600_v2 status is now torndown due to DeploymentManager action