developer_uid: junhua024
submission_id: junhua024-chai-02-full-01205_v3
model_name: junhua024-chai-02-full-01205_v3
model_group: junhua024/chai_02_full_0
status: torndown
timestamp: 2025-07-16T19:34:36+00:00
num_battles: 6605
num_wins: 3151
celo_rating: 1274.92
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: junhua024/chai_02_full_01205
model_architecture: MistralForCausalLM
model_num_parameters: 12772070400.0
best_of: 8
max_input_tokens: 1024
max_output_tokens: 64
reward_model: default
latencies: [{'batch_size': 1, 'throughput': 0.6007182231672201, 'latency_mean': 1.6645408391952514, 'latency_p50': 1.6522691249847412, 'latency_p90': 1.8516806364059448}, {'batch_size': 3, 'throughput': 1.074788582095875, 'latency_mean': 2.787480630874634, 'latency_p50': 2.8091602325439453, 'latency_p90': 3.0625715255737305}, {'batch_size': 5, 'throughput': 1.2942032367392546, 'latency_mean': 3.84191731095314, 'latency_p50': 3.839444875717163, 'latency_p90': 4.213866662979126}, {'batch_size': 6, 'throughput': 1.346249123559131, 'latency_mean': 4.438100193738937, 'latency_p50': 4.410010933876038, 'latency_p90': 4.988488078117371}, {'batch_size': 8, 'throughput': 1.4179159142013857, 'latency_mean': 5.609099034070969, 'latency_p50': 5.659634709358215, 'latency_p90': 6.250272274017334}, {'batch_size': 10, 'throughput': 1.4446150611093924, 'latency_mean': 6.863081737756729, 'latency_p50': 6.907211542129517, 'latency_p90': 7.690532159805298}]
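The latency profile above records measured throughput (requests/s) and latency percentiles per batch size. A minimal sketch, assuming `throughput_3p7s` is obtained by interpolating this curve at a 3.7 s latency target (the exact pipeline computation is not shown in this log):

```python
# Illustrative only: reads the latency profile above (values abbreviated)
# and interpolates throughput at a 3.7 s mean-latency budget.
latencies = [
    {"batch_size": 1,  "throughput": 0.6007, "latency_mean": 1.6645},
    {"batch_size": 3,  "throughput": 1.0748, "latency_mean": 2.7875},
    {"batch_size": 5,  "throughput": 1.2942, "latency_mean": 3.8419},
    {"batch_size": 6,  "throughput": 1.3462, "latency_mean": 4.4381},
    {"batch_size": 8,  "throughput": 1.4179, "latency_mean": 5.6091},
    {"batch_size": 10, "throughput": 1.4446, "latency_mean": 6.8631},
]

def throughput_at(latency_budget: float) -> float:
    """Linearly interpolate throughput at a target mean latency."""
    pts = sorted((e["latency_mean"], e["throughput"]) for e in latencies)
    for (l0, t0), (l1, t1) in zip(pts, pts[1:]):
        if l0 <= latency_budget <= l1:
            return t0 + (latency_budget - l0) / (l1 - l0) * (t1 - t0)
    raise ValueError("latency budget outside measured range")

print(round(throughput_at(3.7), 2))  # ~1.26; the reported throughput_3p7s is
# 1.28, so the platform's exact method likely differs slightly (assumption).
```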
gpu_counts: {'NVIDIA RTX A5000': 1}
display_name: junhua024-chai-02-full-01205_v3
is_internal_developer: False
language_model: junhua024/chai_02_full_01205
model_size: 13B
ranking_group: single
throughput_3p7s: 1.28
us_pacific_date: 2025-07-16
win_ratio: 0.4770628311884936
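The win ratio follows directly from the battle counts above:

```python
num_battles, num_wins = 6605, 3151
print(num_wins / num_battles)  # 0.4770628311884936, matching win_ratio
```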
generation_params: {'temperature': 1.0, 'top_p': 0.88, 'min_p': 0.0, 'top_k': 10, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
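A hedged sketch of how these decoding settings map onto Hugging Face `transformers` generate kwargs; the platform's actual serving stack (MKML, below) is separate, so this is illustrative only, and best-of-8 reranking by the reward model is an assumption about how `best_of` is applied:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "junhua024/chai_02_full_01205"
tok = AutoTokenizer.from_pretrained(repo)
model = AutoModelForCausalLM.from_pretrained(repo, device_map="auto")

prompt = "Aria: Welcome aboard!\nAnon: Ready to fly?\nAria:"  # example input
inputs = tok(prompt, return_tensors="pt", truncation=True,
             max_length=1024).to(model.device)   # max_input_tokens: 1024
out = model.generate(
    **inputs,
    do_sample=True,
    temperature=1.0,
    top_p=0.88,
    top_k=10,
    max_new_tokens=64,        # max_output_tokens
    num_return_sequences=8,   # best_of: 8 candidates, presumably reranked
)                             # by the reward model (assumption)
for seq in out:
    text = tok.decode(seq[inputs["input_ids"].shape[1]:],
                      skip_special_tokens=True)
    print(text.split("\n")[0])  # emulate the '\n' stopping word
```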
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
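A minimal sketch of how the formatter templates above compose the final prompt; the persona and messages are invented for illustration:

```python
formatter = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}

bot, user = "Aria", "Anon"
prompt_text = "".join([
    formatter["memory_template"].format(bot_name=bot, memory="A witty pilot."),
    formatter["prompt_template"].format(prompt="Aria greets you at the hangar."),
    formatter["bot_template"].format(bot_name=bot, message="Welcome aboard!"),
    formatter["user_template"].format(user_name=user, message="Ready to fly?"),
    formatter["response_template"].format(bot_name=bot),
])
print(prompt_text)  # the model completes from the trailing "Aria:"
```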
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name junhua024-chai-02-full-01205-v3-mkmlizer
Waiting for job on junhua024-chai-02-full-01205-v3-mkmlizer to finish
junhua024-chai-02-full-01205-v3-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
Retrying (%r) after connection broken by '%r': %s
junhua024-chai-02-full-01205-v3-mkmlizer: Xet Storage is enabled for this repo, but the 'hf_xet' package is not installed. Falling back to regular HTTP download. For better performance, install the package with: `pip install huggingface_hub[hf_xet]` or `pip install hf_xet`
junhua024-chai-02-full-01205-v3-mkmlizer: Downloaded to shared memory in 113.745s
junhua024-chai-02-full-01205-v3-mkmlizer: Checking if junhua024/chai_02_full_01205 already exists in ChaiML
junhua024-chai-02-full-01205-v3-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpo7u5a0tk, device:0
junhua024-chai-02-full-01205-v3-mkmlizer: Saving flywheel model at /dev/shm/model_cache
junhua024-chai-02-full-01205-v3-mkmlizer: quantized model in 30.888s
junhua024-chai-02-full-01205-v3-mkmlizer: Processed model junhua024/chai_02_full_01205 in 144.724s
junhua024-chai-02-full-01205-v3-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
junhua024-chai-02-full-01205-v3-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/junhua024-chai-02-full-01205-v3/nvidia
junhua024-chai-02-full-01205-v3-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/junhua024-chai-02-full-01205-v3/nvidia/config.json
junhua024-chai-02-full-01205-v3-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/junhua024-chai-02-full-01205-v3/nvidia/special_tokens_map.json
junhua024-chai-02-full-01205-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/junhua024-chai-02-full-01205-v3/nvidia/tokenizer_config.json
junhua024-chai-02-full-01205-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/junhua024-chai-02-full-01205-v3/nvidia/tokenizer.json
junhua024-chai-02-full-01205-v3-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/junhua024-chai-02-full-01205-v3/nvidia/flywheel_model.0.safetensors
junhua024-chai-02-full-01205-v3-mkmlizer: Loading 0: 0%| | 0/363 [00:00<?, ?it/s] ... 98%|█████████▊| 357/363 [00:10<00:00, 24.27it/s] (tqdm progress output elided)
Job junhua024-chai-02-full-01205-v3-mkmlizer completed after 177.84s with status: succeeded
Stopping job with name junhua024-chai-02-full-01205-v3-mkmlizer
Pipeline stage MKMLizer completed in 178.35s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.14s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service junhua024-chai-02-full-01205-v3
Waiting for inference service junhua024-chai-02-full-01205-v3 to be ready
Failed to get response for submission junhua024-chai-02-full-0615_v1: HTTPConnectionPool(host='junhua024-chai-02-full-0615-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission junhua024-chai-02-full-0615_v3: HTTPConnectionPool(host='junhua024-chai-02-full-0615-v3-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission junhua024-chai-02-full-0615_v2: HTTPConnectionPool(host='junhua024-chai-02-full-0615-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Inference service junhua024-chai-02-full-01205-v3 ready after 260.9662172794342s
Pipeline stage MKMLDeployer completed in 261.66s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.3099842071533203s
Received healthy response to inference request in 1.490767478942871s
Received healthy response to inference request in 1.667618751525879s
Received healthy response to inference request in 1.8028666973114014s
Received healthy response to inference request in 1.785423755645752s
5 requests
0 failed requests
5th percentile: 1.5261377334594726
10th percentile: 1.5615079879760743
20th percentile: 1.6322484970092774
30th percentile: 1.6911797523498535
40th percentile: 1.7383017539978027
50th percentile: 1.785423755645752
60th percentile: 1.7924009323120118
70th percentile: 1.7993781089782714
80th percentile: 1.9042901992797852
90th percentile: 2.1071372032165527
95th percentile: 2.2085607051849365
99th percentile: 2.2896995067596437
mean time: 1.8113321781158447
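These StressChecker statistics are reproducible from the five response times above with standard linear-interpolation percentiles; a sketch assuming numpy's default method, not the checker's actual code:

```python
import numpy as np

times = [2.3099842071533203, 1.490767478942871, 1.667618751525879,
         1.8028666973114014, 1.785423755645752]
for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {np.percentile(times, q)}")
print("mean time:", np.mean(times))  # 1.8113321781158447
```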
Pipeline stage StressChecker completed in 10.55s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.71s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.76s
Shutdown handler de-registered
junhua024-chai-02-full-01205_v3 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Skipping teardown as no inference service was successfully deployed
Pipeline stage MKMLProfilerDeleter completed in 0.14s
run pipeline stage %s
Running pipeline stage MKMLProfilerTemplater
Pipeline stage MKMLProfilerTemplater completed in 0.19s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeployer
Creating inference service junhua024-chai-02-full-01205-v3-profiler
Waiting for inference service junhua024-chai-02-full-01205-v3-profiler to be ready
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
Received signal 15, running shutdown handler
Shutdown handler de-registered
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 5579.90s
Shutdown handler de-registered
junhua024-chai-02-full-01205_v3 status is now torndown due to DeploymentManager action