developer_uid: rirv938
submission_id: chaiml-gy-exp63-dpo-exp_33117_v3
model_name: chaiml-gy-exp63-dpo-exp_33117_v3
model_group: ChaiML/gy-exp63-dpo-exp3
status: torndown
timestamp: 2025-07-03T04:13:05+00:00
num_battles: 6195
num_wins: 3196
celo_rating: 1296.54
family_friendly_score: 0.509
family_friendly_standard_error: 0.007069922206078367
submission_type: basic
model_repo: ChaiML/gy-exp63-dpo-exp32ep8s2-pref-grok3-sub-nis-majib-30jun-ep1
model_architecture: MistralForCausalLM
model_num_parameters: 24096691200.0
best_of: 8
max_input_tokens: 768
max_output_tokens: 64
reward_model: default
latencies: [{'batch_size': 1, 'throughput': 0.546099864806723, 'latency_mean': 1.8310456228256227, 'latency_p50': 1.8446346521377563, 'latency_p90': 2.0253644466400145}, {'batch_size': 3, 'throughput': 1.1367881593409588, 'latency_mean': 2.6315820741653444, 'latency_p50': 2.621324062347412, 'latency_p90': 2.8939144611358643}, {'batch_size': 5, 'throughput': 1.4682737245592778, 'latency_mean': 3.385753515958786, 'latency_p50': 3.3829593658447266, 'latency_p90': 3.820053482055664}, {'batch_size': 6, 'throughput': 1.611507549159141, 'latency_mean': 3.6957708859443663, 'latency_p50': 3.6886141300201416, 'latency_p90': 4.144052004814148}, {'batch_size': 8, 'throughput': 1.7938433705933658, 'latency_mean': 4.4295820868015285, 'latency_p50': 4.415796637535095, 'latency_p90': 4.977216601371765}, {'batch_size': 10, 'throughput': 1.877105917170103, 'latency_mean': 5.272613189220428, 'latency_p50': 5.330881595611572, 'latency_p90': 5.995641541481018}]
gpu_counts: {'NVIDIA A100-SXM4-80GB': 1}
display_name: chaiml-gy-exp63-dpo-exp_33117_v3
ineligible_reason: num_battles<10000
is_internal_developer: True
language_model: ChaiML/gy-exp63-dpo-exp32ep8s2-pref-grok3-sub-nis-majib-30jun-ep1
model_size: 24B
ranking_group: single
throughput_3p7s: 1.62
us_pacific_date: 2025-07-02
win_ratio: 0.5158999192897498
generation_params: {'temperature': 1.0, 'top_p': 0.95, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.45, 'frequency_penalty': 0.45, 'stopping_words': ['\n', '</s>', 'You:', '###'], 'max_input_tokens': 768, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
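The `win_ratio` field above appears to be `num_wins` divided by `num_battles`. The short check below is a sketch under that assumption; the variable names mirror the metadata keys, and the formula is inferred from the values rather than documented in this record.

```python
# Values copied from the metadata fields above.
num_battles = 6195
num_wins = 3196
logged_win_ratio = 0.5158999192897498

# Assumption: win_ratio is the plain fraction of battles won.
computed_win_ratio = num_wins / num_battles

print(f"computed: {computed_win_ratio:.10f}")
print(f"logged:   {logged_win_ratio:.10f}")
```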
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer
Waiting for job on chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer to finish
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: ║ ║
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: ║ Version: 0.29.15 ║
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: ║ https://mk1.ai ║
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: ║ ║
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: ║ belonging to: ║
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: ║ ║
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: ║ Chai Research Corp. ║
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: ║ ║
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: Downloaded to shared memory in 54.823s
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: Checking if ChaiML/gy-exp63-dpo-exp32ep8s2-pref-grok3-sub-nis-majib-30jun-ep1 already exists in ChaiML
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpn_2crkmb, device:0
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: quantized model in 47.706s
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: Processed model ChaiML/gy-exp63-dpo-exp32ep8s2-pref-grok3-sub-nis-majib-30jun-ep1 in 102.529s
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: creating bucket guanaco-mkml-models
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-gy-exp63-dpo-exp-33117-v3/nvidia
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-gy-exp63-dpo-exp-33117-v3/nvidia/config.json
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-gy-exp63-dpo-exp-33117-v3/nvidia/special_tokens_map.json
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-gy-exp63-dpo-exp-33117-v3/nvidia/tokenizer_config.json
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-gy-exp63-dpo-exp-33117-v3/nvidia/tokenizer.json
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/chaiml-gy-exp63-dpo-exp-33117-v3/nvidia/flywheel_model.1.safetensors
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-gy-exp63-dpo-exp-33117-v3/nvidia/flywheel_model.0.safetensors
chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer: Loading 0: 99%|█████████▉| 359/363 [00:28<00:00, 25.79it/s]
Job chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer completed after 182.17s with status: succeeded
Stopping job with name chaiml-gy-exp63-dpo-exp-33117-v3-mkmlizer
Pipeline stage MKMLizer completed in 182.76s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.18s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-gy-exp63-dpo-exp-33117-v3
Waiting for inference service chaiml-gy-exp63-dpo-exp-33117-v3 to be ready
Inference service chaiml-gy-exp63-dpo-exp-33117-v3 ready after 211.1211633682251s
Pipeline stage MKMLDeployer completed in 211.70s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.3119049072265625s
Received healthy response to inference request in 1.5794017314910889s
Failed to get response for submission chaiml-qwen3-8breranker_67576_v2: HTTPConnectionPool(host='chaiml-qwen3-8breranker-67576-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Received healthy response to inference request in 2.4678471088409424s
Received healthy response to inference request in 1.5433852672576904s
Received healthy response to inference request in 2.1098499298095703s
5 requests
0 failed requests
5th percentile: 1.55058856010437
10th percentile: 1.5577918529510497
20th percentile: 1.5721984386444092
30th percentile: 1.6854913711547852
40th percentile: 1.8976706504821779
50th percentile: 2.1098499298095703
60th percentile: 2.190671920776367
70th percentile: 2.271493911743164
80th percentile: 2.3430933475494387
90th percentile: 2.4054702281951905
95th percentile: 2.4366586685180662
99th percentile: 2.4616094207763672
mean time: 2.002477788925171
Pipeline stage StressChecker completed in 11.55s
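The StressChecker percentiles and mean above are consistent with linear interpolation over the five healthy response times for this submission. The sketch below reproduces them using only the standard library; the `percentile` helper is hypothetical and is not taken from the pipeline's code, which this log does not show.

```python
# The five healthy response times logged by the StressChecker stage, in seconds.
latencies = sorted([
    2.3119049072265625,
    1.5794017314910889,
    2.4678471088409424,
    1.5433852672576904,
    2.1098499298095703,
])

def percentile(sorted_values, q):
    """Return the q-th percentile (0-100) using linear interpolation
    between the two nearest ranks of an already sorted list."""
    rank = (len(sorted_values) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = rank - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * fraction

mean_time = sum(latencies) / len(latencies)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only five samples, every percentile other than the 0th, 25th, 50th, 75th and 100th is an interpolated value, so the high percentiles here say little about tail latency.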
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.70s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.97s
Shutdown handler de-registered
chaiml-gy-exp63-dpo-exp_33117_v3 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3133.64s
Shutdown handler de-registered
chaiml-gy-exp63-dpo-exp_33117_v3 status is now inactive due to auto deactivation removed underperforming models
chaiml-gy-exp63-dpo-exp_33117_v3 status is now torndown due to DeploymentManager action