developer_uid: ChaiHarshitSheoran
submission_id: chaiml-mn24b-v2preferwi_15887_v1
model_name: chaiml-mn24b-v2preferwi_15887_v1
model_group: ChaiML/mn24b_v2preferwin
status: torndown
timestamp: 2025-09-21T16:51:31+00:00
num_battles: 6492
num_wins: 3404
celo_rating: 1279.77
family_friendly_score: 0.5552
family_friendly_standard_error: 0.007027844050631744
submission_type: basic
model_repo: ChaiML/mn24b_v2preferwin2_qwen_bo16_filt4_tune1
model_architecture: MistralForCausalLM
model_num_parameters: 24096691200.0
best_of: 8
max_input_tokens: 1024
max_output_tokens: 64
reward_model: default
latencies: [{'batch_size': 1, 'throughput': 0.5450761616664541, 'latency_mean': 1.834481691122055, 'latency_p50': 1.8385682106018066, 'latency_p90': 2.0408715486526487}, {'batch_size': 3, 'throughput': 1.1146235996056235, 'latency_mean': 2.6759902799129485, 'latency_p50': 2.6572518348693848, 'latency_p90': 3.0298733234405515}, {'batch_size': 5, 'throughput': 1.4350938159910012, 'latency_mean': 3.464847147464752, 'latency_p50': 3.4777212142944336, 'latency_p90': 3.9006681203842164}, {'batch_size': 6, 'throughput': 1.5285736937765682, 'latency_mean': 3.900294535160065, 'latency_p50': 3.9030654430389404, 'latency_p90': 4.397791576385498}, {'batch_size': 8, 'throughput': 1.6970098066746246, 'latency_mean': 4.6766045689582825, 'latency_p50': 4.687021851539612, 'latency_p90': 5.357261490821839}, {'batch_size': 10, 'throughput': 1.7566380919903832, 'latency_mean': 5.645715593099594, 'latency_p50': 5.699653625488281, 'latency_p90': 6.307616400718689}]
gpu_counts: {'NVIDIA L40S': 1}
display_name: chaiml-mn24b-v2preferwi_15887_v1
is_internal_developer: False
language_model: ChaiML/mn24b_v2preferwin2_qwen_bo16_filt4_tune1
model_size: 24B
ranking_group: single
throughput_3p7s: 1.5
us_pacific_date: 2025-09-21
win_ratio: 0.5243376463339495
generation_params: {'temperature': 0.6, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-mn24b-v2preferwi-15887-v1-mkmlizer
Waiting for job on chaiml-mn24b-v2preferwi-15887-v1-mkmlizer to finish
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ Version: 0.30.2 ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ https://mk1.ai ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ belonging to: ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ Chai Research Corp. ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ║ ║
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: Downloaded to shared memory in 47.591s
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: Checking if ChaiML/mn24b_v2preferwin2_qwen_bo16_filt4_tune1 already exists in ChaiML
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:q4, folder:/tmp/tmpmmquv5ey, device:0
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Failed to get response for submission chaiml-cogito32b-dpos2_39363_v15: ('http://guanaco-model-mesh-load-balancer.model-mesh.k2.chaiverse.com/models/chaiml-cogito32b-dpos2_39363_v15/predict', '{"detail":"503, message=\'Attempt to decode JSON with unexpected mimetype: text/plain\', url=\'http://10.4.100.226:8080/models/chaiml-cogito32b-dpos2_39363_v15/predict\'"}')
Failed to get response for submission blend_hemen_2025-09-02: HTTPConnectionPool(host='guanaco-model-mesh-load-balancer.model-mesh.k2.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: quantized model in 290.596s
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: Processed model ChaiML/mn24b_v2preferwin2_qwen_bo16_filt4_tune1 in 338.187s
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: creating bucket guanaco-mkml-models
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-mn24b-v2preferwi-15887-v1/nvidia
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-mn24b-v2preferwi-15887-v1/nvidia/config.json
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-mn24b-v2preferwi-15887-v1/nvidia/special_tokens_map.json
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-mn24b-v2preferwi-15887-v1/nvidia/tokenizer_config.json
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-mn24b-v2preferwi-15887-v1/nvidia/tokenizer.json
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/chaiml-mn24b-v2preferwi-15887-v1/nvidia/flywheel_model.1.safetensors
chaiml-mn24b-v2preferwi-15887-v1-mkmlizer: Loading 0: 0%| | 0/363 [00:00<?, ?it/s] Loading 0: 1%| | 3/363 [00:01<03:58, 1.51it/s] Loading 0: 1%| | 4/363 [00:03<06:27, 1.08s/it] Loading 0: 1%|▏ | 5/363 [00:06<08:30, 1.43s/it] Loading 0: 2%|▏ | 8/363 [00:06<03:55, 1.51it/s] Loading 0: 2%|▏ | 9/363 [00:06<03:26, 1.72it/s] Loading 0: 3%|▎ | 10/363 [00:06<02:47, 2.11it/s] Loading 0: 3%|▎ | 12/363 [00:08<03:57, 1.48it/s] Loading 0: 4%|▎ | 13/363 [00:10<05:36, 1.04it/s] Loading 0: 4%|▍ | 14/363 [00:12<07:06, 1.22s/it] Loading 0: 5%|▍ | 17/363 [00:13<03:45, 1.53it/s] Loading 0: 5%|▍ | 18/363 [00:13<03:18, 1.74it/s] Loading 0: 5%|▌ | 19/363 [00:13<02:43, 2.10it/s] Loading 0: 6%|▌ | 21/363 [00:15<03:49, 1.49it/s] Loading 0: 6%|▌ | 22/363 [00:17<05:23, 1.06it/s] Loading 0: 6%|▋ | 23/363 [00:19<06:49, 1.21s/it] Loading 0: 7%|▋ | 26/363 [00:19<03:39, 1.54it/s] Loading 0: 7%|▋ | 27/363 [00:19<03:12, 1.74it/s] Loading 0: 8%|▊ | 28/363 [00:20<02:39, 2.11it/s] Loading 0: 8%|▊ | 30/363 [00:20<01:56, 2.85it/s] Loading 0: 9%|▊ | 31/363 [00:20<01:49, 3.04it/s] Loading 0: 9%|▉ | 32/363 [00:20<01:32, 3.58it/s] Loading 0: 9%|▉ | 33/363 [00:20<01:21, 4.03it/s] Loading 0: 9%|▉ | 34/363 [00:22<03:50, 1.43it/s] Loading 0: 10%|▉ | 35/363 [00:24<05:41, 1.04s/it] Loading 0: 10%|▉ | 36/363 [00:26<07:11, 1.32s/it] Loading 0: 11%|█ | 39/363 [00:28<05:09, 1.05it/s] Loading 0: 11%|█ | 40/363 [00:30<06:15, 1.16s/it] Loading 0: 11%|█▏ | 41/363 [00:32<07:17, 1.36s/it] Loading 0: 12%|█▏ | 44/363 [00:33<03:55, 1.36it/s] Loading 0: 12%|█▏ | 45/363 [00:33<03:25, 1.55it/s] Loading 0: 13%|█▎ | 46/363 [00:33<02:49, 1.87it/s] Loading 0: 13%|█▎ | 48/363 [00:35<03:41, 1.42it/s] Loading 0: 13%|█▎ | 49/363 [00:37<05:04, 1.03it/s] Loading 0: 14%|█▍ | 50/363 [00:39<06:20, 1.22s/it] Loading 0: 15%|█▍ | 53/363 [00:39<03:24, 1.52it/s] Loading 0: 15%|█▍ | 54/363 [00:39<02:58, 1.73it/s] Loading 0: 15%|█▌ | 55/363 [00:40<02:27, 2.08it/s] Loading 0: 16%|█▌ | 57/363 [00:42<03:25, 1.49it/s] Loading 0: 16%|█▌ | 58/363 [00:44<04:48, 1.06it/s] Loading 0: 16%|█▋ | 59/363 [00:46<06:04, 1.20s/it] Loading 0: 17%|█▋ | 62/363 [00:46<03:14, 1.54it/s] Loading 0: 17%|█▋ | 63/363 [00:46<02:50, 1.75it/s] Loading 0: 18%|█▊ | 64/363 [00:46<02:21, 2.12it/s] Loading 0: 18%|█▊ | 65/363 [00:48<04:04, 1.22it/s] Loading 0: 18%|█▊ | 67/363 [00:48<02:42, 1.83it/s] Loading 0: 19%|█▊ | 68/363 [00:49<02:21, 2.08it/s] Loading 0: 19%|█▉ | 69/363 [00:49<01:55, 2.53it/s] Loading 0: 19%|█▉ | 70/363 [00:49<01:36, 3.03it/s] Loading 0: 20%|█▉ | 71/363 [00:51<03:43, 1.31it/s] Loading 0: 20%|█▉ | 72/363 [00:53<05:18, 1.09s/it] Loading 0: 20%|██ | 73/363 [00:55<06:34, 1.36s/it] Loading 0: 21%|██ | 76/363 [00:55<03:12, 1.49it/s] Loading 0: 21%|██ | 77/363 [00:55<02:45, 1.73it/s] Loading 0: 21%|██▏ | 78/363 [00:56<02:14, 2.11it/s] Loading 0: 22%|██▏ | 79/363 [00:57<03:58, 1.19it/s] Loading 0: 22%|██▏ | 80/363 [01:00<05:24, 1.15s/it] Loading 0: 23%|██▎ | 82/363 [01:00<03:22, 1.39it/s] Loading 0: 23%|██▎ | 83/363 [01:00<02:50, 1.64it/s] Loading 0: 23%|██▎ | 84/363 [01:00<02:16, 2.05it/s] Loading 0: 24%|██▎ | 86/363 [01:02<03:12, 1.44it/s] Loading 0: 24%|██▍ | 87/363 [01:04<04:36, 1.00s/it] Loading 0: 25%|██▍ | 90/363 [01:06<03:46, 1.21it/s] Loading 0: 25%|██▌ | 91/363 [01:08<04:43, 1.04s/it] Loading 0: 25%|██▌ | 92/363 [01:10<05:39, 1.25s/it] Loading 0: 26%|██▌ | 95/363 [01:10<03:07, 1.43it/s] Loading 0: 26%|██▋ | 96/363 [01:11<02:43, 1.63it/s] Loading 0: 27%|██▋ | 97/363 [01:11<02:15, 1.96it/s] Loading 0: 27%|██▋ | 99/363 [01:13<03:01, 1.46it/s] Loading 0: 28%|██▊ | 100/363 [01:15<04:11, 1.05it/s] Loading 0: 28%|██▊ | 101/363 [01:17<05:15, 1.20s/it] Loading 0: 29%|██▊ | 104/363 [01:17<02:49, 1.53it/s] Loading 0: 29%|██▉ | 105/363 [01:17<02:28, 1.74it/s] Loading 0: 29%|██▉ | 106/363 [01:17<02:02, 2.10it/s] Loading 0: 29%|██▉ | 107/363 [01:17<01:41, 2.52it/s] Loading 0: 30%|██▉ | 108/363 [01:19<03:21, 1.27it/s] Loading 0: 31%|███ | 111/363 [01:21<03:01, 1.39it/s] Loading 0: 31%|███ | 112/363 [01:23<04:01, 1.04it/s] Loading 0: 31%|███ | 113/363 [01:25<05:00, 1.20s/it] Loading 0: 32%|███▏ | 116/363 [01:26<02:44, 1.50it/s] Loading 0: 32%|███▏ | 117/363 [01:26<02:24, 1.70it/s] Loading 0: 33%|███▎ | 118/363 [01:26<01:59, 2.05it/s] Loading 0: 33%|███▎ | 120/363 [01:28<02:43, 1.48it/s] Loading 0: 33%|███▎ | 121/363 [01:30<03:49, 1.05it/s] Loading 0: 34%|███▎ | 122/363 [01:32<04:49, 1.20s/it] Loading 0: 34%|███▍ | 125/363 [01:32<02:35, 1.53it/s] Loading 0: 35%|███▍ | 126/363 [01:33<02:16, 1.74it/s] Loading 0: 35%|███▍ | 127/363 [01:33<01:52, 2.10it/s] Loading 0: 36%|███▌ | 129/363 [01:35<02:36, 1.49it/s] Loading 0: 36%|███▌ | 130/363 [01:37<03:40, 1.06it/s] Loading 0: 36%|███▌ | 131/363 [01:39<04:38, 1.20s/it] Loading 0: 37%|███▋ | 134/363 [01:39<02:28, 1.55it/s] Loading 0: 37%|███▋ | 135/363 [01:39<02:09, 1.76it/s] Loading 0: 37%|███▋ | 136/363 [01:39<01:47, 2.12it/s] Loading 0: 38%|███▊ | 138/363 [01:41<02:30, 1.50it/s] Loading 0: 38%|███▊ | 139/363 [01:43<03:31, 1.06it/s] Loading 0: 39%|███▊ | 140/363 [01:45<04:28, 1.20s/it] Loading 0: 39%|███▉ | 143/363 [01:46<02:23, 1.54it/s] Loading 0: 40%|███▉ | 144/363 [01:46<02:05, 1.75it/s] Loading 0: 40%|███▉ | 145/363 [01:46<01:43, 2.10it/s] Loading 0: 40%|████ | 147/363 [01:46<01:16, 2.83it/s] Loading 0: 41%|████ | 148/363 [01:47<01:11, 3.03it/s] Loading 0: 41%|████ | 149/363 [01:47<01:00, 3.56it/s] Loading 0: 41%|████▏ | 150/363 [01:47<00:52, 4.04it/s] Loading 0: 42%|████▏ | 151/363 [01:49<02:28, 1.43it/s] Loading 0: 42%|████▏ | 152/363 [01:51<03:40, 1.05s/it] Loading 0: 42%|████▏ | 153/363 [01:53<04:38, 1.32s/it] Loading 0: 43%|████▎ | 156/363 [01:55<03:18, 1.04it/s] Loading 0: 43%|████▎ | 157/363 [01:57<03:59, 1.16s/it] Loading 0: 44%|████▎ | 158/363 [01:59<04:39, 1.36s/it] Loading 0: 44%|████▍ | 161/363 [01:59<02:28, 1.36it/s] Loading 0: 45%|████▍ | 162/363 [01:59<02:09, 1.56it/s] Loading 0: 45%|████▍ | 163/363 [01:59<01:46, 1.88it/s] Loading 0: 45%|████▌ | 165/363 [02:01<02:19, 1.42it/s] Loading 0: 46%|████▌ | 166/363 [02:03<03:11, 1.03it/s] Loading 0: 46%|████▌ | 167/363 [02:05<03:58, 1.22s/it] Loading 0: 47%|████▋ | 170/363 [02:06<02:07, 1.52it/s] Loading 0: 47%|████▋ | 171/363 [02:06<01:51, 1.72it/s] Loading 0: 47%|████▋ | 172/363 [02:06<01:31, 2.08it/s] Loading 0: 48%|████▊ | 174/363 [02:08<02:06, 1.49it/s] Loading 0: 48%|████▊ | 175/363 [02:10<02:57, 1.06it/s] Loading 0: 48%|████▊ | 176/363 [02:12<03:44, 1.20s/it] Loading 0: 49%|████▉ | 179/363 [02:12<01:59, 1.54it/s] Loading 0: 50%|████▉ | 180/363 [02:12<01:44, 1.75it/s] Loading 0: 50%|████▉ | 181/363 [02:13<01:26, 2.11it/s] Loading 0: 50%|█████ | 182/363 [02:15<02:29, 1.21it/s] Loading 0: 51%|█████ | 184/363 [02:15<01:41, 1.77it/s] Loading 0: 51%|█████ | 185/363 [02:15<01:28, 2.02it/s] Loading 0: 51%|█████ | 186/363 [02:15<01:12, 2.44it/s] Loading 0: 52%|█████▏ | 187/363 [02:15<00:59, 2.94it/s] Loading 0: 52%|█████▏ | 188/363 [02:17<02:18, 1.26it/s] Loading 0: 52%|█████▏ | 189/363 [02:20<03:20, 1.15s/it] Loading 0: 53%|█████▎ | 192/363 [02:22<02:32, 1.12it/s] Loading 0: 53%|█████▎ | 193/363 [02:24<03:10, 1.12s/it] Loading 0: 53%|█████▎ | 194/363 [02:26<03:46, 1.34s/it] Loading 0: 54%|█████▍ | 197/363 [02:26<02:01, 1.37it/s] Loading 0: 55%|█████▍ | 198/363 [02:26<01:45, 1.56it/s] Loading 0: 55%|█████▍ | 199/363 [02:26<01:26, 1.89it/s] Loading 0: 55%|█████▌ | 201/363 [02:28<01:55, 1.41it/s] Loading 0: 56%|█████▌ | 202/363 [02:30<02:39, 1.01it/s] Loading 0: 56%|█████▌ | 203/363 [02:32<03:19, 1.25s/it] Loading 0: 57%|█████▋ | 206/363 [02:33<01:45, 1.48it/s] Loading 0: 57%|█████▋ | 207/363 [02:33<01:32, 1.69it/s] Loading 0: 57%|█████▋ | 208/363 [02:33<01:16, 2.03it/s] Loading 0: 58%|█████▊ | 210/363 [02:35<01:45, 1.45it/s] Loading 0: 58%|█████▊ | 211/363 [02:37<02:27, 1.03it/s] Loading 0: 58%|█████▊ | 212/363 [02:39<03:05, 1.23s/it] Loading 0: 59%|█████▉ | 215/363 [02:40<01:38, 1.50it/s] Loading 0: 60%|█████▉ | 216/363 [02:40<01:26, 1.71it/s] Loading 0: 60%|█████▉ | 217/363 [02:40<01:11, 2.04it/s] Loading 0: 60%|██████ | 218/363 [02:42<02:03, 1.18it/s] Loading 0: 60%|██████ | 219/363 [02:44<02:48, 1.17s/it] Loading 0: 61%|██████ | 221/363 [02:44<01:45, 1.34it/s] Loading 0: 61%|██████ | 222/363 [02:45<01:29, 1.58it/s] Loading 0: 61%|██████▏ | 223/363 [02:45<01:11, 1.95it/s] Loading 0: 62%|██████▏ | 224/363 [02:45<00:58, 2.37it/s] Loading 0: 62%|██████▏ | 225/363 [02:47<02:02, 1.13it/s] Loading 0: 63%|██████▎ | 228/363 [02:49<01:49, 1.23it/s] Loading 0: 63%|██████▎ | 229/363 [02:51<02:22, 1.06s/it] Loading 0: 63%|██████▎ | 230/363 [02:54<02:52, 1.30s/it] Loading 0: 64%|██████▍ | 233/363 [02:54<01:32, 1.40it/s] Loading 0: 64%|██████▍ | 234/363 [02:54<01:20, 1.60it/s] Loading 0: 65%|██████▍ | 235/363 [02:54<01:06, 1.93it/s] Loading 0: 65%|██████▌ | 237/363 [02:56<01:28, 1.42it/s] Loading 0: 66%|██████▌ | 238/363 [02:58<02:03, 1.01it/s] Loading 0: 66%|██████▌ | 239/363 [03:00<02:34, 1.25s/it] Loading 0: 67%|██████▋ | 242/363 [03:01<01:21, 1.48it/s] Loading 0: 67%|██████▋ | 243/363 [03:01<01:11, 1.68it/s] Loading 0: 67%|██████▋ | 244/363 [03:01<00:58, 2.03it/s] Loading 0: 68%|██████▊ | 246/363 [03:03<01:24, 1.38it/s] Loading 0: 68%|██████▊ | 247/363 [03:05<01:58, 1.02s/it] Loading 0: 68%|██████▊ | 248/363 [03:08<02:30, 1.31s/it] Loading 0: 69%|██████▉ | 251/363 [03:08<01:19, 1.42it/s] Loading 0: 69%|██████▉ | 252/363 [03:08<01:08, 1.61it/s] Loading 0: 70%|██████▉ | 253/363 [03:08<00:56, 1.94it/s] Loading 0: 70%|███████ | 255/363 [03:10<01:16, 1.42it/s] Loading 0: 71%|███████ | 256/363 [03:13<01:48, 1.01s/it] Loading 0: 71%|███████ | 257/363 [03:15<02:14, 1.26s/it] Loading 0: 72%|███████▏ | 260/363 [03:15<01:10, 1.46it/s] Loading 0: 72%|███████▏ | 261/363 [03:15<01:01, 1.67it/s] Loading 0: 72%|███████▏ | 262/363 [03:15<00:50, 2.01it/s] Loading 0: 73%|███████▎ | 264/363 [03:16<00:36, 2.72it/s] Loading 0: 73%|███████▎ | 265/363 [03:16<00:33, 2.92it/s] Loading 0: 73%|███████▎ | 266/363 [03:16<00:28, 3.41it/s] Loading 0: 74%|███████▎ | 267/363 [03:16<00:25, 3.84it/s] Loading 0: 74%|███████▍ | 268/363 [03:18<01:11, 1.33it/s] Loading 0: 74%|███████▍ | 269/363 [03:20<01:43, 1.10s/it] Loading 0: 74%|███████▍ | 270/363 [03:22<02:08, 1.39s/it] Loading 0: 75%|███████▌ | 273/363 [03:38<05:08, 3.43s/it] Loading 0: 75%|███████▌ | 274/363 [03:40<04:38, 3.13s/it] Loading 0: 76%|███████▌ | 275/363 [03:42<04:15, 2.91s/it] Loading 0: 77%|███████▋ | 278/363 [03:42<02:09, 1.52s/it] Loading 0: 77%|███████▋ | 279/363 [03:42<01:46, 1.27s/it] Loading 0: 77%|███████▋ | 280/363 [03:43<01:25, 1.03s/it] Loading 0: 78%|███████▊ | 282/363 [03:45<01:22, 1.02s/it] Loading 0: 78%|███████▊ | 283/363 [03:47<01:38, 1.23s/it] Loading 0: 78%|███████▊ | 284/363 [03:49<01:52, 1.43s/it] Loading 0: 79%|███████▉ | 287/363 [03:49<00:57, 1.32it/s] Loading 0: 79%|███████▉ | 288/363 [03:49<00:49, 1.53it/s] Loading 0: 80%|███████▉ | 289/363 [03:49<00:39, 1.86it/s] Loading 0: 80%|████████ | 291/363 [03:51<00:51, 1.39it/s] Loading 0: 80%|████████ | 292/363 [03:53<01:10, 1.01it/s] Loading 0: 81%|████████ | 293/363 [03:55<01:27, 1.25s/it] Loading 0: 82%|████████▏ | 296/363 [03:56<00:44, 1.49it/s] Loading 0: 82%|████████▏ | 297/363 [03:56<00:38, 1.71it/s] Loading 0: 82%|████████▏ | 298/363 [03:56<00:31, 2.07it/s] Loading 0: 82%|████████▏ | 299/363 [03:58<00:53, 1.19it/s] Loading 0: 83%|████████▎ | 301/363 [03:58<00:34, 1.79it/s] Loading 0: 83%|████████▎ | 302/363 [03:58<00:29, 2.06it/s] Loading 0: 83%|████████▎ | 303/363 [03:59<00:23, 2.51it/s] Loading 0: 84%|████████▎ | 304/363 [03:59<00:19, 3.02it/s] Loading 0: 84%|████████▍ | 305/363 [04:01<00:45, 1.28it/s] Loading 0: 84%|████████▍ | 306/363 [04:03<01:05, 1.14s/it] Loading 0: 85%|████████▌ | 309/363 [04:05<00:47, 1.13it/s] Loading 0: 85%|████████▌ | 310/363 [04:07<00:59, 1.12s/it] Loading 0: 86%|████████▌ | 311/363 [04:09<01:09, 1.34s/it] Loading 0: 87%|████████▋ | 314/363 [04:09<00:35, 1.38it/s] Loading 0: 87%|████████▋ | 315/363 [04:09<00:30, 1.58it/s] Loading 0: 87%|████████▋ | 316/363 [04:10<00:24, 1.91it/s] Loading 0: 88%|████████▊ | 318/363 [04:12<00:31, 1.41it/s] Loading 0: 88%|████████▊ | 319/363 [04:14<00:43, 1.01it/s] Loading 0: 88%|████████▊ | 320/363 [04:16<00:53, 1.24s/it] Loading 0: 89%|████████▉ | 323/363 [04:16<00:26, 1.50it/s] Loading 0: 89%|████████▉ | 324/363 [04:16<00:22, 1.72it/s] Loading 0: 90%|████████▉ | 325/363 [04:16<00:18, 2.07it/s] Loading 0: 90%|█████████ | 327/363 [04:18<00:24, 1.47it/s] Loading 0: 90%|█████████ | 328/363 [04:20<00:33, 1.03it/s] Loading 0: 91%|█████████ | 329/363 [04:22<00:41, 1.23s/it] Loading 0: 91%|█████████▏| 332/363 [04:23<00:20, 1.51it/s] Loading 0: 92%|█████████▏| 333/363 [04:23<00:17, 1.74it/s] Loading 0: 92%|█████████▏| 334/363 [04:23<00:13, 2.10it/s] Loading 0: 92%|█████████▏| 335/363 [04:25<00:23, 1.19it/s] Loading 0: 93%|█████████▎| 336/363 [04:27<00:31, 1.15s/it] Loading 0: 93%|█████████▎| 338/363 [04:27<00:18, 1.37it/s] Loading 0: 93%|█████████▎| 339/363 [04:28<00:14, 1.63it/s] Loading 0: 94%|█████████▎| 340/363 [04:28<00:11, 2.04it/s] Loading 0: 94%|█████████▍| 341/363 [04:28<00:09, 2.28it/s] Loading 0: 94%|█████████▍| 343/363 [04:30<00:13, 1.46it/s] Loading 0: 95%|█████████▌| 346/363 [04:32<00:11, 1.47it/s] Loading 0: 96%|█████████▌| 347/363 [04:34<00:14, 1.08it/s] Loading 0: 96%|█████████▌| 348/363 [04:36<00:17, 1.17s/it] Loading 0: 97%|█████████▋| 351/363 [04:37<00:07, 1.51it/s] Loading 0: 97%|█████████▋| 352/363 [04:37<00:06, 1.71it/s] Loading 0: 97%|█████████▋| 353/363 [04:37<00:04, 2.05it/s] Loading 0: 98%|█████████▊| 355/363 [04:39<00:05, 1.45it/s] Loading 0: 98%|█████████▊| 356/363 [04:41<00:06, 1.03it/s] Loading 0: 98%|█████████▊| 357/363 [04:43<00:07, 1.22s/it] Loading 0: 99%|█████████▉| 360/363 [04:43<00:01, 1.51it/s] Loading 0: 99%|█████████▉| 361/363 [04:44<00:01, 1.72it/s] Loading 0: 100%|█████████▉| 362/363 [04:44<00:00, 2.07it/s]
Job chaiml-mn24b-v2preferwi-15887-v1-mkmlizer completed after 408.77s with status: succeeded
Stopping job with name chaiml-mn24b-v2preferwi-15887-v1-mkmlizer
Pipeline stage MKMLizer completed in 409.72s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.14s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-mn24b-v2preferwi-15887-v1
Waiting for inference service chaiml-mn24b-v2preferwi-15887-v1 to be ready
Inference service chaiml-mn24b-v2preferwi-15887-v1 ready after 60.291234254837036s
Pipeline stage MKMLDeployer completed in 61.17s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.455866575241089s
Received healthy response to inference request in 2.8293943405151367s
Received healthy response to inference request in 2.006775140762329s
Received healthy response to inference request in 1.6085364818572998s
5 requests
1 failed requests
5th percentile: 1.6881842136383056
10th percentile: 1.7678319454193114
20th percentile: 1.9271274089813233
30th percentile: 2.096593427658081
40th percentile: 2.276230001449585
50th percentile: 2.455866575241089
60th percentile: 2.605277681350708
70th percentile: 2.754688787460327
80th percentile: 6.2975625038147
90th percentile: 13.233898830413821
95th percentile: 16.702066993713377
99th percentile: 19.476601524353025
mean time: 5.814161539077759
%s, retrying in %s seconds...
Received healthy response to inference request in 1.9248952865600586s
Received healthy response to inference request in 2.087920904159546s
Received healthy response to inference request in 2.4636800289154053s
Received healthy response to inference request in 1.5599498748779297s
Received healthy response to inference request in 1.3148481845855713s
5 requests
0 failed requests
5th percentile: 1.363868522644043
10th percentile: 1.4128888607025147
20th percentile: 1.510929536819458
30th percentile: 1.6329389572143556
40th percentile: 1.778917121887207
50th percentile: 1.9248952865600586
60th percentile: 1.9901055335998534
70th percentile: 2.0553157806396483
80th percentile: 2.1630727291107177
90th percentile: 2.3133763790130617
95th percentile: 2.3885282039642335
99th percentile: 2.448649663925171
mean time: 1.8702588558197022
Pipeline stage StressChecker completed in 41.31s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.69s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.74s
Shutdown handler de-registered
chaiml-mn24b-v2preferwi_15887_v1 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Skipping teardown as no inference service was successfully deployed
Pipeline stage MKMLProfilerDeleter completed in 0.11s
run pipeline stage %s
Running pipeline stage MKMLProfilerTemplater
Pipeline stage MKMLProfilerTemplater completed in 0.41s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeployer
Creating inference service chaiml-mn24b-v2preferwi-15887-v1-profiler
Waiting for inference service chaiml-mn24b-v2preferwi-15887-v1-profiler to be ready
Inference service chaiml-mn24b-v2preferwi-15887-v1-profiler ready after 60.611814975738525s
Pipeline stage MKMLProfilerDeployer completed in 61.52s
run pipeline stage %s
Running pipeline stage MKMLProfilerRunner
kubectl --kubeconfig /code/guanaco/guanaco_services/resources/kchai_coreweave_us_east_04a.yaml cp /code/guanaco/guanaco_inference_services/src/inference_scripts tenant-chaiml-guanaco/chaiml-mn24b-v2prefe4999f88626f9021917eebbae1f6d45df-deplo7r72z:/code/chaiverse_profiler_1758474166 --namespace tenant-chaiml-guanaco
kubectl --kubeconfig /code/guanaco/guanaco_services/resources/kchai_coreweave_us_east_04a.yaml exec -it chaiml-mn24b-v2prefe4999f88626f9021917eebbae1f6d45df-deplo7r72z --namespace tenant-chaiml-guanaco -- sh -c 'cd /code/chaiverse_profiler_1758474166 && python profiles.py profile --best_of_n 8 --auto_batch 5 --batches 1,5,10,15,20,25,30,35,40,45,50,55,60,65,70,75,80,85,90,95,100,105,110,115,120,125,130,135,140,145,150,155,160,165,170,175,180,185,190,195 --samples 200 --input_tokens 1024 --output_tokens 64 --summary /code/chaiverse_profiler_1758474166/summary.json'
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2669.89s
Shutdown handler de-registered
chaiml-mn24b-v2preferwi_15887_v1 status is now inactive due to auto deactivation removed underperforming models
chaiml-mn24b-v2preferwi_15887_v1 status is now torndown due to DeploymentManager action