developer_uid: NischayDnk
submission_id: chaiml-mistral24b-dpoex_64985_v2
model_name: chaiml-mistral24b-dpoex_64985_v2
model_group: ChaiML/mistral24b-dpoexp
status: torndown
timestamp: 2025-04-26T21:34:41+00:00
num_battles: 11114
num_wins: 6179
celo_rating: 1331.33
family_friendly_score: 0.5234
family_friendly_standard_error: 0.007063319899310805
submission_type: basic
model_repo: ChaiML/mistral24b-dpoexp1-s1-ftsftretryfull1295
model_architecture: MistralForCausalLM
model_num_parameters: 24096691200.0
best_of: 8
max_input_tokens: 1024
max_output_tokens: 64
reward_model: default
latencies: [{'batch_size': 1, 'throughput': 0.5331692375839778, 'latency_mean': 1.8755091261863708, 'latency_p50': 1.8731167316436768, 'latency_p90': 2.0852242946624755}, {'batch_size': 3, 'throughput': 1.0885887293187724, 'latency_mean': 2.7427239239215853, 'latency_p50': 2.7475212812423706, 'latency_p90': 3.0373053789138793}, {'batch_size': 5, 'throughput': 1.4024356190488212, 'latency_mean': 3.535453758239746, 'latency_p50': 3.541851758956909, 'latency_p90': 3.892368531227112}, {'batch_size': 6, 'throughput': 1.5150080835029847, 'latency_mean': 3.9372762525081635, 'latency_p50': 3.959068179130554, 'latency_p90': 4.361896896362304}, {'batch_size': 8, 'throughput': 1.6593106358741267, 'latency_mean': 4.787498224973678, 'latency_p50': 4.785611152648926, 'latency_p90': 5.363169550895691}, {'batch_size': 10, 'throughput': 1.7477620381188608, 'latency_mean': 5.66134446144104, 'latency_p50': 5.698574423789978, 'latency_p90': 6.268945956230164}]
gpu_counts: {'NVIDIA A100-SXM4-80GB': 1}
display_name: chaiml-mistral24b-dpoex_64985_v2
is_internal_developer: False
language_model: ChaiML/mistral24b-dpoexp1-s1-ftsftretryfull1295
model_size: 24B
ranking_group: single
throughput_3p7s: 1.46
us_pacific_date: 2025-04-26
win_ratio: 0.5559654489832644
generation_params: {'temperature': 0.5, 'top_p': 0.95, 'min_p': 0.025, 'top_k': 40, 'presence_penalty': 0.35, 'frequency_penalty': 0.35, 'stopping_words': ['\n', '<|im_start|>', '<|im_end|>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '<|system|>Family Friendly{memory}\n', 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{message}<|im_end|>\n', 'user_template': '<|im_start|>user\nYou:{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-mistral24b-dpoex-64985-v2-mkmlizer
Waiting for job on chaiml-mistral24b-dpoex-64985-v2-mkmlizer to finish
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: ║ _____ __ __ ║
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: ║ /___/ ║
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: ║ ║
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: ║ Version: 0.12.8 ║
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: ║ https://mk1.ai ║
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: ║ ║
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: ║ belonging to: ║
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: ║ ║
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: ║ Chai Research Corp. ║
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: ║ ║
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: Downloaded to shared memory in 67.167s
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp80o3w10x, device:0
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Failed to get response for submission chaiml-mistral31-24b-sf_58955_v2: HTTPConnectionPool(host='chaiml-mistral31-24b-sf-58955-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: quantized model in 55.032s
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: Processed model ChaiML/mistral24b-dpoexp1-s1-ftsftretryfull1295 in 122.200s
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: creating bucket guanaco-mkml-models
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-mistral24b-dpoex-64985-v2
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-mistral24b-dpoex-64985-v2/special_tokens_map.json
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-mistral24b-dpoex-64985-v2/config.json
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-mistral24b-dpoex-64985-v2/tokenizer_config.json
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-mistral24b-dpoex-64985-v2/tokenizer.json
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/chaiml-mistral24b-dpoex-64985-v2/flywheel_model.1.safetensors
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-mistral24b-dpoex-64985-v2/flywheel_model.0.safetensors
chaiml-mistral24b-dpoex-64985-v2-mkmlizer: Loading 0: 0%| | 0/363 [00:00<?, ?it/s] Loading 0: 1%| | 4/363 [00:00<00:09, 37.91it/s] Loading 0: 2%|▏ | 8/363 [00:00<00:12, 28.94it/s] Loading 0: 3%|▎ | 12/363 [00:00<00:11, 29.82it/s] Loading 0: 4%|▍ | 16/363 [00:00<00:13, 25.97it/s] Loading 0: 6%|▌ | 21/363 [00:00<00:11, 29.23it/s] Loading 0: 7%|▋ | 25/363 [00:00<00:12, 26.42it/s] Loading 0: 8%|▊ | 30/363 [00:01<00:10, 31.73it/s] Loading 0: 9%|▉ | 34/363 [00:01<00:15, 21.49it/s] Loading 0: 10%|█ | 37/363 [00:01<00:16, 20.11it/s] Loading 0: 11%|█ | 40/363 [00:01<00:15, 21.38it/s] Loading 0: 12%|█▏ | 43/363 [00:01<00:14, 21.78it/s] Loading 0: 13%|█▎ | 48/363 [00:01<00:12, 24.82it/s] Loading 0: 14%|█▍ | 51/363 [00:02<00:14, 21.40it/s] Loading 0: 16%|█▌ | 57/363 [00:02<00:11, 26.39it/s] Loading 0: 17%|█▋ | 60/363 [00:02<00:12, 23.51it/s] Loading 0: 18%|█▊ | 65/363 [00:02<00:11, 26.39it/s] Loading 0: 19%|█▉ | 69/363 [00:02<00:10, 29.24it/s] Loading 0: 20%|██ | 73/363 [00:03<00:16, 17.67it/s] Loading 0: 22%|██▏ | 79/363 [00:03<00:12, 22.24it/s] Loading 0: 23%|██▎ | 82/363 [00:03<00:12, 21.77it/s] Loading 0: 24%|██▎ | 86/363 [00:03<00:11, 23.40it/s] Loading 0: 25%|██▍ | 89/363 [00:03<00:11, 22.98it/s] Loading 0: 25%|██▌ | 92/363 [00:03<00:14, 18.98it/s] Loading 0: 27%|██▋ | 97/363 [00:04<00:10, 24.33it/s] Loading 0: 28%|██▊ | 100/363 [00:04<00:10, 24.70it/s] Loading 0: 28%|██▊ | 103/363 [00:04<00:10, 24.10it/s] Loading 0: 29%|██▉ | 107/363 [00:04<00:13, 19.16it/s] Loading 0: 31%|███ | 111/363 [00:04<00:12, 20.52it/s] Loading 0: 31%|███▏ | 114/363 [00:04<00:12, 19.71it/s] Loading 0: 33%|███▎ | 120/363 [00:05<00:09, 24.45it/s] Loading 0: 34%|███▍ | 123/363 [00:05<00:10, 22.39it/s] Loading 0: 35%|███▍ | 127/363 [00:05<00:09, 25.38it/s] Loading 0: 36%|███▌ | 130/363 [00:05<00:09, 25.78it/s] Loading 0: 37%|███▋ | 133/363 [00:05<00:09, 25.53it/s] Loading 0: 38%|███▊ | 138/363 [00:05<00:08, 27.39it/s] Loading 0: 39%|███▉ | 141/363 [00:05<00:09, 22.78it/s] Loading 0: 40%|████ | 146/363 [00:06<00:07, 28.16it/s] Loading 0: 41%|████▏ | 150/363 [00:06<00:09, 22.15it/s] Loading 0: 42%|████▏ | 153/363 [00:06<00:11, 19.01it/s] Loading 0: 43%|████▎ | 157/363 [00:06<00:09, 22.16it/s] Loading 0: 44%|████▍ | 160/363 [00:06<00:09, 22.36it/s] Loading 0: 45%|████▍ | 163/363 [00:06<00:08, 23.85it/s] Loading 0: 46%|████▌ | 166/363 [00:07<00:08, 24.27it/s] Loading 0: 47%|████▋ | 169/363 [00:07<00:08, 23.93it/s] Loading 0: 48%|████▊ | 174/363 [00:07<00:07, 26.83it/s] Loading 0: 49%|████▉ | 177/363 [00:07<00:08, 22.90it/s] Loading 0: 50%|█████ | 182/363 [00:07<00:07, 25.67it/s] Loading 0: 52%|█████▏ | 187/363 [00:07<00:08, 21.74it/s] Loading 0: 52%|█████▏ | 190/363 [00:08<00:08, 20.54it/s] Loading 0: 53%|█████▎ | 193/363 [00:08<00:07, 21.70it/s] Loading 0: 54%|█████▍ | 196/363 [00:08<00:07, 21.71it/s] Loading 0: 55%|█████▍ | 199/363 [00:08<00:07, 22.95it/s] Loading 0: 55%|█████▌ | 200/363 [00:23<00:07, 22.95it/s] Loading 0: 55%|█████▌ | 201/363 [00:23<04:07, 1.53s/it] Loading 0: 56%|█████▌ | 203/363 [00:23<03:12, 1.20s/it] Loading 0: 57%|█████▋ | 208/363 [00:23<01:44, 1.48it/s] Loading 0: 58%|█████▊ | 211/363 [00:23<01:17, 1.96it/s] Loading 0: 59%|█████▉ | 214/363 [00:23<00:55, 2.66it/s] Loading 0: 60%|██████ | 218/363 [00:23<00:37, 3.91it/s] Loading 0: 61%|██████ | 221/363 [00:23<00:28, 5.06it/s] Loading 0: 62%|██████▏ | 224/363 [00:24<00:22, 6.07it/s] Loading 0: 63%|██████▎ | 229/363 [00:24<00:14, 9.07it/s] Loading 0: 64%|██████▍ | 232/363 [00:24<00:12, 10.75it/s] Loading 0: 65%|██████▌ | 237/363 [00:24<00:08, 14.56it/s] Loading 0: 66%|██████▌ | 240/363 [00:24<00:07, 15.47it/s] Loading 0: 68%|██████▊ | 246/363 [00:24<00:05, 20.63it/s] Loading 0: 69%|██████▊ | 249/363 [00:25<00:05, 20.30it/s] Loading 0: 70%|███████ | 255/363 [00:25<00:04, 25.38it/s] Loading 0: 71%|███████▏ | 259/363 [00:25<00:04, 25.26it/s] Loading 0: 73%|███████▎ | 264/363 [00:25<00:03, 30.08it/s] Loading 0: 74%|███████▍ | 268/363 [00:25<00:04, 22.93it/s] Loading 0: 75%|███████▍ | 271/363 [00:25<00:04, 21.95it/s] Loading 0: 75%|███████▌ | 274/363 [00:26<00:03, 23.12it/s] Loading 0: 76%|███████▋ | 277/363 [00:26<00:03, 23.91it/s] Loading 0: 78%|███████▊ | 282/363 [00:26<00:02, 27.23it/s] Loading 0: 79%|███████▊ | 285/363 [00:26<00:03, 24.83it/s] Loading 0: 80%|████████ | 291/363 [00:26<00:02, 29.84it/s] Loading 0: 81%|████████▏ | 295/363 [00:26<00:02, 28.36it/s] Loading 0: 82%|████████▏ | 299/363 [00:26<00:02, 28.10it/s] Loading 0: 84%|████████▎ | 304/363 [00:27<00:02, 24.36it/s] Loading 0: 85%|████████▍ | 307/363 [00:27<00:02, 22.88it/s] Loading 0: 85%|████████▌ | 310/363 [00:27<00:02, 23.83it/s] Loading 0: 86%|████████▌ | 313/363 [00:27<00:02, 24.41it/s] Loading 0: 88%|████████▊ | 318/363 [00:27<00:01, 27.23it/s] Loading 0: 88%|████████▊ | 321/363 [00:27<00:01, 24.34it/s] Loading 0: 90%|█████████ | 327/363 [00:27<00:01, 29.52it/s] Loading 0: 91%|█████████ | 331/363 [00:28<00:01, 28.19it/s] Loading 0: 92%|█████████▏| 335/363 [00:28<00:01, 27.73it/s] Loading 0: 93%|█████████▎| 338/363 [00:28<00:00, 26.36it/s] Loading 0: 94%|█████████▍| 341/363 [00:35<00:13, 1.66it/s] Loading 0: 96%|█████████▌| 347/363 [00:35<00:05, 2.78it/s] Loading 0: 96%|█████████▋| 350/363 [00:35<00:03, 3.49it/s] Loading 0: 98%|█████████▊| 355/363 [00:35<00:01, 5.16it/s] Loading 0: 99%|█████████▉| 359/363 [00:35<00:00, 6.68it/s]
Failed to get response for submission chaiml-mistral31-24b-sf_58955_v2: HTTPConnectionPool(host='chaiml-mistral31-24b-sf-58955-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Job chaiml-mistral24b-dpoex-64985-v2-mkmlizer completed after 156.68s with status: succeeded
Stopping job with name chaiml-mistral24b-dpoex-64985-v2-mkmlizer
Pipeline stage MKMLizer completed in 157.15s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-mistral24b-dpoex-64985-v2
Waiting for inference service chaiml-mistral24b-dpoex-64985-v2 to be ready
Inference service chaiml-mistral24b-dpoex-64985-v2 ready after 130.42077326774597s
Pipeline stage MKMLDeployer completed in 130.92s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.5340051651000977s
Received healthy response to inference request in 1.9682738780975342s
Received healthy response to inference request in 2.1127190589904785s
Received healthy response to inference request in 1.966921329498291s
Received healthy response to inference request in 2.1584842205047607s
5 requests
0 failed requests
5th percentile: 1.9671918392181396
10th percentile: 1.9674623489379883
20th percentile: 1.9680033683776856
30th percentile: 1.9971629142761231
40th percentile: 2.0549409866333006
50th percentile: 2.1127190589904785
60th percentile: 2.1310251235961912
70th percentile: 2.1493311882019044
80th percentile: 2.2335884094238283
90th percentile: 2.383796787261963
95th percentile: 2.45890097618103
99th percentile: 2.518984327316284
mean time: 2.1480807304382323
Pipeline stage StressChecker completed in 11.93s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.94s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.66s
Shutdown handler de-registered
chaiml-mistral24b-dpoex_64985_v2 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Skipping teardown as no inference service was successfully deployed
Pipeline stage MKMLProfilerDeleter completed in 0.16s
run pipeline stage %s
Running pipeline stage MKMLProfilerTemplater
Pipeline stage MKMLProfilerTemplater completed in 0.12s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeployer
Creating inference service chaiml-mistral24b-dpoex-64985-v2-profiler
Waiting for inference service chaiml-mistral24b-dpoex-64985-v2-profiler to be ready
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 5003.40s
Shutdown handler de-registered
chaiml-mistral24b-dpoex_64985_v2 status is now inactive due to auto deactivation removed underperforming models
chaiml-mistral24b-dpoex_64985_v2 status is now torndown due to DeploymentManager action