developer_uid: ZZYABC
submission_id: zzyabc-unsloth3_v2
model_name: zzyabc-unsloth3_v2
model_group: ZZYABC/unsloth3
status: torndown
timestamp: 2025-01-10T09:57:50+00:00
num_battles: 10109
num_wins: 4934
celo_rating: 1252.98
family_friendly_score: 0.596
family_friendly_standard_error: 0.006939510069161943
submission_type: basic
model_repo: ZZYABC/unsloth3
model_architecture: MistralForCausalLM
model_num_parameters: 12772070400.0
best_of: 8
max_input_tokens: 1024
max_output_tokens: 64
latencies: [{'batch_size': 1, 'throughput': 0.6145227563216542, 'latency_mean': 1.6272130608558655, 'latency_p50': 1.6300787925720215, 'latency_p90': 1.780651593208313}, {'batch_size': 3, 'throughput': 1.161243127980456, 'latency_mean': 2.5767478823661802, 'latency_p50': 2.578060269355774, 'latency_p90': 2.8254116058349608}, {'batch_size': 5, 'throughput': 1.4116792811043832, 'latency_mean': 3.5301609110832213, 'latency_p50': 3.5131722688674927, 'latency_p90': 4.009198927879333}, {'batch_size': 6, 'throughput': 1.4622919811846087, 'latency_mean': 4.075997632741928, 'latency_p50': 4.11133337020874, 'latency_p90': 4.568452382087708}, {'batch_size': 8, 'throughput': 1.5471912848199414, 'latency_mean': 5.146336812973022, 'latency_p50': 5.11857533454895, 'latency_p90': 5.735694813728332}, {'batch_size': 10, 'throughput': 1.5835107812533864, 'latency_mean': 6.264237396717071, 'latency_p50': 6.250047326087952, 'latency_p90': 7.221095132827759}]
gpu_counts: {'NVIDIA RTX A5000': 1}
display_name: zzyabc-unsloth3_v2
is_internal_developer: False
language_model: ZZYABC/unsloth3
model_size: 13B
ranking_group: single
throughput_3p7s: 1.44
us_pacific_date: 2025-01-10
win_ratio: 0.48807992877633793
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
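The formatter entry above only lists templates; it does not say how they are combined. The following is a minimal sketch of one plausible assembly, assuming the order memory, prompt, chat turns, then the response cue. The function `build_prompt`, the `turns` structure, and the sample conversation are hypothetical and do not come from the log. Token truncation (`max_input_tokens`, `truncate_by_message`) is not modeled.

```python
# Templates copied from the formatter entry above.
formatter = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Assumed assembly order: memory, prompt, chat turns, response cue."""
    parts = [
        formatter["memory_template"].format(bot_name=bot_name, memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response cue ends without a newline so the model continues the bot's line.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Hypothetical conversation, for illustration only.
text = build_prompt(
    bot_name="Aria",
    user_name="Sam",
    memory="A cheerful tour guide.",
    prompt="Aria greets a visitor.",
    turns=[
        ("user", "Hi there!"),
        ("bot", "Welcome!"),
        ("user", "What should I see first?"),
    ],
)
print(text)
```

Since generation_params lists '\n' under stopping_words, a completion would end at the first newline, which is consistent with the one-line-per-message templates.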
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name zzyabc-unsloth3-v2-mkmlizer
Waiting for job on zzyabc-unsloth3-v2-mkmlizer to finish
zzyabc-unsloth3-v2-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
zzyabc-unsloth3-v2-mkmlizer: ║ _____ __ __ ║
zzyabc-unsloth3-v2-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
zzyabc-unsloth3-v2-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
zzyabc-unsloth3-v2-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
zzyabc-unsloth3-v2-mkmlizer: ║ /___/ ║
zzyabc-unsloth3-v2-mkmlizer: ║ ║
zzyabc-unsloth3-v2-mkmlizer: ║ Version: 0.11.12 ║
zzyabc-unsloth3-v2-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
zzyabc-unsloth3-v2-mkmlizer: ║ https://mk1.ai ║
zzyabc-unsloth3-v2-mkmlizer: ║ ║
zzyabc-unsloth3-v2-mkmlizer: ║ The license key for the current software has been verified as ║
zzyabc-unsloth3-v2-mkmlizer: ║ belonging to: ║
zzyabc-unsloth3-v2-mkmlizer: ║ ║
zzyabc-unsloth3-v2-mkmlizer: ║ Chai Research Corp. ║
zzyabc-unsloth3-v2-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
zzyabc-unsloth3-v2-mkmlizer: ║ Expiration: 2025-04-15 23:59:59 ║
zzyabc-unsloth3-v2-mkmlizer: ║ ║
zzyabc-unsloth3-v2-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
zzyabc-unsloth3-v2-mkmlizer: Downloaded to shared memory in 60.789s
zzyabc-unsloth3-v2-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpgeeurmdw, device:0
zzyabc-unsloth3-v2-mkmlizer: Saving flywheel model at /dev/shm/model_cache
zzyabc-unsloth3-v2-mkmlizer: quantized model in 40.645s
zzyabc-unsloth3-v2-mkmlizer: Processed model ZZYABC/unsloth3 in 101.435s
zzyabc-unsloth3-v2-mkmlizer: creating bucket guanaco-mkml-models
zzyabc-unsloth3-v2-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
zzyabc-unsloth3-v2-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/zzyabc-unsloth3-v2
zzyabc-unsloth3-v2-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/zzyabc-unsloth3-v2/config.json
zzyabc-unsloth3-v2-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/zzyabc-unsloth3-v2/special_tokens_map.json
zzyabc-unsloth3-v2-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/zzyabc-unsloth3-v2/tokenizer_config.json
zzyabc-unsloth3-v2-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/zzyabc-unsloth3-v2/tokenizer.json
zzyabc-unsloth3-v2-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/zzyabc-unsloth3-v2/flywheel_model.0.safetensors
zzyabc-unsloth3-v2-mkmlizer: Loading 0: 98%|█████████▊| 357/363 [00:19<00:01, 5.08it/s]
Job zzyabc-unsloth3-v2-mkmlizer completed after 128.14s with status: succeeded
Stopping job with name zzyabc-unsloth3-v2-mkmlizer
Pipeline stage MKMLizer completed in 129.11s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.16s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service zzyabc-unsloth3-v2
Waiting for inference service zzyabc-unsloth3-v2 to be ready
Failed to get response for submission zzyabc-unsloth3_v1: HTTPConnectionPool(host='zzyabc-unsloth3-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission zzyabc-unsloth3_v1: HTTPConnectionPool(host='zzyabc-unsloth3-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Retrying (%r) after connection broken by '%r': %s
Inference service zzyabc-unsloth3-v2 ready after 372.3723223209381s
Pipeline stage MKMLDeployer completed in 373.30s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.4795541763305664s
Received healthy response to inference request in 1.1252429485321045s
read tcp 127.0.0.1:60092->127.0.0.1:8080: read: connection reset by peer
Received unhealthy response to inference request!
Received healthy response to inference request in 1.8065104484558105s
Received healthy response to inference request in 1.4057824611663818s
5 requests
1 failed requests
5th percentile: 0.3842353343963623
10th percentile: 0.5694872379302979
20th percentile: 0.939991044998169
30th percentile: 1.1813508510589599
40th percentile: 1.2935666561126709
50th percentile: 1.4057824611663818
60th percentile: 1.4352911472320558
70th percentile: 1.4647998332977294
80th percentile: 1.5449454307556152
90th percentile: 1.675727939605713
95th percentile: 1.7411191940307618
99th percentile: 1.7934321975708007
mean time: 1.203214693069458
%s, retrying in %s seconds...
Received healthy response to inference request in 1.356543779373169s
Received healthy response to inference request in 1.7264013290405273s
Received healthy response to inference request in 1.080320119857788s
dial tcp 127.0.0.1:8080: connect: connection refused
Received unhealthy response to inference request!
Received healthy response to inference request in 1.9786770343780518s
5 requests
1 failed requests
5th percentile: 0.3141788005828857
10th percentile: 0.5057141304016113
20th percentile: 0.8887847900390625
30th percentile: 1.1355648517608643
40th percentile: 1.2460543155670165
50th percentile: 1.356543779373169
60th percentile: 1.5044867992401123
70th percentile: 1.6524298191070557
80th percentile: 1.7768564701080323
90th percentile: 1.877766752243042
95th percentile: 1.9282218933105468
99th percentile: 1.9685860061645508
mean time: 1.2529171466827393
%s, retrying in %s seconds...
Received healthy response to inference request in 0.6627931594848633s
Received healthy response to inference request in 1.961585521697998s
Received healthy response to inference request in 1.8436932563781738s
Received healthy response to inference request in 0.9641973972320557s
Received healthy response to inference request in 0.9142904281616211s
5 requests
0 failed requests
5th percentile: 0.7130926132202149
10th percentile: 0.7633920669555664
20th percentile: 0.8639909744262695
30th percentile: 0.924271821975708
40th percentile: 0.9442346096038818
50th percentile: 0.9641973972320557
60th percentile: 1.3159957408905028
70th percentile: 1.66779408454895
80th percentile: 1.8672717094421387
90th percentile: 1.9144286155700683
95th percentile: 1.9380070686340332
99th percentile: 1.956869831085205
mean time: 1.2693119525909424
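The percentile figures in the final StressChecker summary above are consistent with linear interpolation between the sorted latencies. The sketch below recomputes them from the five healthy response times of that run. The helper `percentile` is hypothetical and illustrative; the log does not show how StressChecker computes these values.

```python
import math


def percentile(values, q):
    """q-th percentile (0-100) using linear interpolation between neighbors."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0
    lo = math.floor(pos)
    hi = math.ceil(pos)
    frac = pos - lo
    return ordered[lo] + (ordered[hi] - ordered[lo]) * frac


# Latencies (seconds) of the five healthy responses in the final run above.
latencies = [
    0.6627931594848633,
    1.961585521697998,
    1.8436932563781738,
    0.9641973972320557,
    0.9142904281616211,
]

levels = (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99)
summary = {q: percentile(latencies, q) for q in levels}
mean_time = sum(latencies) / len(latencies)

for q, value in summary.items():
    print(f"{q}th percentile: {value}")
print(f"mean time: {mean_time}")
```

With only five samples per run, the low and high percentiles are interpolations between the two smallest or two largest latencies, so they describe this small sample rather than a stable tail estimate.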
Pipeline stage StressChecker completed in 22.51s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.65s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.70s
Shutdown handler de-registered
zzyabc-unsloth3_v2 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2645.36s
Shutdown handler de-registered
zzyabc-unsloth3_v2 status is now inactive due to auto deactivation removed underperforming models
zzyabc-unsloth3_v2 status is now torndown due to DeploymentManager action