developer_uid: bogoconic1
submission_id: bogoconic1-nemo-280k-av_36109_v1
model_name: bogoconic1-nemo-280k-av_36109_v1
model_group: bogoconic1/nemo-280k-avg
status: torndown
timestamp: 2025-05-02T12:35:52+00:00
num_battles: 7838
num_wins: 3478
celo_rating: 1249.06
family_friendly_score: 0.5611999999999999
family_friendly_standard_error: 0.0070178994008178825
submission_type: basic
model_repo: bogoconic1/nemo-280k-avg-chai-simpo-step200
model_architecture: MistralForCausalLM
model_num_parameters: 12772070400.0
best_of: 8
max_input_tokens: 1024
max_output_tokens: 64
latencies: [{'batch_size': 1, 'throughput': 0.6004045691988095, 'latency_mean': 1.6654461812973023, 'latency_p50': 1.658644437789917, 'latency_p90': 1.8441770792007446}, {'batch_size': 3, 'throughput': 1.0931988621041697, 'latency_mean': 2.739738144874573, 'latency_p50': 2.750510334968567, 'latency_p90': 3.0251948356628415}, {'batch_size': 5, 'throughput': 1.3215097929522892, 'latency_mean': 3.76529665350914, 'latency_p50': 3.7561779022216797, 'latency_p90': 4.182358884811402}, {'batch_size': 6, 'throughput': 1.3827381355128365, 'latency_mean': 4.309449011087418, 'latency_p50': 4.293062329292297, 'latency_p90': 4.932888078689575}, {'batch_size': 8, 'throughput': 1.43297909393696, 'latency_mean': 5.536604747772217, 'latency_p50': 5.544779896736145, 'latency_p90': 6.163014817237854}, {'batch_size': 10, 'throughput': 1.4791526855975448, 'latency_mean': 6.704898787736893, 'latency_p50': 6.698365926742554, 'latency_p90': 7.593003296852111}]
gpu_counts: {'NVIDIA RTX A5000': 1}
display_name: bogoconic1-nemo-280k-av_36109_v1
is_internal_developer: True
language_model: bogoconic1/nemo-280k-avg-chai-simpo-step200
model_size: 13B
ranking_group: single
throughput_3p7s: 1.32
us_pacific_date: 2025-05-02
win_ratio: 0.4437356468486859
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.0, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name bogoconic1-nemo-280k-av-36109-v1-mkmlizer
Waiting for job on bogoconic1-nemo-280k-av-36109-v1-mkmlizer to finish
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: ║ _____ __ __ ║
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: ║ /___/ ║
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: ║ ║
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: ║ Version: 0.12.8 ║
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: ║ https://mk1.ai ║
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: ║ ║
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: ║ The license key for the current software has been verified as ║
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: ║ belonging to: ║
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: ║ ║
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: ║ Chai Research Corp. ║
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: ║ ║
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: Downloaded to shared memory in 43.178s
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpotb3gi8e, device:0
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: quantized model in 36.157s
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: Processed model bogoconic1/nemo-280k-avg-chai-simpo-step200 in 79.336s
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: creating bucket guanaco-mkml-models
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/bogoconic1-nemo-280k-av-36109-v1
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/bogoconic1-nemo-280k-av-36109-v1/special_tokens_map.json
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/bogoconic1-nemo-280k-av-36109-v1/config.json
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/bogoconic1-nemo-280k-av-36109-v1/tokenizer_config.json
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/bogoconic1-nemo-280k-av-36109-v1/tokenizer.json
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/bogoconic1-nemo-280k-av-36109-v1/flywheel_model.0.safetensors
bogoconic1-nemo-280k-av-36109-v1-mkmlizer: Loading 0: 0%| | 0/363 [00:00<?, ?it/s] Loading 0: 1%|▏ | 5/363 [00:00<00:12, 28.98it/s] Loading 0: 3%|▎ | 12/363 [00:00<00:07, 47.27it/s] Loading 0: 5%|▍ | 18/363 [00:00<00:06, 49.55it/s] Loading 0: 7%|▋ | 24/363 [00:00<00:08, 41.91it/s] Loading 0: 9%|▊ | 31/363 [00:00<00:06, 48.50it/s] Loading 0: 10%|█ | 37/363 [00:00<00:07, 45.01it/s] Loading 0: 12%|█▏ | 42/363 [00:00<00:07, 44.04it/s] Loading 0: 13%|█▎ | 49/363 [00:01<00:06, 49.19it/s] Loading 0: 15%|█▌ | 55/363 [00:01<00:06, 46.13it/s] Loading 0: 17%|█▋ | 61/363 [00:01<00:08, 35.13it/s] Loading 0: 18%|█▊ | 65/363 [00:01<00:08, 34.36it/s] Loading 0: 20%|█▉ | 72/363 [00:01<00:07, 40.46it/s] Loading 0: 21%|██▏ | 78/363 [00:01<00:07, 39.73it/s] Loading 0: 23%|██▎ | 83/363 [00:01<00:06, 40.24it/s] Loading 0: 25%|██▍ | 89/363 [00:02<00:06, 44.79it/s] Loading 0: 26%|██▌ | 94/363 [00:02<00:06, 44.74it/s] Loading 0: 27%|██▋ | 99/363 [00:02<00:05, 44.86it/s] Loading 0: 29%|██▉ | 105/363 [00:02<00:06, 42.41it/s] Loading 0: 30%|███ | 110/363 [00:02<00:05, 43.75it/s] Loading 0: 32%|███▏ | 115/363 [00:02<00:05, 43.67it/s] Loading 0: 33%|███▎ | 120/363 [00:02<00:05, 41.57it/s] Loading 0: 35%|███▍ | 126/363 [00:02<00:05, 44.10it/s] Loading 0: 36%|███▌ | 131/363 [00:03<00:05, 45.09it/s] Loading 0: 37%|███▋ | 136/363 [00:03<00:06, 36.83it/s] Loading 0: 39%|███▉ | 142/363 [00:03<00:06, 31.61it/s] Loading 0: 40%|████ | 146/363 [00:03<00:06, 32.45it/s] Loading 0: 41%|████▏ | 150/363 [00:03<00:06, 31.74it/s] Loading 0: 43%|████▎ | 156/363 [00:03<00:05, 37.25it/s] Loading 0: 44%|████▍ | 160/363 [00:03<00:05, 37.01it/s] Loading 0: 45%|████▌ | 165/363 [00:04<00:05, 38.65it/s] Loading 0: 47%|████▋ | 170/363 [00:04<00:04, 39.34it/s] Loading 0: 48%|████▊ | 175/363 [00:04<00:04, 39.37it/s] Loading 0: 50%|████▉ | 180/363 [00:04<00:04, 41.59it/s] Loading 0: 51%|█████ | 185/363 [00:04<00:05, 34.68it/s] Loading 0: 53%|█████▎ | 192/363 [00:04<00:04, 41.72it/s] Loading 0: 54%|█████▍ | 197/363 [00:04<00:03, 42.31it/s] Loading 0: 56%|█████▌ | 202/363 [00:04<00:03, 42.73it/s] Loading 0: 57%|█████▋ | 208/363 [00:05<00:03, 40.89it/s] Loading 0: 59%|█████▊ | 213/363 [00:05<00:03, 39.53it/s] Loading 0: 60%|██████ | 218/363 [00:05<00:03, 41.58it/s] Loading 0: 61%|██████▏ | 223/363 [00:05<00:04, 32.54it/s] Loading 0: 63%|██████▎ | 227/363 [00:05<00:04, 33.43it/s] Loading 0: 64%|██████▎ | 231/363 [00:05<00:04, 32.99it/s] Loading 0: 65%|██████▌ | 237/363 [00:05<00:03, 38.71it/s] Loading 0: 67%|██████▋ | 242/363 [00:06<00:03, 40.17it/s] Loading 0: 68%|██████▊ | 247/363 [00:06<00:02, 41.38it/s] Loading 0: 69%|██████▉ | 252/363 [00:06<00:02, 43.06it/s] Loading 0: 71%|███████ | 257/363 [00:06<00:02, 36.32it/s] Loading 0: 73%|███████▎ | 264/363 [00:06<00:02, 43.79it/s] Loading 0: 74%|███████▍ | 269/363 [00:06<00:02, 42.45it/s] Loading 0: 75%|███████▌ | 274/363 [00:06<00:02, 40.63it/s] Loading 0: 77%|███████▋ | 279/363 [00:06<00:02, 41.76it/s] Loading 0: 78%|███████▊ | 284/363 [00:07<00:02, 35.20it/s] Loading 0: 80%|████████ | 291/363 [00:07<00:01, 41.31it/s] Loading 0: 82%|████████▏ | 296/363 [00:07<00:01, 41.53it/s] Loading 0: 83%|████████▎ | 301/363 [00:07<00:01, 43.59it/s] Loading 0: 84%|████████▍ | 306/363 [00:14<00:23, 2.47it/s] Loading 0: 85%|████████▌ | 310/363 [00:14<00:16, 3.21it/s] Loading 0: 87%|████████▋ | 314/363 [00:14<00:11, 4.21it/s] Loading 0: 88%|████████▊ | 320/363 [00:14<00:06, 6.28it/s] Loading 0: 90%|████████▉ | 326/363 [00:14<00:04, 8.78it/s] Loading 0: 91%|█████████ | 330/363 [00:14<00:03, 10.72it/s] Loading 0: 93%|█████████▎| 338/363 [00:14<00:01, 16.49it/s] Loading 0: 95%|█████████▍| 344/363 [00:15<00:00, 19.95it/s] Loading 0: 96%|█████████▌| 349/363 [00:15<00:00, 22.91it/s] Loading 0: 98%|█████████▊| 356/363 [00:15<00:00, 29.24it/s] Loading 0: 100%|█████████▉| 362/363 [00:15<00:00, 31.48it/s]
Job bogoconic1-nemo-280k-av-36109-v1-mkmlizer completed after 104.86s with status: succeeded
Stopping job with name bogoconic1-nemo-280k-av-36109-v1-mkmlizer
Pipeline stage MKMLizer completed in 105.35s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service bogoconic1-nemo-280k-av-36109-v1
Waiting for inference service bogoconic1-nemo-280k-av-36109-v1 to be ready
Inference service bogoconic1-nemo-280k-av-36109-v1 ready after 150.5405776500702s
Pipeline stage MKMLDeployer completed in 150.97s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.5914034843444824s
Received healthy response to inference request in 1.9385936260223389s
Received healthy response to inference request in 1.6923205852508545s
Received healthy response to inference request in 1.6478595733642578s
5 requests
1 failed requests
5th percentile: 1.656751775741577
10th percentile: 1.6656439781188965
20th percentile: 1.6834283828735352
30th percentile: 1.7415751934051513
40th percentile: 1.8400844097137452
50th percentile: 1.9385936260223389
60th percentile: 2.199717569351196
70th percentile: 2.460841512680054
80th percentile: 6.0981984138488805
90th percentile: 13.111788272857668
95th percentile: 16.618583202362057
99th percentile: 19.424019145965577
mean time: 5.599111080169678
%s, retrying in %s seconds...
Received healthy response to inference request in 1.586597204208374s
Received healthy response to inference request in 1.7530708312988281s
Received healthy response to inference request in 1.7381010055541992s
Received healthy response to inference request in 1.8997552394866943s
Received healthy response to inference request in 2.0210886001586914s
5 requests
0 failed requests
5th percentile: 1.616897964477539
10th percentile: 1.6471987247467041
20th percentile: 1.7078002452850343
30th percentile: 1.741094970703125
40th percentile: 1.7470829010009765
50th percentile: 1.7530708312988281
60th percentile: 1.8117445945739745
70th percentile: 1.8704183578491211
80th percentile: 1.9240219116210937
90th percentile: 1.9725552558898927
95th percentile: 1.996821928024292
99th percentile: 2.0162352657318117
mean time: 1.7997225761413573
Pipeline stage StressChecker completed in 39.41s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.71s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.72s
Shutdown handler de-registered
bogoconic1-nemo-280k-av_36109_v1 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Skipping teardown as no inference service was successfully deployed
Pipeline stage MKMLProfilerDeleter completed in 0.12s
run pipeline stage %s
Running pipeline stage MKMLProfilerTemplater
Pipeline stage MKMLProfilerTemplater completed in 0.11s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeployer
Creating inference service bogoconic1-nemo-280k-av-36109-v1-profiler
Waiting for inference service bogoconic1-nemo-280k-av-36109-v1-profiler to be ready
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2887.18s
Shutdown handler de-registered
bogoconic1-nemo-280k-av_36109_v1 status is now inactive due to auto deactivation removed underperforming models
bogoconic1-nemo-280k-av_36109_v1 status is now torndown due to DeploymentManager action