developer_uid: chai_backend_admin
submission_id: jellywibble-tyler-james_83302_v1
model_name: jellywibble-tyler-james_83302_v1
model_group: Jellywibble/Tyler-James-
status: torndown
timestamp: 2025-03-23T10:39:21+00:00
num_battles: 9166
num_wins: 4221
celo_rating: 1251.21
family_friendly_score: 0.5562
family_friendly_standard_error: 0.007026258748437891
submission_type: basic
model_repo: Jellywibble/Tyler-James-Park-Detective-Park_Tyler-250323081558
model_architecture: MistralForCausalLM
model_num_parameters: 22247282688.0
best_of: 8
max_input_tokens: 1024
max_output_tokens: 64
reward_model: default
latencies: [{'batch_size': 1, 'throughput': 0.3749324312944808, 'latency_mean': 2.667070838212967, 'latency_p50': 2.658678650856018, 'latency_p90': 2.960782289505005}, {'batch_size': 2, 'throughput': 0.5826238919308736, 'latency_mean': 3.4214627861976625, 'latency_p50': 3.421158790588379, 'latency_p90': 3.7402286529541016}, {'batch_size': 3, 'throughput': 0.7383484306296553, 'latency_mean': 4.049583455324173, 'latency_p50': 4.058091163635254, 'latency_p90': 4.4513383388519285}, {'batch_size': 4, 'throughput': 0.8525593987412846, 'latency_mean': 4.670076733827591, 'latency_p50': 4.696420073509216, 'latency_p90': 5.193815588951111}, {'batch_size': 5, 'throughput': 0.9339391374179263, 'latency_mean': 5.3266505682468415, 'latency_p50': 5.3001627922058105, 'latency_p90': 5.992231392860412}]
gpu_counts: {'NVIDIA RTX A6000': 1}
display_name: jellywibble-tyler-james_83302_v1
is_internal_developer: True
language_model: Jellywibble/Tyler-James-Park-Detective-Park_Tyler-250323081558
model_size: 22B
ranking_group: single
throughput_3p7s: 0.66
us_pacific_date: 2025-03-23
win_ratio: 0.4605062186340825
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['</s>', '####\n', 'You:', '\n', '####'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': 'You: {message}\n', 'response_template': '####\n{bot_name}:', 'truncate_by_message': True}
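The `formatter` templates above can be read as a recipe for assembling the prompt. The following is a minimal sketch of that assembly, assuming each turn is rendered with its template, turns are concatenated in order, and the `response_template` comes last. The helper `build_prompt`, the bot name `Tyler`, and the sample messages are hypothetical. The service's real formatter, including how `truncate_by_message` and `max_input_tokens` are applied, is not shown in this record.

```python
# Template values copied from the `formatter` field of this record.
FORMATTER = {
    "memory_template": "",
    "prompt_template": "",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "You: {message}\n",
    "response_template": "####\n{bot_name}:",
}


def build_prompt(bot_name, turns, formatter=FORMATTER):
    """Hypothetical helper: render (speaker, message) turns into one prompt.

    `speaker` is either "bot" or "user". Truncation is deliberately omitted.
    """
    parts = []
    for speaker, message in turns:
        key = "bot_template" if speaker == "bot" else "user_template"
        # str.format ignores unused keyword arguments, so passing bot_name
        # to the user template is harmless.
        parts.append(formatter[key].format(bot_name=bot_name, message=message))
    # The response template cues the model to speak as the bot.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


prompt = build_prompt("Tyler", [("bot", "Hello."), ("user", "Hi!")])
print(prompt)
```

Under this reading, the `stopping_words` in `generation_params` (`'\n'`, `'You:'`, `'####'`) would end generation at the close of a single bot line.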
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name jellywibble-tyler-james-83302-v1-mkmlizer
Waiting for job on jellywibble-tyler-james-83302-v1-mkmlizer to finish
jellywibble-tyler-james-83302-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
jellywibble-tyler-james-83302-v1-mkmlizer: ║ _____ __ __ ║
jellywibble-tyler-james-83302-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
jellywibble-tyler-james-83302-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
jellywibble-tyler-james-83302-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
jellywibble-tyler-james-83302-v1-mkmlizer: ║ /___/ ║
jellywibble-tyler-james-83302-v1-mkmlizer: ║ ║
jellywibble-tyler-james-83302-v1-mkmlizer: ║ Version: 0.12.8 ║
jellywibble-tyler-james-83302-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
jellywibble-tyler-james-83302-v1-mkmlizer: ║ https://mk1.ai ║
jellywibble-tyler-james-83302-v1-mkmlizer: ║ ║
jellywibble-tyler-james-83302-v1-mkmlizer: ║ The license key for the current software has been verified as ║
jellywibble-tyler-james-83302-v1-mkmlizer: ║ belonging to: ║
jellywibble-tyler-james-83302-v1-mkmlizer: ║ ║
jellywibble-tyler-james-83302-v1-mkmlizer: ║ Chai Research Corp. ║
jellywibble-tyler-james-83302-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
jellywibble-tyler-james-83302-v1-mkmlizer: ║ Expiration: 2025-04-15 23:59:59 ║
jellywibble-tyler-james-83302-v1-mkmlizer: ║ ║
jellywibble-tyler-james-83302-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
jellywibble-tyler-james-83302-v1-mkmlizer: Downloaded to shared memory in 70.234s
jellywibble-tyler-james-83302-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpb61wwusw, device:0
jellywibble-tyler-james-83302-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
jellywibble-tyler-james-83302-v1-mkmlizer: quantized model in 43.649s
jellywibble-tyler-james-83302-v1-mkmlizer: Processed model Jellywibble/Tyler-James-Park-Detective-Park_Tyler-250323081558 in 113.884s
jellywibble-tyler-james-83302-v1-mkmlizer: creating bucket guanaco-mkml-models
jellywibble-tyler-james-83302-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
jellywibble-tyler-james-83302-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/jellywibble-tyler-james-83302-v1
jellywibble-tyler-james-83302-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/jellywibble-tyler-james-83302-v1/config.json
jellywibble-tyler-james-83302-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/jellywibble-tyler-james-83302-v1/special_tokens_map.json
jellywibble-tyler-james-83302-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/jellywibble-tyler-james-83302-v1/tokenizer_config.json
jellywibble-tyler-james-83302-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/jellywibble-tyler-james-83302-v1/tokenizer.json
jellywibble-tyler-james-83302-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/jellywibble-tyler-james-83302-v1/flywheel_model.1.safetensors
jellywibble-tyler-james-83302-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/jellywibble-tyler-james-83302-v1/flywheel_model.0.safetensors
jellywibble-tyler-james-83302-v1-mkmlizer: Loading 0: 99%|█████████▉| 503/507 [00:31<00:00, 32.14it/s]
Job jellywibble-tyler-james-83302-v1-mkmlizer completed after 145.39s with status: succeeded
Stopping job with name jellywibble-tyler-james-83302-v1-mkmlizer
Pipeline stage MKMLizer completed in 145.92s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service jellywibble-tyler-james-83302-v1
Waiting for inference service jellywibble-tyler-james-83302-v1 to be ready
Inference service jellywibble-tyler-james-83302-v1 ready after 90.4322612285614s
Pipeline stage MKMLDeployer completed in 90.97s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.1187520027160645s
Received healthy response to inference request in 2.4823269844055176s
Received healthy response to inference request in 2.413788080215454s
Received healthy response to inference request in 2.700895071029663s
Received healthy response to inference request in 2.549315929412842s
5 requests
0 failed requests
5th percentile: 2.4274958610534667
10th percentile: 2.4412036418914793
20th percentile: 2.468619203567505
30th percentile: 2.4957247734069825
40th percentile: 2.522520351409912
50th percentile: 2.549315929412842
60th percentile: 2.6099475860595702
70th percentile: 2.6705792427062987
80th percentile: 2.7844664573669435
90th percentile: 2.951609230041504
95th percentile: 3.035180616378784
99th percentile: 3.1020377254486085
mean time: 2.653015613555908
Pipeline stage StressChecker completed in 14.64s
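The StressChecker percentiles above are consistent with linear interpolation between the sorted samples of the five logged response times. The sketch below reproduces them in plain Python. The function name `percentile` is illustrative, and the assumption that the checker interpolates this way is inferred from the numbers, not stated in the log.

```python
import math

# The five response times logged by StressChecker, in seconds.
latencies = [
    3.1187520027160645,
    2.4823269844055176,
    2.413788080215454,
    2.700895071029663,
    2.549315929412842,
]


def percentile(values, q):
    """Percentile q (0-100) using linear interpolation between sorted samples."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for q in (5, 50, 90, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only five samples, the upper percentiles are dominated by the single slowest request (about 3.12 s), so they say little about tail latency under load.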
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.74s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.68s
Shutdown handler de-registered
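The per-stage timings of the deployment run above can be pulled out of the log with a regular expression. This is a minimal sketch: the `LOG` string repeats the six completion lines from this record, and the names `STAGE_RE`, `durations`, and `slowest` are illustrative.

```python
import re

# Stage completion lines copied from the deployment run in this record.
LOG = """\
Pipeline stage MKMLizer completed in 145.92s
Pipeline stage MKMLTemplater completed in 0.15s
Pipeline stage MKMLDeployer completed in 90.97s
Pipeline stage StressChecker completed in 14.64s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.74s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.68s
"""

STAGE_RE = re.compile(r"Pipeline stage (\w+) completed in ([\d.]+)s")

# Map each stage name to its duration in seconds.
durations = {name: float(seconds) for name, seconds in STAGE_RE.findall(LOG)}
total = sum(durations.values())
slowest = max(durations, key=durations.get)

for name, seconds in durations.items():
    print(f"{name:40s} {seconds:8.2f}s  {seconds / total:6.1%}")
print(f"total: {total:.2f}s, slowest stage: {slowest}")
```

The two long stages, MKMLizer (download, quantization, upload) and MKMLDeployer (waiting for the inference service), account for nearly all of the run time.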
jellywibble-tyler-james_83302_v1 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4864.32s
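The summary fields in the metadata header can be cross-checked against each other. The sketch below recomputes `win_ratio` from `num_wins` and `num_battles`, and compares `family_friendly_standard_error` with a binomial standard error. The sample size of 5000 is an assumption: it is the value that reproduces the reported figure and does not appear anywhere in this record.

```python
import math

# Values copied from the metadata header of this record.
num_battles = 9166
num_wins = 4221
reported_win_ratio = 0.4605062186340825
family_friendly_score = 0.5562
reported_standard_error = 0.007026258748437891

# ASSUMPTION: number of scored samples, inferred from the reported
# standard error rather than stated in the record.
ASSUMED_SAMPLES = 5000

win_ratio = num_wins / num_battles

# Standard error of a proportion p estimated from n samples: sqrt(p(1-p)/n).
binomial_se = math.sqrt(
    family_friendly_score * (1 - family_friendly_score) / ASSUMED_SAMPLES
)

print(f"win_ratio: {win_ratio} (reported {reported_win_ratio})")
print(f"binomial SE: {binomial_se} (reported {reported_standard_error})")
```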
Shutdown handler de-registered
jellywibble-tyler-james_83302_v1 status is now inactive due to auto deactivation removed underperforming models
jellywibble-tyler-james_83302_v1 status is now torndown due to DeploymentManager action