developer_uid: chai_backend_admin
submission_id: chaiml-ezstorytellingsft_v1
model_name: chaiml-ezstorytellingsft_v1
model_group: ChaiML/EZStorytellingSFT
status: inactive
timestamp: 2024-11-19T05:02:42+00:00
num_battles: 11991
num_wins: 6205
celo_rating: 1262.02
family_friendly_score: 0.571
family_friendly_standard_error: 0.006999414261207862
submission_type: basic
model_repo: ChaiML/EZStorytellingSFT
model_architecture: MistralForCausalLM
model_num_parameters: 22247282688.0
best_of: 8
max_input_tokens: 1024
max_output_tokens: 64
latencies: [{'batch_size': 1, 'throughput': 0.37247111894972884, 'latency_mean': 2.6846907210350035, 'latency_p50': 2.6912052631378174, 'latency_p90': 2.9502146244049072}, {'batch_size': 2, 'throughput': 0.5898553089668106, 'latency_mean': 3.381033843755722, 'latency_p50': 3.388073682785034, 'latency_p90': 3.731257462501526}, {'batch_size': 3, 'throughput': 0.7487008255527687, 'latency_mean': 3.99477211356163, 'latency_p50': 3.9788665771484375, 'latency_p90': 4.4234055280685425}, {'batch_size': 4, 'throughput': 0.8618952947875239, 'latency_mean': 4.604243195056915, 'latency_p50': 4.599393844604492, 'latency_p90': 5.11224217414856}, {'batch_size': 5, 'throughput': 0.9482908736761461, 'latency_mean': 5.23461089015007, 'latency_p50': 5.257051467895508, 'latency_p90': 5.817193555831909}]
gpu_counts: {'NVIDIA RTX A6000': 1}
display_name: chaiml-ezstorytellingsft_v1
is_internal_developer: True
language_model: ChaiML/EZStorytellingSFT
model_size: 22B
ranking_group: single
throughput_3p7s: 0.68
us_pacific_date: 2024-11-18
win_ratio: 0.5174714369110166
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-ezstorytellingsft-v1-mkmlizer
Waiting for job on chaiml-ezstorytellingsft-v1-mkmlizer to finish
chaiml-ezstorytellingsft-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-ezstorytellingsft-v1-mkmlizer: ║ _____ __ __ ║
chaiml-ezstorytellingsft-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-ezstorytellingsft-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-ezstorytellingsft-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-ezstorytellingsft-v1-mkmlizer: ║ /___/ ║
chaiml-ezstorytellingsft-v1-mkmlizer: ║ ║
chaiml-ezstorytellingsft-v1-mkmlizer: ║ Version: 0.11.12 ║
chaiml-ezstorytellingsft-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-ezstorytellingsft-v1-mkmlizer: ║ https://mk1.ai ║
chaiml-ezstorytellingsft-v1-mkmlizer: ║ ║
chaiml-ezstorytellingsft-v1-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-ezstorytellingsft-v1-mkmlizer: ║ belonging to: ║
chaiml-ezstorytellingsft-v1-mkmlizer: ║ ║
chaiml-ezstorytellingsft-v1-mkmlizer: ║ Chai Research Corp. ║
chaiml-ezstorytellingsft-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-ezstorytellingsft-v1-mkmlizer: ║ Expiration: 2025-01-15 23:59:59 ║
chaiml-ezstorytellingsft-v1-mkmlizer: ║ ║
chaiml-ezstorytellingsft-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Failed to get response for submission blend_sadof_2024-10-11: ('http://chaiml-nemo-20241010-tie-5991-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'read tcp 127.0.0.1:52910->127.0.0.1:8080: read: connection reset by peer\n')
chaiml-ezstorytellingsft-v1-mkmlizer: Downloaded to shared memory in 90.050s
chaiml-ezstorytellingsft-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpoop_obas, device:0
chaiml-ezstorytellingsft-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-ezstorytellingsft-v1-mkmlizer: quantized model in 43.439s
chaiml-ezstorytellingsft-v1-mkmlizer: Processed model ChaiML/EZStorytellingSFT in 133.489s
chaiml-ezstorytellingsft-v1-mkmlizer: creating bucket guanaco-mkml-models
chaiml-ezstorytellingsft-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-ezstorytellingsft-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-ezstorytellingsft-v1
chaiml-ezstorytellingsft-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-ezstorytellingsft-v1/config.json
chaiml-ezstorytellingsft-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-ezstorytellingsft-v1/special_tokens_map.json
chaiml-ezstorytellingsft-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-ezstorytellingsft-v1/tokenizer_config.json
chaiml-ezstorytellingsft-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-ezstorytellingsft-v1/tokenizer.json
chaiml-ezstorytellingsft-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/chaiml-ezstorytellingsft-v1/flywheel_model.1.safetensors
chaiml-ezstorytellingsft-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-ezstorytellingsft-v1/flywheel_model.0.safetensors
chaiml-ezstorytellingsft-v1-mkmlizer: Loading 0: 0%| | 0/507 [00:00<?, ?it/s] Loading 0: 1%| | 5/507 [00:00<00:20, 24.18it/s] Loading 0: 2%|▏ | 12/507 [00:00<00:12, 40.50it/s] Loading 0: 3%|▎ | 17/507 [00:00<00:12, 38.55it/s] Loading 0: 4%|▍ | 22/507 [00:00<00:12, 39.18it/s] Loading 0: 5%|▌ | 27/507 [00:00<00:11, 40.58it/s] Loading 0: 6%|▋ | 32/507 [00:00<00:13, 34.15it/s] Loading 0: 8%|▊ | 39/507 [00:01<00:11, 41.61it/s] Loading 0: 9%|▊ | 44/507 [00:01<00:11, 41.08it/s] Loading 0: 10%|▉ | 49/507 [00:01<00:12, 36.35it/s] Loading 0: 10%|█ | 53/507 [00:01<00:17, 26.55it/s] Loading 0: 11%|█ | 57/507 [00:01<00:17, 26.20it/s] Loading 0: 12%|█▏ | 63/507 [00:01<00:14, 31.40it/s] Loading 0: 13%|█▎ | 67/507 [00:01<00:14, 31.31it/s] Loading 0: 14%|█▍ | 72/507 [00:02<00:12, 35.17it/s] Loading 0: 15%|█▌ | 78/507 [00:02<00:10, 39.45it/s] Loading 0: 16%|█▋ | 83/507 [00:02<00:11, 37.37it/s] Loading 0: 17%|█▋ | 88/507 [00:02<00:11, 37.70it/s] Loading 0: 18%|█▊ | 92/507 [00:02<00:11, 36.99it/s] Loading 0: 19%|█▉ | 96/507 [00:02<00:11, 37.11it/s] Loading 0: 20%|█▉ | 100/507 [00:02<00:11, 35.05it/s] Loading 0: 21%|██ | 105/507 [00:02<00:10, 37.68it/s] Loading 0: 21%|██▏ | 109/507 [00:03<00:11, 33.87it/s] Loading 0: 22%|██▏ | 113/507 [00:03<00:16, 24.55it/s] Loading 0: 23%|██▎ | 116/507 [00:03<00:16, 23.09it/s] Loading 0: 24%|██▍ | 122/507 [00:03<00:14, 26.13it/s] Loading 0: 25%|██▌ | 127/507 [00:03<00:12, 30.36it/s] Loading 0: 26%|██▌ | 131/507 [00:04<00:13, 27.88it/s] Loading 0: 27%|██▋ | 136/507 [00:04<00:11, 31.41it/s] Loading 0: 28%|██▊ | 140/507 [00:04<00:12, 29.42it/s] Loading 0: 29%|██▉ | 147/507 [00:04<00:09, 37.30it/s] Loading 0: 30%|██▉ | 152/507 [00:04<00:09, 36.82it/s] Loading 0: 31%|███ | 157/507 [00:04<00:09, 37.62it/s] Loading 0: 32%|███▏ | 162/507 [00:04<00:08, 40.31it/s] Loading 0: 33%|███▎ | 167/507 [00:04<00:07, 42.54it/s] Loading 0: 34%|███▍ | 172/507 [00:05<00:12, 27.12it/s] Loading 0: 35%|███▍ | 176/507 [00:05<00:12, 27.54it/s] Loading 0: 36%|███▌ | 183/507 [00:05<00:09, 35.30it/s] Loading 0: 37%|███▋ | 188/507 [00:05<00:08, 35.75it/s] Loading 0: 38%|███▊ | 193/507 [00:05<00:08, 36.92it/s] Loading 0: 39%|███▉ | 198/507 [00:05<00:07, 38.84it/s] Loading 0: 40%|████ | 203/507 [00:06<00:09, 33.04it/s] Loading 0: 41%|████▏ | 210/507 [00:06<00:07, 40.40it/s] Loading 0: 42%|████▏ | 215/507 [00:06<00:07, 40.10it/s] Loading 0: 43%|████▎ | 220/507 [00:06<00:08, 35.43it/s] Loading 0: 44%|████▍ | 224/507 [00:06<00:10, 27.70it/s] Loading 0: 45%|████▌ | 230/507 [00:06<00:09, 29.26it/s] Loading 0: 47%|████▋ | 237/507 [00:07<00:07, 35.57it/s] Loading 0: 48%|████▊ | 241/507 [00:07<00:07, 35.25it/s] Loading 0: 49%|████▊ | 246/507 [00:07<00:06, 37.63it/s] Loading 0: 50%|████▉ | 251/507 [00:07<00:06, 37.38it/s] Loading 0: 50%|█████ | 255/507 [00:07<00:06, 37.82it/s] Loading 0: 51%|█████ | 259/507 [00:07<00:06, 36.61it/s] Loading 0: 52%|█████▏ | 264/507 [00:07<00:06, 38.97it/s] Loading 0: 53%|█████▎ | 268/507 [00:07<00:06, 37.11it/s] Loading 0: 54%|█████▍ | 273/507 [00:07<00:05, 39.48it/s] Loading 0: 55%|█████▍ | 278/507 [00:08<00:05, 38.81it/s] Loading 0: 56%|█████▌ | 282/507 [00:08<00:05, 38.84it/s] Loading 0: 56%|█████▋ | 286/507 [00:08<00:08, 26.29it/s] Loading 0: 57%|█████▋ | 290/507 [00:08<00:07, 27.56it/s] Loading 0: 58%|█████▊ | 294/507 [00:08<00:07, 27.52it/s] Loading 0: 59%|█████▉ | 299/507 [00:23<00:07, 27.52it/s] Loading 0: 59%|█████▉ | 300/507 [00:23<03:15, 1.06it/s] Loading 0: 60%|█████▉ | 302/507 [00:23<02:45, 1.24it/s] Loading 0: 61%|██████ | 307/507 [00:23<01:46, 1.87it/s] Loading 0: 61%|██████ | 310/507 [00:23<01:23, 2.37it/s] Loading 0: 62%|██████▏ | 313/507 [00:23<01:03, 3.08it/s] Loading 0: 63%|██████▎ | 318/507 [00:23<00:39, 4.73it/s] Loading 0: 64%|██████▎ | 322/507 [00:24<00:29, 6.37it/s] Loading 0: 64%|██████▍ | 327/507 [00:24<00:19, 9.10it/s] Loading 0: 65%|██████▌ | 331/507 [00:24<00:15, 11.40it/s] Loading 0: 66%|██████▌ | 335/507 [00:24<00:12, 14.29it/s] Loading 0: 67%|██████▋ | 340/507 [00:24<00:10, 15.92it/s] Loading 0: 68%|██████▊ | 344/507 [00:24<00:08, 18.56it/s] Loading 0: 69%|██████▊ | 348/507 [00:24<00:08, 19.86it/s] Loading 0: 69%|██████▉ | 352/507 [00:24<00:06, 23.14it/s] Loading 0: 70%|███████ | 356/507 [00:25<00:06, 23.57it/s] Loading 0: 71%|███████ | 361/507 [00:25<00:05, 28.39it/s] Loading 0: 72%|███████▏ | 365/507 [00:25<00:05, 26.15it/s] Loading 0: 73%|███████▎ | 370/507 [00:25<00:04, 30.71it/s] Loading 0: 74%|███████▍ | 374/507 [00:25<00:04, 29.42it/s] Loading 0: 75%|███████▌ | 381/507 [00:25<00:03, 38.42it/s] Loading 0: 76%|███████▌ | 386/507 [00:25<00:03, 38.97it/s] Loading 0: 77%|███████▋ | 391/507 [00:26<00:03, 33.74it/s] Loading 0: 78%|███████▊ | 395/507 [00:26<00:04, 26.58it/s] Loading 0: 79%|███████▉ | 401/507 [00:26<00:03, 28.65it/s] Loading 0: 80%|████████ | 408/507 [00:26<00:02, 35.20it/s] Loading 0: 81%|████████▏ | 412/507 [00:26<00:02, 34.42it/s] Loading 0: 82%|████████▏ | 417/507 [00:26<00:02, 36.11it/s] Loading 0: 83%|████████▎ | 421/507 [00:27<00:02, 34.81it/s] Loading 0: 84%|████████▍ | 426/507 [00:27<00:02, 36.36it/s] Loading 0: 85%|████████▍ | 430/507 [00:27<00:02, 33.82it/s] Loading 0: 86%|████████▌ | 435/507 [00:27<00:01, 36.25it/s] Loading 0: 87%|████████▋ | 439/507 [00:27<00:01, 35.56it/s] Loading 0: 88%|████████▊ | 445/507 [00:27<00:01, 38.94it/s] Loading 0: 89%|████████▉ | 451/507 [00:27<00:01, 39.48it/s] Loading 0: 90%|████████▉ | 455/507 [00:30<00:08, 6.38it/s] Loading 0: 91%|█████████ | 459/507 [00:30<00:06, 7.90it/s] Loading 0: 92%|█████████▏| 465/507 [00:30<00:03, 11.04it/s] Loading 0: 93%|█████████▎| 472/507 [00:30<00:02, 15.83it/s] Loading 0: 94%|█████████▍| 476/507 [00:30<00:01, 18.01it/s] Loading 0: 95%|█████████▍| 481/507 [00:30<00:01, 21.73it/s] Loading 0: 96%|█████████▌| 485/507 [00:30<00:00, 23.36it/s] Loading 0: 97%|█████████▋| 490/507 [00:30<00:00, 27.31it/s] Loading 0: 97%|█████████▋| 494/507 [00:31<00:00, 28.80it/s] Loading 0: 98%|█████████▊| 499/507 [00:31<00:00, 32.43it/s] Loading 0: 99%|█████████▉| 503/507 [00:31<00:00, 32.09it/s] Loading 0: 100%|██████████| 507/507 [00:31<00:00, 33.78it/s]
Job chaiml-ezstorytellingsft-v1-mkmlizer completed after 166.06s with status: succeeded
Stopping job with name chaiml-ezstorytellingsft-v1-mkmlizer
Pipeline stage MKMLizer completed in 166.61s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.30s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-ezstorytellingsft-v1
Waiting for inference service chaiml-ezstorytellingsft-v1 to be ready
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Inference service chaiml-ezstorytellingsft-v1 ready after 211.8223888874054s
Pipeline stage MKMLDeployer completed in 212.46s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.168067693710327s
Received healthy response to inference request in 2.904852867126465s
Received healthy response to inference request in 3.1666648387908936s
Received healthy response to inference request in 2.8177332878112793s
Received healthy response to inference request in 2.7204606533050537s
5 requests
0 failed requests
5th percentile: 2.7399151802062987
10th percentile: 2.759369707107544
20th percentile: 2.7982787609100344
30th percentile: 2.8351572036743162
40th percentile: 2.8700050354003905
50th percentile: 2.904852867126465
60th percentile: 3.0095776557922362
70th percentile: 3.1143024444580076
80th percentile: 3.1669454097747805
90th percentile: 3.167506551742554
95th percentile: 3.1677871227264403
99th percentile: 3.16801157951355
mean time: 2.955555868148804
Pipeline stage StressChecker completed in 16.49s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.74s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 2.27s
Shutdown handler de-registered
chaiml-ezstorytellingsft_v1 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3289.12s
Shutdown handler de-registered
chaiml-ezstorytellingsft_v1 status is now inactive due to auto deactivation removed underperforming models