developer_uid: rica40325
submission_id: rica40325-mixdata-1w5step_v4
model_name: rica40325-mixdata-1w5step_v1
model_group: rica40325/mixdata-1w5ste
status: inactive
timestamp: 2024-12-06T06:46:50+00:00
num_battles: 9199
num_wins: 4634
celo_rating: 1260.13
family_friendly_score: 0.5816
family_friendly_standard_error: 0.006976266049972578
submission_type: basic
model_repo: rica40325/mixdata-1w5step
model_architecture: MistralForCausalLM
model_num_parameters: 12772070400.0
best_of: 8
max_input_tokens: 1024
max_output_tokens: 64
latencies: [{'batch_size': 1, 'throughput': 0.6109002037405511, 'latency_mean': 1.6368715274333954, 'latency_p50': 1.6446239948272705, 'latency_p90': 1.7957545280456542}, {'batch_size': 3, 'throughput': 1.1270552658316708, 'latency_mean': 2.657253005504608, 'latency_p50': 2.6348506212234497, 'latency_p90': 2.9194580793380736}, {'batch_size': 5, 'throughput': 1.3726659814034095, 'latency_mean': 3.621292425394058, 'latency_p50': 3.605116844177246, 'latency_p90': 4.07519850730896}, {'batch_size': 6, 'throughput': 1.4363558735246351, 'latency_mean': 4.15871719956398, 'latency_p50': 4.173927068710327, 'latency_p90': 4.652536940574646}, {'batch_size': 8, 'throughput': 1.492580717604994, 'latency_mean': 5.325674246549607, 'latency_p50': 5.303531169891357, 'latency_p90': 5.938041687011719}, {'batch_size': 10, 'throughput': 1.5299182313962223, 'latency_mean': 6.477791068553924, 'latency_p50': 6.536268591880798, 'latency_p90': 7.283669090270996}]
gpu_counts: {'NVIDIA RTX A5000': 1}
display_name: rica40325-mixdata-1w5step_v1
is_internal_developer: False
language_model: rica40325/mixdata-1w5step
model_size: 13B
ranking_group: single
throughput_3p7s: 1.39
us_pacific_date: 2024-12-05
win_ratio: 0.5037504076530057
generation_params: {'temperature': 0.95, 'top_p': 0.95, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
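The `formatter` templates above describe how a conversation is flattened into a single prompt string. The sketch below shows one plausible assembly order (persona, then scenario prompt, then chat turns, then the response prefix). The helper name `build_prompt`, the turn representation, and the ordering are assumptions made for illustration; the platform's real formatter is not shown in this log, and truncation to `max_input_tokens` is not modelled here.

```python
# Templates copied from the `formatter` field above.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Assemble a prompt string (hypothetical helper, assumed ordering).

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template leaves the bot's line open for the model to complete.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Made-up example values, not taken from the log.
example = build_prompt(
    bot_name="Ava",
    user_name="Sam",
    memory="A cheerful guide.",
    prompt="Ava greets a traveller.",
    turns=[("user", "Hi!"), ("bot", "Hello there."), ("user", "Where am I?")],
)
print(example)
```

Because every message template ends in a newline and `stopping_words` is `['\n']`, generation under this layout would stop at the end of the bot's single-line reply.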
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name rica40325-mixdata-1w5step-v4-mkmlizer
Waiting for job on rica40325-mixdata-1w5step-v4-mkmlizer to finish
rica40325-mixdata-1w5step-v4-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
rica40325-mixdata-1w5step-v4-mkmlizer: ║ [flywheel ASCII-art logo] ║
rica40325-mixdata-1w5step-v4-mkmlizer: ║ ║
rica40325-mixdata-1w5step-v4-mkmlizer: ║ Version: 0.11.12 ║
rica40325-mixdata-1w5step-v4-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
rica40325-mixdata-1w5step-v4-mkmlizer: ║ https://mk1.ai ║
rica40325-mixdata-1w5step-v4-mkmlizer: ║ ║
rica40325-mixdata-1w5step-v4-mkmlizer: ║ The license key for the current software has been verified as ║
rica40325-mixdata-1w5step-v4-mkmlizer: ║ belonging to: ║
rica40325-mixdata-1w5step-v4-mkmlizer: ║ ║
rica40325-mixdata-1w5step-v4-mkmlizer: ║ Chai Research Corp. ║
rica40325-mixdata-1w5step-v4-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
rica40325-mixdata-1w5step-v4-mkmlizer: ║ Expiration: 2025-01-15 23:59:59 ║
rica40325-mixdata-1w5step-v4-mkmlizer: ║ ║
rica40325-mixdata-1w5step-v4-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Connection pool is full, discarding connection: %s. Connection pool size: %s
rica40325-mixdata-1w5step-v4-mkmlizer: Downloaded to shared memory in 54.105s
rica40325-mixdata-1w5step-v4-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpgaas1kbd, device:0
rica40325-mixdata-1w5step-v4-mkmlizer: Saving flywheel model at /dev/shm/model_cache
rica40325-mixdata-1w5step-v4-mkmlizer: quantized model in 40.740s
rica40325-mixdata-1w5step-v4-mkmlizer: Processed model rica40325/mixdata-1w5step in 94.845s
rica40325-mixdata-1w5step-v4-mkmlizer: creating bucket guanaco-mkml-models
rica40325-mixdata-1w5step-v4-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
rica40325-mixdata-1w5step-v4-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/rica40325-mixdata-1w5step-v4
rica40325-mixdata-1w5step-v4-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/rica40325-mixdata-1w5step-v4/config.json
rica40325-mixdata-1w5step-v4-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/rica40325-mixdata-1w5step-v4/special_tokens_map.json
rica40325-mixdata-1w5step-v4-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/rica40325-mixdata-1w5step-v4/tokenizer_config.json
rica40325-mixdata-1w5step-v4-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/rica40325-mixdata-1w5step-v4/tokenizer.json
rica40325-mixdata-1w5step-v4-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/rica40325-mixdata-1w5step-v4/flywheel_model.0.safetensors
rica40325-mixdata-1w5step-v4-mkmlizer: Loading 0: 98%|█████████▊| 357/363 [00:19<00:01, 5.11it/s]
Job rica40325-mixdata-1w5step-v4-mkmlizer completed after 124.31s with status: succeeded
Stopping job with name rica40325-mixdata-1w5step-v4-mkmlizer
Pipeline stage MKMLizer completed in 126.11s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.19s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service rica40325-mixdata-1w5step-v4
Waiting for inference service rica40325-mixdata-1w5step-v4 to be ready
Retrying (%r) after connection broken by '%r': %s
Inference service rica40325-mixdata-1w5step-v4 ready after 183.02805614471436s
Pipeline stage MKMLDeployer completed in 185.15s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.069836139678955s
Received healthy response to inference request in 1.8288381099700928s
Received healthy response to inference request in 1.668210506439209s
Received healthy response to inference request in 1.276036262512207s
5 requests
1 failed requests
5th percentile: 1.3544711112976073
10th percentile: 1.4329059600830079
20th percentile: 1.5897756576538087
30th percentile: 1.7003360271453858
40th percentile: 1.7645870685577392
50th percentile: 1.8288381099700928
60th percentile: 1.9252373218536376
70th percentile: 2.0216365337371824
80th percentile: 5.693565511703494
90th percentile: 12.941024255752566
95th percentile: 16.564753627777097
99th percentile: 19.46373712539673
mean time: 5.40628080368042
%s, retrying in %s seconds...
Received healthy response to inference request in 2.143554449081421s
Received healthy response to inference request in 1.6156468391418457s
Received healthy response to inference request in 1.7409234046936035s
Received healthy response to inference request in 1.8969414234161377s
Received healthy response to inference request in 1.7416296005249023s
5 requests
0 failed requests
5th percentile: 1.6407021522521972
10th percentile: 1.6657574653625489
20th percentile: 1.715868091583252
30th percentile: 1.7410646438598634
40th percentile: 1.7413471221923829
50th percentile: 1.7416296005249023
60th percentile: 1.8037543296813965
70th percentile: 1.8658790588378906
80th percentile: 1.9462640285491943
90th percentile: 2.0449092388153076
95th percentile: 2.0942318439483643
99th percentile: 2.1336899280548094
mean time: 1.827739143371582
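The percentile figures in the StressChecker summary appear to follow linear interpolation between sorted samples. The sketch below recomputes them from the five healthy latencies of the second run; the `percentile` helper is an illustrative reimplementation, not the checker's own code, and the choice of interpolation method is inferred from the numbers rather than stated in the log.

```python
import math

# Latencies (seconds) of the five healthy responses in the second run above.
latencies = [
    2.143554449081421,
    1.6156468391418457,
    1.7409234046936035,
    1.8969414234161377,
    1.7416296005249023,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    # Fractional position of the percentile within the sorted samples.
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only five samples, the upper percentiles are dominated by the single slowest request, which is why one timed-out request in the first run pushes the 80th percentile and above far beyond the median.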
Pipeline stage StressChecker completed in 39.19s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.01s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 2.05s
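To see where the submission run spent its time, the stage-completion lines can be parsed and totalled. This is a minimal sketch: the regular expression and the `stage_durations` helper are assumptions written for this log's line format, not part of the pipeline itself.

```python
import re

# Stage-completion lines copied from the first pipeline run above.
LOG_LINES = """\
Pipeline stage MKMLizer completed in 126.11s
Pipeline stage MKMLTemplater completed in 0.19s
Pipeline stage MKMLDeployer completed in 185.15s
Pipeline stage StressChecker completed in 39.19s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.01s
Pipeline stage TriggerMKMLProfilingPipeline completed in 2.05s
""".splitlines()

# Assumed line format: "Pipeline stage <Name> completed in <seconds>s".
STAGE_RE = re.compile(r"^Pipeline stage (\w+) completed in ([\d.]+)s$")


def stage_durations(lines):
    """Map each stage name to its reported duration in seconds."""
    durations = {}
    for line in lines:
        match = STAGE_RE.match(line)
        if match:
            durations[match.group(1)] = float(match.group(2))
    return durations


durations = stage_durations(LOG_LINES)
total = sum(durations.values())
slowest = max(durations, key=durations.get)

# Print stages from slowest to fastest with their share of the total.
for name, seconds in sorted(durations.items(), key=lambda kv: -kv[1]):
    print(f"{name:40s} {seconds:8.2f}s {seconds / total:6.1%}")
print(f"total: {total:.2f}s, slowest stage: {slowest}")
```

In this run the deployer and the MKMLizer together account for most of the wall-clock time, while the two trigger stages only enqueue follow-up work and return almost immediately.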
Shutdown handler de-registered
rica40325-mixdata-1w5step_v4 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2579.15s
Shutdown handler de-registered
rica40325-mixdata-1w5step_v4 status is now inactive due to auto deactivation removed underperforming models