Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-espresso-story-24-3275-v4-mkmlizer
Waiting for job on chaiml-espresso-story-24-3275-v4-mkmlizer to finish
chaiml-espresso-story-24-3275-v4-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-espresso-story-24-3275-v4-mkmlizer: ║ _____ __ __ ║
chaiml-espresso-story-24-3275-v4-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-espresso-story-24-3275-v4-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-espresso-story-24-3275-v4-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-espresso-story-24-3275-v4-mkmlizer: ║ /___/ ║
chaiml-espresso-story-24-3275-v4-mkmlizer: ║ ║
chaiml-espresso-story-24-3275-v4-mkmlizer: ║ Version: 0.11.12 ║
chaiml-espresso-story-24-3275-v4-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-espresso-story-24-3275-v4-mkmlizer: ║ https://mk1.ai ║
chaiml-espresso-story-24-3275-v4-mkmlizer: ║ ║
chaiml-espresso-story-24-3275-v4-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-espresso-story-24-3275-v4-mkmlizer: ║ belonging to: ║
chaiml-espresso-story-24-3275-v4-mkmlizer: ║ ║
chaiml-espresso-story-24-3275-v4-mkmlizer: ║ Chai Research Corp. ║
chaiml-espresso-story-24-3275-v4-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-espresso-story-24-3275-v4-mkmlizer: ║ Expiration: 2025-01-15 23:59:59 ║
chaiml-espresso-story-24-3275-v4-mkmlizer: ║ ║
chaiml-espresso-story-24-3275-v4-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-espresso-story-24-3275-v4-mkmlizer: quantized model in 43.897s
chaiml-espresso-story-24-3275-v4-mkmlizer: Processed model ChaiML/espresso_story_241205_albert_v1_sft_2epoch_128alpha in 103.502s
chaiml-espresso-story-24-3275-v4-mkmlizer: creating bucket guanaco-mkml-models
chaiml-espresso-story-24-3275-v4-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-espresso-story-24-3275-v4-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-espresso-story-24-3275-v4
chaiml-espresso-story-24-3275-v4-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-espresso-story-24-3275-v4/config.json
chaiml-espresso-story-24-3275-v4-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-espresso-story-24-3275-v4/special_tokens_map.json
chaiml-espresso-story-24-3275-v4-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-espresso-story-24-3275-v4/tokenizer_config.json
chaiml-espresso-story-24-3275-v4-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-espresso-story-24-3275-v4/tokenizer.json
chaiml-espresso-story-24-3275-v4-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-espresso-story-24-3275-v4/flywheel_model.0.safetensors
chaiml-espresso-story-24-3275-v4-mkmlizer:
Loading 0: 0%| | 0/507 [00:00<?, ?it/s]
Loading 0: 1%| | 5/507 [00:00<00:18, 26.67it/s]
Loading 0: 2%|▏ | 12/507 [00:00<00:11, 41.27it/s]
Loading 0: 3%|▎ | 17/507 [00:00<00:13, 36.65it/s]
Loading 0: 4%|▍ | 22/507 [00:00<00:12, 37.97it/s]
Loading 0: 5%|▌ | 27/507 [00:00<00:11, 40.22it/s]
Loading 0: 6%|▋ | 32/507 [00:00<00:13, 34.19it/s]
Loading 0: 8%|▊ | 39/507 [00:01<00:11, 41.37it/s]
Loading 0: 9%|▊ | 44/507 [00:01<00:11, 41.16it/s]
Loading 0: 10%|▉ | 49/507 [00:01<00:12, 36.01it/s]
Loading 0: 10%|█ | 53/507 [00:01<00:16, 26.89it/s]
Loading 0: 11%|█ | 57/507 [00:01<00:17, 25.73it/s]
Loading 0: 12%|█▏ | 63/507 [00:01<00:14, 30.62it/s]
Loading 0: 13%|█▎ | 67/507 [00:01<00:14, 31.32it/s]
Loading 0: 14%|█▍ | 73/507 [00:02<00:12, 34.02it/s]
Loading 0: 16%|█▌ | 80/507 [00:02<00:12, 35.21it/s]
Loading 0: 17%|█▋ | 85/507 [00:02<00:11, 37.83it/s]
Loading 0: 18%|█▊ | 89/507 [00:02<00:12, 33.62it/s]
Loading 0: 19%|█▉ | 96/507 [00:02<00:10, 39.27it/s]
Loading 0: 20%|█▉ | 101/507 [00:02<00:10, 37.93it/s]
Loading 0: 21%|██ | 106/507 [00:03<00:10, 38.11it/s]
Loading 0: 22%|██▏ | 110/507 [00:03<00:10, 37.90it/s]
Loading 0: 22%|██▏ | 114/507 [00:03<00:14, 26.22it/s]
Loading 0: 23%|██▎ | 118/507 [00:03<00:14, 26.12it/s]
Loading 0: 24%|██▍ | 122/507 [00:03<00:14, 26.17it/s]
Loading 0: 25%|██▌ | 129/507 [00:03<00:11, 33.59it/s]
Loading 0: 26%|██▌ | 133/507 [00:03<00:11, 33.20it/s]
Loading 0: 27%|██▋ | 138/507 [00:04<00:10, 36.53it/s]
Loading 0: 28%|██▊ | 142/507 [00:04<00:10, 35.86it/s]
Loading 0: 29%|██▉ | 147/507 [00:04<00:09, 38.75it/s]
Loading 0: 30%|██▉ | 152/507 [00:04<00:09, 37.87it/s]
Loading 0: 31%|███ | 157/507 [00:04<00:09, 38.56it/s]
Loading 0: 32%|███▏ | 162/507 [00:04<00:08, 40.92it/s]
Loading 0: 33%|███▎ | 167/507 [00:04<00:08, 42.39it/s]
Loading 0: 34%|███▍ | 172/507 [00:05<00:12, 26.03it/s]
Loading 0: 35%|███▍ | 176/507 [00:05<00:12, 26.47it/s]
Loading 0: 36%|███▌ | 183/507 [00:05<00:09, 34.06it/s]
Loading 0: 37%|███▋ | 188/507 [00:05<00:09, 35.23it/s]
Loading 0: 38%|███▊ | 193/507 [00:05<00:08, 36.77it/s]
Loading 0: 39%|███▉ | 198/507 [00:05<00:08, 38.56it/s]
Loading 0: 40%|████ | 203/507 [00:05<00:09, 31.83it/s]
Loading 0: 41%|████▏ | 210/507 [00:06<00:07, 38.38it/s]
Loading 0: 42%|████▏ | 215/507 [00:06<00:07, 38.74it/s]
Loading 0: 43%|████▎ | 220/507 [00:06<00:08, 34.53it/s]
Loading 0: 44%|████▍ | 224/507 [00:06<00:10, 26.57it/s]
Loading 0: 45%|████▌ | 230/507 [00:06<00:09, 28.08it/s]
Loading 0: 47%|████▋ | 237/507 [00:06<00:07, 34.60it/s]
Loading 0: 48%|████▊ | 241/507 [00:07<00:07, 34.24it/s]
Loading 0: 49%|████▊ | 246/507 [00:07<00:07, 36.00it/s]
Loading 0: 49%|████▉ | 250/507 [00:07<00:07, 34.35it/s]
Loading 0: 50%|█████ | 255/507 [00:07<00:06, 36.12it/s]
Loading 0: 51%|█████ | 259/507 [00:07<00:07, 34.56it/s]
Loading 0: 52%|█████▏ | 264/507 [00:07<00:06, 37.39it/s]
Loading 0: 53%|█████▎ | 268/507 [00:07<00:06, 36.25it/s]
Loading 0: 54%|█████▍ | 273/507 [00:07<00:06, 37.65it/s]
Loading 0: 55%|█████▍ | 277/507 [00:08<00:06, 34.30it/s]
Loading 0: 55%|█████▌ | 281/507 [00:08<00:06, 35.48it/s]
Loading 0: 56%|█████▌ | 285/507 [00:08<00:09, 23.74it/s]
Loading 0: 57%|█████▋ | 288/507 [00:08<00:09, 22.88it/s]
Loading 0: 58%|█████▊ | 293/507 [00:08<00:08, 24.36it/s]
Loading 0: 59%|█████▉ | 299/507 [00:23<00:08, 24.36it/s]
Loading 0: 59%|█████▉ | 300/507 [00:23<03:07, 1.10it/s]
Loading 0: 60%|█████▉ | 302/507 [00:23<02:41, 1.27it/s]
Loading 0: 61%|██████ | 307/507 [00:23<01:45, 1.90it/s]
Loading 0: 61%|██████ | 310/507 [00:24<01:22, 2.38it/s]
Loading 0: 62%|██████▏ | 313/507 [00:24<01:03, 3.07it/s]
Loading 0: 63%|██████▎ | 318/507 [00:24<00:40, 4.68it/s]
Loading 0: 64%|██████▎ | 322/507 [00:24<00:29, 6.29it/s]
Loading 0: 64%|██████▍ | 327/507 [00:24<00:20, 8.99it/s]
Loading 0: 65%|██████▌ | 331/507 [00:24<00:15, 11.32it/s]
Loading 0: 66%|██████▌ | 335/507 [00:24<00:12, 13.94it/s]
Loading 0: 67%|██████▋ | 340/507 [00:25<00:10, 15.59it/s]
Loading 0: 68%|██████▊ | 344/507 [00:25<00:08, 18.12it/s]
Loading 0: 69%|██████▊ | 348/507 [00:25<00:08, 19.48it/s]
Loading 0: 70%|██████▉ | 354/507 [00:25<00:05, 25.51it/s]
Loading 0: 71%|███████ | 358/507 [00:25<00:05, 27.26it/s]
Loading 0: 72%|███████▏ | 363/507 [00:25<00:04, 31.66it/s]
Loading 0: 72%|███████▏ | 367/507 [00:25<00:04, 32.44it/s]
Loading 0: 73%|███████▎ | 372/507 [00:25<00:03, 36.04it/s]
Loading 0: 74%|███████▍ | 377/507 [00:26<00:03, 37.26it/s]
Loading 0: 75%|███████▌ | 382/507 [00:26<00:03, 38.42it/s]
Loading 0: 76%|███████▋ | 387/507 [00:26<00:02, 41.09it/s]
Loading 0: 77%|███████▋ | 392/507 [00:26<00:03, 35.85it/s]
Loading 0: 78%|███████▊ | 396/507 [00:26<00:04, 26.20it/s]
Loading 0: 79%|███████▉ | 401/507 [00:26<00:03, 27.71it/s]
Loading 0: 80%|████████ | 408/507 [00:26<00:02, 34.53it/s]
Loading 0: 81%|████████▏ | 412/507 [00:27<00:02, 34.44it/s]
Loading 0: 82%|████████▏ | 417/507 [00:27<00:02, 36.53it/s]
Loading 0: 83%|████████▎ | 421/507 [00:27<00:02, 36.04it/s]
Loading 0: 84%|████████▍ | 426/507 [00:27<00:02, 39.06it/s]
Loading 0: 85%|████████▌ | 431/507 [00:27<00:01, 38.76it/s]
Loading 0: 86%|████████▌ | 436/507 [00:27<00:01, 39.56it/s]
Loading 0: 87%|████████▋ | 441/507 [00:27<00:01, 41.97it/s]
Loading 0: 88%|████████▊ | 446/507 [00:27<00:01, 35.77it/s]
Loading 0: 90%|████████▉ | 454/507 [00:28<00:01, 42.51it/s]
Loading 0: 91%|█████████ | 459/507 [00:30<00:06, 7.02it/s]
Loading 0: 92%|█████████▏| 465/507 [00:30<00:04, 9.37it/s]
Loading 0: 93%|█████████▎| 472/507 [00:30<00:02, 13.11it/s]
Loading 0: 94%|█████████▍| 476/507 [00:30<00:02, 15.02it/s]
Loading 0: 95%|█████████▍| 481/507 [00:30<00:01, 18.51it/s]
Loading 0: 96%|█████████▌| 485/507 [00:31<00:01, 20.82it/s]
Loading 0: 97%|█████████▋| 490/507 [00:31<00:00, 24.87it/s]
Loading 0: 97%|█████████▋| 494/507 [00:31<00:00, 26.53it/s]
Loading 0: 98%|█████████▊| 499/507 [00:31<00:00, 30.06it/s]
Loading 0: 99%|█████████▉| 503/507 [00:31<00:00, 31.27it/s]
Job chaiml-espresso-story-24-3275-v4-mkmlizer completed after 135.05s with status: succeeded
Stopping job with name chaiml-espresso-story-24-3275-v4-mkmlizer
Pipeline stage MKMLizer completed in 136.37s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.16s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-espresso-story-24-3275-v4
Waiting for inference service chaiml-espresso-story-24-3275-v4 to be ready
Connection pool is full, discarding connection: %s. Connection pool size: %s
Inference service chaiml-espresso-story-24-3275-v4 ready after 183.93907380104065s
Pipeline stage MKMLDeployer completed in 185.46s
run pipeline stage %s
Running pipeline stage StressChecker
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 2.731417417526245s
Received healthy response to inference request in 2.5577433109283447s
Received healthy response to inference request in 2.3798811435699463s
Received healthy response to inference request in 2.6258344650268555s
Received healthy response to inference request in 2.563875913619995s
5 requests
0 failed requests
5th percentile: 2.415453577041626
10th percentile: 2.4510260105133055
20th percentile: 2.522170877456665
30th percentile: 2.558969831466675
40th percentile: 2.561422872543335
50th percentile: 2.563875913619995
60th percentile: 2.588659334182739
70th percentile: 2.6134427547454835
80th percentile: 2.6469510555267335
90th percentile: 2.689184236526489
95th percentile: 2.710300827026367
99th percentile: 2.7271940994262693
mean time: 2.571750450134277
Pipeline stage StressChecker completed in 14.25s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.46s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 2.34s
Shutdown handler de-registered
chaiml-espresso-story-24_3275_v4 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Skipping teardown as no inference service was successfully deployed
Pipeline stage MKMLProfilerDeleter completed in 0.11s
run pipeline stage %s
Running pipeline stage MKMLProfilerTemplater
Pipeline stage MKMLProfilerTemplater completed in 0.10s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeployer
Creating inference service chaiml-espresso-story-24-3275-v4-profiler
Waiting for inference service chaiml-espresso-story-24-3275-v4-profiler to be ready
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3149.76s
Shutdown handler de-registered
chaiml-espresso-story-24_3275_v4 status is now inactive due to auto deactivation removed underperforming models