Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer
Waiting for job on chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer to finish
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ _____ __ __ ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ /___/ ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ Version: 0.11.12 ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ https://mk1.ai ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ belonging to: ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ Chai Research Corp. ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: Traceback (most recent call last):
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/file_download.py", line 513, in http_get
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: hf_transfer.download(
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: Exception: Failed too many failures in parallel (3): PyErr { type: <class 'Exception'>, value: Exception('Error while downloading: reqwest::Error { kind: Status(503), url: Url { scheme: "https", cannot_be_a_base: false, username: "", password: None, host: Some(Domain("cdn-lfs-us-1.hf.co")), port: None, path: "/repos/fb/c6/49806f7811883b5f66fef/a7bccb841eec89231850ecb70d793b7a4a25bf1146d86c95aa2ff0d4b3076681", query: Some("response-content-disposition=inline%3B+filename*%3DUTF-8%27%27model-00004-of-00005.safetensors%3B+filename%3D%22model-00004-of-00005.safetensors%22%3B&Expires=1729289094&Policy=eyJTdGF0ZW1lbnQiOlt7IkNvbmRpdGlvbiI6eyJEYXRlTGVzc1RoYW4iOnsiQVdTOkVwb2NoVGltZSI6MTcyOTI4OTA5NH19LCJSZXNvdXJjZSI6Imh0dHBzOi8vY2RuLWxmcy11cy0xLmhmLmNvL3JlcG9zL2ZiL2M2L2ZiYzZhYjg2MTIzZGU2NWJhMmMxNzhlZTBjYzE0NTFkMzc5M2UxMjcyMTk0OTgwNmY3ODExODgzYjVmNjZmZWYvYTdiY2NiODQxZWVjODkyMzE4NTBlY2I3MGQ3OTNiN2E0YTI1YmYxMTQ2ZDg2Yzk1YWEyZmYwZDRiMzA3NjY4MT9yZXNwb25zZS1jb250ZW50LWRpc3Bvc2l0aW9uPSoifV19&Signature=PZlal1putklhoEywrZjBST6rNwiMQXQ9wq%7EpolLBzZbhpMnZtqrX4qUBbU7FhC9-JqL5hymg9sApiwklnDv4tccXaKEXbB-pyRyX4GnHiMNjKs73pbWCTG96G0VipNsEsUOXYhz-7dvhhAm1f4y1H9U3cZCa0LS2df0gBiEcsXJ6LC2HXj0g4XdzOXHYwdSSnOvbSJSNPENcK3TpZ1hbCzpcRsbdWwa1ttT6EQ5HuIV8mmHMxBG0OKktHMqNh-S8NeglOT%7E6NZOqIrYBfXdkoVZuDUh8P8NMIda7GzbGPUl2ey2os6c0l8XqnMLLWJRS9G2YF0hgiKgf6hmaXpXpJg__&Key-Pair-Id=K24J24Z295AEI9"), fragment: None } }'), traceback: None } (NoPermits)
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: The above exception was the direct cause of the following exception:
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: Traceback (most recent call last):
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: File "/code/uploading/mkmlize.py", line 151, in <module>
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: cli()
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/click/core.py", line 1157, in __call__
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: return self.main(*args, **kwargs)
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/click/core.py", line 1078, in main
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: rv = self.invoke(ctx)
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/click/core.py", line 1688, in invoke
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: return _process_result(sub_ctx.command.invoke(sub_ctx))
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/click/core.py", line 1434, in invoke
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: return ctx.invoke(self.callback, **ctx.params)
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/click/core.py", line 783, in invoke
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: return __callback(*args, **kwargs)
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: File "/code/uploading/mkmlize.py", line 38, in quantize
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: temp_folder = download_to_shared_memory(repo_id, revision, hf_auth_token)
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: File "/code/uploading/mkmlize.py", line 65, in download_to_shared_memory
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: snapshot_download(
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/utils/_validators.py", line 114, in _inner_fn
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: return fn(*args, **kwargs)
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/_snapshot_download.py", line 292, in snapshot_download
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: _inner_hf_hub_download(file)
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/_snapshot_download.py", line 268, in _inner_hf_hub_download
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: return hf_hub_download(
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/utils/_validators.py", line 114, in _inner_fn
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: return fn(*args, **kwargs)
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/file_download.py", line 1202, in hf_hub_download
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: return _hf_hub_download_to_local_dir(
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/file_download.py", line 1487, in _hf_hub_download_to_local_dir
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: _download_to_tmp_and_move(
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/file_download.py", line 1884, in _download_to_tmp_and_move
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: http_get(
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/file_download.py", line 524, in http_get
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: raise RuntimeError(
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: RuntimeError: An error occurred while downloading using `hf_transfer`. Consider disabling HF_HUB_ENABLE_HF_TRANSFER for better error handling.
Job chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer completed after 64.79s with status: failed
Stopping job with name chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer
%s, retrying in %s seconds...
Starting job with name chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer
Waiting for job on chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer to finish
Failed to get response for submission chaiml-nemo-20241010-tie_5991_v2: ('http://chaiml-nemo-20241010-tie-5991-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'read tcp 127.0.0.1:52688->127.0.0.1:8080: read: connection reset by peer\n')
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ _____ __ __ ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ /___/ ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ Version: 0.11.12 ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ https://mk1.ai ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ belonging to: ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ Chai Research Corp. ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ║ ║
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: Downloaded to shared memory in 30.934s
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpju11q_7p, device:0
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: quantized model in 37.200s
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: Processed model ChaiML/Viral-ss-v3-12b1e5-dpo in 68.133s
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: creating bucket guanaco-mkml-models
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-viral-ss-v3-12b1e5-dpo-v1
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-viral-ss-v3-12b1e5-dpo-v1/config.json
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-viral-ss-v3-12b1e5-dpo-v1/special_tokens_map.json
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-viral-ss-v3-12b1e5-dpo-v1/tokenizer_config.json
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-viral-ss-v3-12b1e5-dpo-v1/tokenizer.json
chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%|▏ | 5/363 [00:00<00:13, 27.12it/s]
Loading 0: 4%|▎ | 13/363 [00:00<00:07, 49.27it/s]
Loading 0: 5%|▌ | 19/363 [00:00<00:07, 45.46it/s]
Loading 0: 7%|▋ | 24/363 [00:00<00:08, 41.65it/s]
Loading 0: 8%|▊ | 30/363 [00:00<00:07, 46.35it/s]
Loading 0: 10%|▉ | 35/363 [00:00<00:07, 44.36it/s]
Loading 0: 11%|█ | 40/363 [00:00<00:07, 43.35it/s]
Loading 0: 12%|█▏ | 45/363 [00:01<00:07, 44.39it/s]
Loading 0: 14%|█▍ | 50/363 [00:01<00:08, 37.28it/s]
Loading 0: 16%|█▌ | 58/363 [00:01<00:06, 47.60it/s]
Loading 0: 18%|█▊ | 64/363 [00:01<00:10, 29.70it/s]
Loading 0: 20%|█▉ | 72/363 [00:01<00:07, 37.87it/s]
Loading 0: 21%|██▏ | 78/363 [00:01<00:07, 40.17it/s]
Loading 0: 23%|██▎ | 83/363 [00:02<00:07, 39.61it/s]
Loading 0: 25%|██▍ | 90/363 [00:02<00:06, 44.50it/s]
Loading 0: 26%|██▌ | 95/363 [00:02<00:06, 44.06it/s]
Loading 0: 28%|██▊ | 100/363 [00:02<00:06, 38.62it/s]
Loading 0: 30%|██▉ | 108/363 [00:02<00:05, 47.66it/s]
Loading 0: 31%|███▏ | 114/363 [00:02<00:05, 44.18it/s]
Loading 0: 33%|███▎ | 119/363 [00:02<00:05, 43.95it/s]
Loading 0: 35%|███▍ | 126/363 [00:02<00:04, 49.29it/s]
Loading 0: 36%|███▋ | 132/363 [00:03<00:04, 47.94it/s]
Loading 0: 38%|███▊ | 137/363 [00:03<00:04, 46.78it/s]
Loading 0: 39%|███▉ | 142/363 [00:03<00:06, 33.76it/s]
Loading 0: 40%|████ | 146/363 [00:03<00:06, 34.46it/s]
Loading 0: 41%|████▏ | 150/363 [00:03<00:06, 34.43it/s]
Loading 0: 43%|████▎ | 157/363 [00:03<00:04, 41.53it/s]
Loading 0: 45%|████▍ | 163/363 [00:03<00:04, 42.34it/s]
Loading 0: 46%|████▋ | 168/363 [00:04<00:04, 43.12it/s]
Loading 0: 48%|████▊ | 175/363 [00:04<00:03, 48.78it/s]
Loading 0: 50%|████▉ | 181/363 [00:04<00:03, 47.04it/s]
Loading 0: 51%|█████ | 186/363 [00:04<00:03, 45.33it/s]
Loading 0: 53%|█████▎ | 193/363 [00:04<00:03, 50.13it/s]
Loading 0: 55%|█████▍ | 199/363 [00:04<00:03, 45.99it/s]
Loading 0: 56%|█████▌ | 204/363 [00:04<00:03, 43.35it/s]
Loading 0: 58%|█████▊ | 211/363 [00:04<00:03, 48.18it/s]
Loading 0: 60%|█████▉ | 217/363 [00:05<00:03, 47.13it/s]
Loading 0: 61%|██████▏ | 223/363 [00:05<00:03, 35.20it/s]
Loading 0: 63%|██████▎ | 228/363 [00:05<00:03, 35.23it/s]
Loading 0: 64%|██████▍ | 233/363 [00:05<00:03, 38.03it/s]
Loading 0: 66%|██████▌ | 239/363 [00:05<00:03, 37.59it/s]
Loading 0: 68%|██████▊ | 246/363 [00:05<00:02, 44.56it/s]
Loading 0: 69%|██████▉ | 251/363 [00:05<00:02, 43.71it/s]
Loading 0: 71%|███████ | 256/363 [00:06<00:02, 44.26it/s]
Loading 0: 72%|███████▏ | 262/363 [00:06<00:02, 44.31it/s]
Loading 0: 74%|███████▎ | 267/363 [00:06<00:02, 44.01it/s]
Loading 0: 75%|███████▌ | 273/363 [00:06<00:01, 47.88it/s]
Loading 0: 77%|███████▋ | 278/363 [00:06<00:01, 47.48it/s]
Loading 0: 78%|███████▊ | 283/363 [00:06<00:01, 46.62it/s]
Loading 0: 79%|███████▉ | 288/363 [00:06<00:01, 45.97it/s]
Loading 0: 81%|████████ | 293/363 [00:06<00:01, 39.58it/s]
Loading 0: 83%|████████▎ | 301/363 [00:07<00:01, 49.33it/s]
Loading 0: 85%|████████▍ | 307/363 [00:14<00:21, 2.60it/s]
Loading 0: 86%|████████▌ | 312/363 [00:14<00:14, 3.46it/s]
Loading 0: 88%|████████▊ | 320/363 [00:14<00:07, 5.39it/s]
Loading 0: 90%|████████▉ | 326/363 [00:14<00:05, 7.20it/s]
Loading 0: 91%|█████████ | 331/363 [00:14<00:03, 9.09it/s]
Loading 0: 93%|█████████▎| 338/363 [00:15<00:01, 12.78it/s]
Loading 0: 95%|█████████▍| 344/363 [00:15<00:01, 15.86it/s]
Loading 0: 96%|█████████▌| 349/363 [00:15<00:00, 18.88it/s]
Loading 0: 98%|█████████▊| 356/363 [00:15<00:00, 24.85it/s]
Loading 0: 100%|█████████▉| 362/363 [00:15<00:00, 28.39it/s]
Job chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer completed after 131.35s with status: succeeded
Stopping job with name chaiml-viral-ss-v3-12b1e5-dpo-v1-mkmlizer
Pipeline stage MKMLizer completed in 197.12s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.18s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-viral-ss-v3-12b1e5-dpo-v1
Waiting for inference service chaiml-viral-ss-v3-12b1e5-dpo-v1 to be ready
Inference service chaiml-viral-ss-v3-12b1e5-dpo-v1 ready after 150.55304074287415s
Pipeline stage MKMLDeployer completed in 151.10s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.318702459335327s
Received healthy response to inference request in 1.727236032485962s
Received healthy response to inference request in 1.8647210597991943s
Received healthy response to inference request in 1.7718937397003174s
Received healthy response to inference request in 1.4736218452453613s
5 requests
0 failed requests
5th percentile: 1.5243446826934814
10th percentile: 1.5750675201416016
20th percentile: 1.6765131950378418
30th percentile: 1.736167573928833
40th percentile: 1.7540306568145752
50th percentile: 1.7718937397003174
60th percentile: 1.8090246677398683
70th percentile: 1.846155595779419
80th percentile: 1.955517339706421
90th percentile: 2.137109899520874
95th percentile: 2.2279061794281003
99th percentile: 2.300543203353882
mean time: 1.8312350273132325
Pipeline stage StressChecker completed in 10.94s
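
The percentile and mean figures reported by the StressChecker stage above can be reproduced from the five logged response times. The log does not show how StressChecker computes them, so the following is only a minimal sketch under the assumption that it uses linear interpolation between closest ranks (the default behaviour of `numpy.percentile`). The helper name `percentile` is hypothetical and is not taken from the pipeline code.

```python
def percentile(samples, q):
    """Return the q-th percentile (0-100) of samples.

    Assumption: linear interpolation between the two closest ranks,
    which is what the logged figures appear to be consistent with.
    """
    ordered = sorted(samples)
    rank = (len(ordered) - 1) * q / 100.0   # fractional index into the sorted list
    lower = int(rank)                        # index of the sample at or below the rank
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower                      # how far the rank sits between the two samples
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


# Response times in seconds, copied from the "Received healthy response" lines.
latencies = [
    2.318702459335327,
    1.727236032485962,
    1.8647210597991943,
    1.7718937397003174,
    1.4736218452453613,
]

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")

mean_latency = sum(latencies) / len(latencies)
print(f"mean time: {mean_latency}")
```

With only five samples, every percentile is an interpolation between two neighbouring measurements, so the high percentiles (95th, 99th) are driven almost entirely by the single slowest request (2.32 s).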
Shutdown handler de-registered
chaiml-viral-ss-v3-12b1e5-dpo_v1 status is now deployed due to DeploymentManager action
chaiml-viral-ss-v3-12b1e5-dpo_v1 status is now inactive due to auto deactivation removed underperforming models
chaiml-viral-ss-v3-12b1e5-dpo_v1 status is now torndown due to DeploymentManager action