submission_id: turboderp-cat-llama-3-70_8684_v1
developer_uid: Meliodia
best_of: 4
celo_rating: 1178.64
display_name: turboderp-cat-llama-3-70_8684_v1
family_friendly_score: 0.0
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 512, 'best_of': 4, 'max_output_tokens': 64}
is_internal_developer: True
language_model: turboderp/Cat-Llama-3-70B-instruct
max_input_tokens: 512
max_output_tokens: 64
model_architecture: LlamaForCausalLM
model_eval_status: success
model_group: turboderp/Cat-Llama-3-70
model_name: turboderp-cat-llama-3-70_8684_v1
model_num_parameters: 70553739264.0
model_repo: turboderp/Cat-Llama-3-70B-instruct
model_size: 71B
num_battles: 13876
num_wins: 6876
ranking_group: single
reward_formatter: {'bot_template': '{bot_name}: {message}\n', 'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'response_template': '{bot_name}:', 'truncate_by_message': False, 'user_template': '{user_name}: {message}\n'}
reward_repo: ChaiML/reward_gpt2_medium_preference_24m_e2
status: torndown
submission_type: basic
timestamp: 2024-06-09T03:49:19+00:00
us_pacific_date: 2024-06-08
win_ratio: 0.49553185356010376
Resubmit model
Running pipeline stage MKMLizer
Starting job with name turboderp-cat-llama-3-70-8684-v1-mkmlizer
Waiting for job on turboderp-cat-llama-3-70-8684-v1-mkmlizer to finish
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ _____ __ __ ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ /___/ ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ Version: 0.8.14 ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ https://mk1.ai ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ The license key for the current software has been verified as ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ belonging to: ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ Chai Research Corp. ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ Expiration: 2024-07-15 23:59:59 ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
turboderp-cat-llama-3-70-8684-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/huggingface_hub/utils/_deprecation.py:131: FutureWarning: 'list_files_info' (from 'huggingface_hub.hf_api') is deprecated and will be removed from version '0.23'. Use `list_repo_tree` and `get_paths_info` instead.
turboderp-cat-llama-3-70-8684-v1-mkmlizer: warnings.warn(warning_message, FutureWarning)
turboderp-cat-llama-3-70-8684-v1-mkmlizer: Traceback (most recent call last):
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/urllib3/connection.py", line 174, in _new_conn
turboderp-cat-llama-3-70-8684-v1-mkmlizer: conn = connection.create_connection(
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/urllib3/util/connection.py", line 72, in create_connection
turboderp-cat-llama-3-70-8684-v1-mkmlizer: for res in socket.getaddrinfo(host, port, family, socket.SOCK_STREAM):
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/socket.py", line 955, in getaddrinfo
turboderp-cat-llama-3-70-8684-v1-mkmlizer: for res in _socket.getaddrinfo(host, port, family, type, proto, flags):
turboderp-cat-llama-3-70-8684-v1-mkmlizer: socket.gaierror: [Errno -3] Temporary failure in name resolution
turboderp-cat-llama-3-70-8684-v1-mkmlizer: During handling of the above exception, another exception occurred:
turboderp-cat-llama-3-70-8684-v1-mkmlizer: Traceback (most recent call last):
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/urllib3/connectionpool.py", line 714, in urlopen
turboderp-cat-llama-3-70-8684-v1-mkmlizer: httplib_response = self._make_request(
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/urllib3/connectionpool.py", line 403, in _make_request
turboderp-cat-llama-3-70-8684-v1-mkmlizer: self._validate_conn(conn)
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/urllib3/connectionpool.py", line 1053, in _validate_conn
turboderp-cat-llama-3-70-8684-v1-mkmlizer: conn.connect()
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/urllib3/connection.py", line 363, in connect
turboderp-cat-llama-3-70-8684-v1-mkmlizer: self.sock = conn = self._new_conn()
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/urllib3/connection.py", line 186, in _new_conn
turboderp-cat-llama-3-70-8684-v1-mkmlizer: raise NewConnectionError(
turboderp-cat-llama-3-70-8684-v1-mkmlizer: urllib3.exceptions.NewConnectionError: <urllib3.connection.HTTPSConnection object at 0x7f80381b6770>: Failed to establish a new connection: [Errno -3] Temporary failure in name resolution
turboderp-cat-llama-3-70-8684-v1-mkmlizer: During handling of the above exception, another exception occurred:
turboderp-cat-llama-3-70-8684-v1-mkmlizer: Traceback (most recent call last):
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/requests/adapters.py", line 486, in send
turboderp-cat-llama-3-70-8684-v1-mkmlizer: resp = conn.urlopen(
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/urllib3/connectionpool.py", line 798, in urlopen
turboderp-cat-llama-3-70-8684-v1-mkmlizer: retries = retries.increment(
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/urllib3/util/retry.py", line 592, in increment
turboderp-cat-llama-3-70-8684-v1-mkmlizer: raise MaxRetryError(_pool, url, error or ResponseError(cause))
turboderp-cat-llama-3-70-8684-v1-mkmlizer: urllib3.exceptions.MaxRetryError: HTTPSConnectionPool(host='cdn-lfs-us-1.huggingface.co', port=443): Max retries exceeded with url: /repos/a5/d9/8096b7c63a1ee593fd9d5/6b7c5415b9f93d644d5d4240ffb12b90cd58b33452f202f3044bed58a31462ba?response-content-disposition=inline%3B+filename*%3DUTF-8%27%27model-00023-of-00030.safetensors%3B+filename%3D%22model-00023-of-00030.safetensors%22%3B&Expires=1718164607&Policy=eyJTdGF0ZW1lbnQiOlt7IkNvbmRpdGlvbiI6eyJEYXRlTGVzc1RoYW4iOnsiQVdTOkVwb2NoVGltZSI6MTcxODE2NDYwN319LCJSZXNvdXJjZSI6Imh0dHBzOi8vY2RuLWxmcy11cy0xLmh1Z2dpbmdmYWNlLmNvL3JlcG9zL2E1L2Q5L2E1ZDk1ODdjYzg2N2FmMGNmMTU5ZmIxZmQ4MjgyZjRlOTNkYTEyZDE2MzA4MDk2YjdjNjNhMWVlNTkzZmQ5ZDUvNmI3YzU0MTViOWY5M2Q2NDRkNWQ0MjQwZmZiMTJiOTBjZDU4YjMzNDUyZjIwMmYzMDQ0YmVkNThhMzE0NjJiYT9yZXNwb25zZS1jb250ZW50LWRpc3Bvc2l0aW9uPSoifV19&Signature=kloGXexSzc317GVugGRgUreA4BPDr0UFLZEcfWy9GTjg84fZJTVfzdFZomCYz2twXMjl4D2mqGZW-WJicpYPVuLUfCOUOAYeZryU0pvEYmWic5u0MjHUkg0xBjZKFkaV8fFzl6CxnCaa1NB~pFltviPgfWMdjWK-WAZx23GqILjn~psTUzWeQAczJ3~vKv~1Y35dk5CKmAdLvs3pyI15FU9rsV1aG0Xk0iD7yK-4pj0J0n-M7GbVvyb7kK1Fjwxq~Lqvpl3cpgjYI1BWQTgyCbVnsAj5LPFDdWedzHMA-uAFKLpbTf6BgiLCGr-NBUmHUuVzEhL-4sOMcalJxwxSIA__&Key-Pair-Id=KCD77M1F0VK2B (Caused by NewConnectionError('<urllib3.connection.HTTPSConnection object at 0x7f80381b6770>: Failed to establish a new connection: [Errno -3] Temporary failure in name resolution'))
turboderp-cat-llama-3-70-8684-v1-mkmlizer: During handling of the above exception, another exception occurred:
turboderp-cat-llama-3-70-8684-v1-mkmlizer: Traceback (most recent call last):
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/code/uploading/mkmlize.py", line 151, in <module>
turboderp-cat-llama-3-70-8684-v1-mkmlizer: cli()
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/click/core.py", line 1128, in __call__
turboderp-cat-llama-3-70-8684-v1-mkmlizer: return self.main(*args, **kwargs)
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/click/core.py", line 1053, in main
turboderp-cat-llama-3-70-8684-v1-mkmlizer: rv = self.invoke(ctx)
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/click/core.py", line 1659, in invoke
turboderp-cat-llama-3-70-8684-v1-mkmlizer: return _process_result(sub_ctx.command.invoke(sub_ctx))
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/click/core.py", line 1395, in invoke
turboderp-cat-llama-3-70-8684-v1-mkmlizer: return ctx.invoke(self.callback, **ctx.params)
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/click/core.py", line 754, in invoke
turboderp-cat-llama-3-70-8684-v1-mkmlizer: return __callback(*args, **kwargs)
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/code/uploading/mkmlize.py", line 38, in quantize
turboderp-cat-llama-3-70-8684-v1-mkmlizer: temp_folder = download_to_shared_memory(repo_id, revision, hf_auth_token)
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/code/uploading/mkmlize.py", line 65, in download_to_shared_memory
turboderp-cat-llama-3-70-8684-v1-mkmlizer: snapshot_download(
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/utils/_validators.py", line 119, in _inner_fn
turboderp-cat-llama-3-70-8684-v1-mkmlizer: return fn(*args, **kwargs)
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/_snapshot_download.py", line 314, in snapshot_download
turboderp-cat-llama-3-70-8684-v1-mkmlizer: _inner_hf_hub_download(file)
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/_snapshot_download.py", line 290, in _inner_hf_hub_download
turboderp-cat-llama-3-70-8684-v1-mkmlizer: return hf_hub_download(
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/utils/_validators.py", line 119, in _inner_fn
turboderp-cat-llama-3-70-8684-v1-mkmlizer: return fn(*args, **kwargs)
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/file_download.py", line 1492, in hf_hub_download
turboderp-cat-llama-3-70-8684-v1-mkmlizer: http_get(
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/file_download.py", line 456, in http_get
turboderp-cat-llama-3-70-8684-v1-mkmlizer: r = _request_wrapper(
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/file_download.py", line 392, in _request_wrapper
turboderp-cat-llama-3-70-8684-v1-mkmlizer: response = get_session().request(method=method, url=url, **params)
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/requests/sessions.py", line 589, in request
turboderp-cat-llama-3-70-8684-v1-mkmlizer: resp = self.send(prep, **send_kwargs)
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/requests/sessions.py", line 703, in send
turboderp-cat-llama-3-70-8684-v1-mkmlizer: r = adapter.send(request, **kwargs)
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/utils/_http.py", line 68, in send
turboderp-cat-llama-3-70-8684-v1-mkmlizer: return super().send(request, *args, **kwargs)
turboderp-cat-llama-3-70-8684-v1-mkmlizer: File "/opt/conda/lib/python3.10/site-packages/requests/adapters.py", line 519, in send
turboderp-cat-llama-3-70-8684-v1-mkmlizer: raise ConnectionError(e, request=request)
turboderp-cat-llama-3-70-8684-v1-mkmlizer: requests.exceptions.ConnectionError: (MaxRetryError("HTTPSConnectionPool(host='cdn-lfs-us-1.huggingface.co', port=443): Max retries exceeded with url: /repos/a5/d9/8096b7c63a1ee593fd9d5/6b7c5415b9f93d644d5d4240ffb12b90cd58b33452f202f3044bed58a31462ba?response-content-disposition=inline%3B+filename*%3DUTF-8%27%27model-00023-of-00030.safetensors%3B+filename%3D%22model-00023-of-00030.safetensors%22%3B&Expires=1718164607&Policy=eyJTdGF0ZW1lbnQiOlt7IkNvbmRpdGlvbiI6eyJEYXRlTGVzc1RoYW4iOnsiQVdTOkVwb2NoVGltZSI6MTcxODE2NDYwN319LCJSZXNvdXJjZSI6Imh0dHBzOi8vY2RuLWxmcy11cy0xLmh1Z2dpbmdmYWNlLmNvL3JlcG9zL2E1L2Q5L2E1ZDk1ODdjYzg2N2FmMGNmMTU5ZmIxZmQ4MjgyZjRlOTNkYTEyZDE2MzA4MDk2YjdjNjNhMWVlNTkzZmQ5ZDUvNmI3YzU0MTViOWY5M2Q2NDRkNWQ0MjQwZmZiMTJiOTBjZDU4YjMzNDUyZjIwMmYzMDQ0YmVkNThhMzE0NjJiYT9yZXNwb25zZS1jb250ZW50LWRpc3Bvc2l0aW9uPSoifV19&Signature=kloGXexSzc317GVugGRgUreA4BPDr0UFLZEcfWy9GTjg84fZJTVfzdFZomCYz2twXMjl4D2mqGZW-WJicpYPVuLUfCOUOAYeZryU0pvEYmWic5u0MjHUkg0xBjZKFkaV8fFzl6CxnCaa1NB~pFltviPgfWMdjWK-WAZx23GqILjn~psTUzWeQAczJ3~vKv~1Y35dk5CKmAdLvs3pyI15FU9rsV1aG0Xk0iD7yK-4pj0J0n-M7GbVvyb7kK1Fjwxq~Lqvpl3cpgjYI1BWQTgyCbVnsAj5LPFDdWedzHMA-uAFKLpbTf6BgiLCGr-NBUmHUuVzEhL-4sOMcalJxwxSIA__&Key-Pair-Id=KCD77M1F0VK2B (Caused by NewConnectionError('<urllib3.connection.HTTPSConnection object at 0x7f80381b6770>: Failed to establish a new connection: [Errno -3] Temporary failure in name resolution'))"), '(Request ID: f9c56b27-b3f2-411b-b388-5122ea3b2ada)')
Job turboderp-cat-llama-3-70-8684-v1-mkmlizer completed after 458.19s with status: failed
Stopping job with name turboderp-cat-llama-3-70-8684-v1-mkmlizer
%s, retrying in %s seconds...
Starting job with name turboderp-cat-llama-3-70-8684-v1-mkmlizer
Waiting for job on turboderp-cat-llama-3-70-8684-v1-mkmlizer to finish
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ _____ __ __ ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ /___/ ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ Version: 0.8.14 ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ https://mk1.ai ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ The license key for the current software has been verified as ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ belonging to: ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ Chai Research Corp. ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ Expiration: 2024-07-15 23:59:59 ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ║ ║
turboderp-cat-llama-3-70-8684-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
turboderp-cat-llama-3-70-8684-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/huggingface_hub/utils/_deprecation.py:131: FutureWarning: 'list_files_info' (from 'huggingface_hub.hf_api') is deprecated and will be removed from version '0.23'. Use `list_repo_tree` and `get_paths_info` instead.
turboderp-cat-llama-3-70-8684-v1-mkmlizer: warnings.warn(warning_message, FutureWarning)
turboderp-cat-llama-3-70-8684-v1-mkmlizer: Downloaded to shared memory in 187.039s
turboderp-cat-llama-3-70-8684-v1-mkmlizer: quantizing model to /dev/shm/model_cache
turboderp-cat-llama-3-70-8684-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
turboderp-cat-llama-3-70-8684-v1-mkmlizer: Loading 0: 0%| | 0/723 [00:00<?, ?it/s] Loading 0: 1%| | 4/723 [00:00<00:24, 29.83it/s] Loading 0: 2%|▏ | 11/723 [00:00<00:17, 40.61it/s] Loading 0: 2%|▏ | 16/723 [00:00<00:27, 25.81it/s] Loading 0: 3%|▎ | 19/723 [00:00<00:27, 25.84it/s] Loading 0: 3%|▎ | 22/723 [00:00<00:26, 26.11it/s] Loading 0: 4%|▍ | 30/723 [00:00<00:19, 35.35it/s] Loading 0: 5%|▍ | 34/723 [00:01<00:19, 35.31it/s] Loading 0: 6%|▌ | 42/723 [00:01<00:21, 31.00it/s] Loading 0: 6%|▋ | 46/723 [00:01<00:23, 28.32it/s] Loading 0: 7%|▋ | 49/723 [00:01<00:24, 28.06it/s] Loading 0: 8%|▊ | 57/723 [00:01<00:18, 35.56it/s] Loading 0: 8%|▊ | 61/723 [00:01<00:18, 35.46it/s] Loading 0: 9%|▉ | 68/723 [00:02<00:22, 29.77it/s] Loading 0: 10%|▉ | 72/723 [00:02<00:23, 28.00it/s] Loading 0: 11%|█ | 76/723 [00:02<00:22, 28.95it/s] Loading 0: 12%|█▏ | 84/723 [00:02<00:17, 36.08it/s] Loading 0: 12%|█▏ | 88/723 [00:02<00:17, 35.79it/s] Loading 0: 13%|█▎ | 92/723 [00:03<00:23, 26.57it/s] Loading 0: 13%|█▎ | 96/723 [00:03<00:24, 25.76it/s] Loading 0: 14%|█▍ | 103/723 [00:03<00:19, 31.81it/s] Loading 0: 15%|█▌ | 110/723 [00:03<00:16, 36.25it/s] Loading 0: 16%|█▌ | 116/723 [00:03<00:21, 28.41it/s] Loading 0: 17%|█▋ | 120/723 [00:03<00:20, 29.69it/s] Loading 0: 17%|█▋ | 124/723 [00:04<00:19, 30.49it/s] Loading 0: 17%|█▋ | 124/723 [00:22<00:19, 30.49it/s] Loading 0: 17%|█▋ | 125/723 [00:22<15:19, 1.54s/it] Loading 0: 18%|█▊ | 130/723 [00:22<09:45, 1.01it/s] Loading 0: 19%|█▉ | 137/723 [00:23<05:39, 1.72it/s] Loading 0: 20%|█▉ | 142/723 [00:23<04:05, 2.36it/s] Loading 0: 20%|██ | 146/723 [00:23<03:09, 3.04it/s] Loading 0: 21%|██ | 149/723 [00:23<02:35, 3.70it/s] Loading 0: 22%|██▏ | 157/723 [00:23<01:28, 6.38it/s] Loading 0: 23%|██▎ | 165/723 [00:24<00:56, 9.93it/s] Loading 0: 24%|██▎ | 170/723 [00:24<00:51, 10.75it/s] Loading 0: 24%|██▍ | 174/723 [00:24<00:42, 12.81it/s] Loading 0: 25%|██▍ | 178/723 [00:24<00:35, 15.19it/s] Loading 0: 25%|██▌ | 184/723 [00:24<00:27, 19.50it/s] Loading 0: 27%|██▋ | 192/723 [00:24<00:19, 27.44it/s] Loading 0: 27%|██▋ | 197/723 [00:25<00:25, 20.85it/s] Loading 0: 28%|██▊ | 202/723 [00:25<00:21, 23.94it/s] Loading 0: 29%|██▉ | 210/723 [00:25<00:16, 30.70it/s] Loading 0: 30%|██▉ | 215/723 [00:25<00:15, 32.33it/s] Loading 0: 30%|███ | 220/723 [00:26<00:21, 23.54it/s] Loading 0: 32%|███▏ | 228/723 [00:26<00:16, 29.97it/s] Loading 0: 32%|███▏ | 232/723 [00:26<00:15, 30.91it/s] Loading 0: 33%|███▎ | 237/723 [00:26<00:15, 32.35it/s] Loading 0: 33%|███▎ | 242/723 [00:26<00:17, 26.85it/s] Loading 0: 34%|███▍ | 246/723 [00:26<00:16, 28.45it/s] Loading 0: 35%|███▍ | 250/723 [00:26<00:15, 29.85it/s] Loading 0: 35%|███▌ | 256/723 [00:27<00:14, 33.30it/s] Loading 0: 36%|███▋ | 263/723 [00:27<00:12, 37.68it/s] Loading 0: 36%|███▋ | 263/723 [00:46<00:12, 37.68it/s] Loading 0: 37%|███▋ | 264/723 [00:46<10:35, 1.38s/it] Loading 0: 37%|███▋ | 268/723 [00:46<07:38, 1.01s/it] Loading 0: 38%|███▊ | 272/723 [00:46<05:29, 1.37it/s] Loading 0: 38%|███▊ | 275/723 [00:47<04:14, 1.76it/s] Loading 0: 39%|███▉ | 283/723 [00:47<02:15, 3.25it/s] Loading 0: 40%|████ | 291/723 [00:47<01:21, 5.30it/s] Loading 0: 41%|████ | 296/723 [00:47<01:07, 6.35it/s] Loading 0: 41%|████▏ | 300/723 [00:47<00:53, 7.86it/s] Loading 0: 42%|████▏ | 304/723 [00:47<00:43, 9.70it/s] Loading 0: 43%|████▎ | 310/723 [00:48<00:31, 13.27it/s] Loading 0: 44%|████▍ | 318/723 [00:48<00:20, 19.73it/s] Loading 0: 45%|████▍ | 323/723 [00:48<00:23, 17.01it/s] Loading 0: 45%|████▌ | 328/723 [00:48<00:19, 20.18it/s] Loading 0: 46%|████▋ | 336/723 [00:48<00:14, 26.73it/s] Loading 0: 47%|████▋ | 341/723 [00:49<00:13, 29.17it/s] Loading 0: 48%|████▊ | 346/723 [00:49<00:16, 22.59it/s] Loading 0: 49%|████▉ | 354/723 [00:49<00:12, 29.22it/s] Loading 0: 50%|████▉ | 359/723 [00:49<00:11, 31.03it/s] Loading 0: 50%|█████ | 363/723 [00:49<00:11, 31.16it/s] Loading 0: 51%|█████ | 368/723 [00:50<00:13, 26.17it/s] Loading 0: 51%|█████▏ | 372/723 [00:50<00:12, 28.14it/s] Loading 0: 52%|█████▏ | 376/723 [00:50<00:11, 29.74it/s] Loading 0: 53%|█████▎ | 382/723 [00:50<00:10, 33.59it/s] Loading 0: 54%|█████▍ | 389/723 [00:50<00:08, 38.18it/s] Loading 0: 54%|█████▍ | 394/723 [00:50<00:11, 29.65it/s] Loading 0: 55%|█████▌ | 398/723 [00:50<00:10, 30.83it/s] Loading 0: 55%|█████▌ | 400/723 [01:10<00:10, 30.83it/s] Loading 0: 55%|█████▌ | 401/723 [01:10<07:22, 1.37s/it] Loading 0: 57%|█████▋ | 409/723 [01:10<04:05, 1.28it/s] Loading 0: 58%|█████▊ | 417/723 [01:11<02:28, 2.06it/s] Loading 0: 59%|█████▊ | 423/723 [01:11<01:51, 2.70it/s] Loading 0: 59%|█████▉ | 427/723 [01:11<01:27, 3.38it/s] Loading 0: 60%|██████ | 435/723 [01:11<00:54, 5.29it/s] Loading 0: 61%|██████ | 440/723 [01:11<00:41, 6.78it/s] Loading 0: 62%|██████▏ | 446/723 [01:12<00:32, 8.59it/s] Loading 0: 62%|██████▏ | 450/723 [01:12<00:27, 9.93it/s] Loading 0: 63%|██████▎ | 454/723 [01:12<00:22, 11.97it/s] Loading 0: 64%|██████▍ | 462/723 [01:12<00:14, 17.69it/s] Loading 0: 64%|██████▍ | 466/723 [01:12<00:12, 20.01it/s] Loading 0: 65%|██████▌ | 470/723 [01:13<00:13, 18.56it/s] Loading 0: 66%|██████▌ | 474/723 [01:13<00:12, 19.45it/s] Loading 0: 67%|██████▋ | 481/723 [01:13<00:09, 25.59it/s] Loading 0: 67%|██████▋ | 488/723 [01:13<00:07, 30.94it/s] Loading 0: 68%|██████▊ | 494/723 [01:13<00:08, 25.74it/s] Loading 0: 69%|██████▉ | 498/723 [01:13<00:08, 27.49it/s] Loading 0: 69%|██████▉ | 502/723 [01:14<00:07, 28.87it/s] Loading 0: 70%|███████ | 508/723 [01:14<00:06, 32.08it/s] Loading 0: 71%|███████ | 515/723 [01:14<00:05, 36.31it/s] Loading 0: 72%|███████▏ | 520/723 [01:14<00:07, 28.99it/s] Loading 0: 72%|███████▏ | 524/723 [01:14<00:06, 30.37it/s] Loading 0: 73%|███████▎ | 528/723 [01:14<00:06, 28.74it/s] Loading 0: 74%|███████▍ | 535/723 [01:15<00:05, 34.11it/s] Loading 0: 75%|███████▍ | 540/723 [01:35<00:05, 34.11it/s] Loading 0: 75%|███████▍ | 541/723 [01:35<03:20, 1.10s/it] Loading 0: 76%|███████▌ | 546/723 [01:35<02:24, 1.23it/s] Loading 0: 76%|███████▌ | 550/723 [01:35<01:50, 1.57it/s] Loading 0: 76%|███████▋ | 553/723 [01:35<01:27, 1.95it/s] Loading 0: 78%|███████▊ | 561/723 [01:36<00:48, 3.37it/s] Loading 0: 78%|███████▊ | 565/723 [01:36<00:36, 4.28it/s] Loading 0: 79%|███████▉ | 572/723 [01:36<00:24, 6.16it/s] Loading 0: 80%|███████▉ | 576/723 [01:36<00:19, 7.45it/s] Loading 0: 80%|████████ | 580/723 [01:36<00:15, 9.24it/s] Loading 0: 81%|████████▏ | 588/723 [01:36<00:09, 14.16it/s] Loading 0: 82%|████████▏ | 592/723 [01:37<00:07, 16.41it/s] Loading 0: 82%|████████▏ | 596/723 [01:37<00:07, 15.92it/s] Loading 0: 83%|████████▎ | 600/723 [01:37<00:07, 17.28it/s] Loading 0: 84%|████████▍ | 607/723 [01:37<00:04, 23.30it/s] Loading 0: 85%|████████▍ | 614/723 [01:37<00:03, 28.67it/s] Loading 0: 86%|████████▌ | 620/723 [01:38<00:04, 24.81it/s] Loading 0: 86%|████████▋ | 624/723 [01:38<00:03, 26.43it/s] Loading 0: 87%|████████▋ | 628/723 [01:38<00:03, 27.81it/s] Loading 0: 88%|████████▊ | 634/723 [01:38<00:02, 31.47it/s] Loading 0: 89%|████████▊ | 641/723 [01:38<00:02, 35.58it/s] Loading 0: 89%|████████▉ | 646/723 [01:38<00:02, 28.32it/s] Loading 0: 90%|████████▉ | 650/723 [01:39<00:02, 29.53it/s] Loading 0: 90%|█████████ | 654/723 [01:39<00:02, 27.79it/s] Loading 0: 91%|█████████▏| 661/723 [01:39<00:01, 33.47it/s] Loading 0: 93%|█████████▎| 669/723 [01:39<00:01, 42.15it/s] Loading 0: 93%|█████████▎| 674/723 [01:39<00:01, 27.75it/s] Loading 0: 94%|█████████▍| 678/723 [01:39<00:01, 29.38it/s] Loading 0: 94%|█████████▍| 678/723 [01:59<00:01, 29.38it/s] Loading 0: 94%|█████████▍| 679/723 [01:59<01:07, 1.53s/it] Loading 0: 95%|█████████▌| 687/723 [02:00<00:29, 1.20it/s] Loading 0: 96%|█████████▌| 692/723 [02:00<00:18, 1.67it/s] Loading 0: 97%|█████████▋| 698/723 [02:00<00:10, 2.41it/s] Loading 0: 97%|█████████▋| 702/723 [02:00<00:06, 3.03it/s] Loading 0: 98%|█████████▊| 706/723 [02:00<00:04, 3.94it/s] Loading 0: 99%|█████████▉| 714/723 [02:01<00:01, 6.48it/s] Loading 0: 99%|█████████▉| 718/723 [02:01<00:00, 7.98it/s] Loading 0: 100%|█████████▉| 722/723 [02:11<00:00, 7.98it/s] Loading 0: 100%|██████████| 723/723 [02:11<00:00, 1.41it/s] Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
turboderp-cat-llama-3-70-8684-v1-mkmlizer: quantized model in 148.396s
turboderp-cat-llama-3-70-8684-v1-mkmlizer: Processed model turboderp/Cat-Llama-3-70B-instruct in 356.952s
turboderp-cat-llama-3-70-8684-v1-mkmlizer: creating bucket guanaco-mkml-models
turboderp-cat-llama-3-70-8684-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
turboderp-cat-llama-3-70-8684-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/turboderp-cat-llama-3-70-8684-v1
turboderp-cat-llama-3-70-8684-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/turboderp-cat-llama-3-70-8684-v1/config.json
turboderp-cat-llama-3-70-8684-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/turboderp-cat-llama-3-70-8684-v1/tokenizer_config.json
turboderp-cat-llama-3-70-8684-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/turboderp-cat-llama-3-70-8684-v1/special_tokens_map.json
turboderp-cat-llama-3-70-8684-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/turboderp-cat-llama-3-70-8684-v1/tokenizer.json
turboderp-cat-llama-3-70-8684-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.5.safetensors s3://guanaco-mkml-models/turboderp-cat-llama-3-70-8684-v1/flywheel_model.5.safetensors
turboderp-cat-llama-3-70-8684-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.2.safetensors s3://guanaco-mkml-models/turboderp-cat-llama-3-70-8684-v1/flywheel_model.2.safetensors
turboderp-cat-llama-3-70-8684-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/turboderp-cat-llama-3-70-8684-v1/flywheel_model.1.safetensors
turboderp-cat-llama-3-70-8684-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/turboderp-cat-llama-3-70-8684-v1/flywheel_model.0.safetensors
turboderp-cat-llama-3-70-8684-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.4.safetensors s3://guanaco-mkml-models/turboderp-cat-llama-3-70-8684-v1/flywheel_model.4.safetensors
turboderp-cat-llama-3-70-8684-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.3.safetensors s3://guanaco-mkml-models/turboderp-cat-llama-3-70-8684-v1/flywheel_model.3.safetensors
turboderp-cat-llama-3-70-8684-v1-mkmlizer: loading reward model from ChaiML/reward_gpt2_medium_preference_24m_e2
turboderp-cat-llama-3-70-8684-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/configuration_auto.py:913: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
turboderp-cat-llama-3-70-8684-v1-mkmlizer: warnings.warn(
turboderp-cat-llama-3-70-8684-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py:757: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
turboderp-cat-llama-3-70-8684-v1-mkmlizer: warnings.warn(
turboderp-cat-llama-3-70-8684-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:468: FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
turboderp-cat-llama-3-70-8684-v1-mkmlizer: warnings.warn(
turboderp-cat-llama-3-70-8684-v1-mkmlizer: /opt/conda/lib/python3.10/site-packages/torch/_utils.py:831: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage()
turboderp-cat-llama-3-70-8684-v1-mkmlizer: return self.fget.__get__(instance, owner)()
turboderp-cat-llama-3-70-8684-v1-mkmlizer: Saving model to /tmp/reward_cache/reward.tensors
turboderp-cat-llama-3-70-8684-v1-mkmlizer: Saving duration: 0.412s
turboderp-cat-llama-3-70-8684-v1-mkmlizer: Processed model ChaiML/reward_gpt2_medium_preference_24m_e2 in 4.518s
turboderp-cat-llama-3-70-8684-v1-mkmlizer: creating bucket guanaco-reward-models
turboderp-cat-llama-3-70-8684-v1-mkmlizer: Bucket 's3://guanaco-reward-models/' created
turboderp-cat-llama-3-70-8684-v1-mkmlizer: uploading /tmp/reward_cache to s3://guanaco-reward-models/turboderp-cat-llama-3-70-8684-v1_reward
turboderp-cat-llama-3-70-8684-v1-mkmlizer: cp /tmp/reward_cache/config.json s3://guanaco-reward-models/turboderp-cat-llama-3-70-8684-v1_reward/config.json
turboderp-cat-llama-3-70-8684-v1-mkmlizer: cp /tmp/reward_cache/tokenizer_config.json s3://guanaco-reward-models/turboderp-cat-llama-3-70-8684-v1_reward/tokenizer_config.json
turboderp-cat-llama-3-70-8684-v1-mkmlizer: cp /tmp/reward_cache/special_tokens_map.json s3://guanaco-reward-models/turboderp-cat-llama-3-70-8684-v1_reward/special_tokens_map.json
turboderp-cat-llama-3-70-8684-v1-mkmlizer: cp /tmp/reward_cache/vocab.json s3://guanaco-reward-models/turboderp-cat-llama-3-70-8684-v1_reward/vocab.json
turboderp-cat-llama-3-70-8684-v1-mkmlizer: cp /tmp/reward_cache/merges.txt s3://guanaco-reward-models/turboderp-cat-llama-3-70-8684-v1_reward/merges.txt
turboderp-cat-llama-3-70-8684-v1-mkmlizer: cp /tmp/reward_cache/tokenizer.json s3://guanaco-reward-models/turboderp-cat-llama-3-70-8684-v1_reward/tokenizer.json
turboderp-cat-llama-3-70-8684-v1-mkmlizer: cp /tmp/reward_cache/reward.tensors s3://guanaco-reward-models/turboderp-cat-llama-3-70-8684-v1_reward/reward.tensors
Job turboderp-cat-llama-3-70-8684-v1-mkmlizer completed after 412.31s with status: succeeded
Stopping job with name turboderp-cat-llama-3-70-8684-v1-mkmlizer
Pipeline stage MKMLizer completed in 874.11s
Running pipeline stage MKMLKubeTemplater
Pipeline stage MKMLKubeTemplater completed in 0.11s
Running pipeline stage ISVCDeployer
Creating inference service turboderp-cat-llama-3-70-8684-v1
Waiting for inference service turboderp-cat-llama-3-70-8684-v1 to be ready
Inference service turboderp-cat-llama-3-70-8684-v1 ready after 91.46139717102051s
Pipeline stage ISVCDeployer completed in 99.50s
Running pipeline stage StressChecker
Received healthy response to inference request in 5.141484498977661s
Received healthy response to inference request in 4.20534086227417s
Received healthy response to inference request in 4.214126110076904s
Received healthy response to inference request in 3.6193079948425293s
Received healthy response to inference request in 3.709124803543091s
5 requests
0 failed requests
5th percentile: 3.637271356582642
10th percentile: 3.655234718322754
20th percentile: 3.6911614418029783
30th percentile: 3.8083680152893065
40th percentile: 4.006854438781739
50th percentile: 4.20534086227417
60th percentile: 4.2088549613952635
70th percentile: 4.212369060516357
80th percentile: 4.399597787857056
90th percentile: 4.770541143417359
95th percentile: 4.956012821197509
99th percentile: 5.10439016342163
mean time: 4.1778768539428714
Pipeline stage StressChecker completed in 21.85s
Running pipeline stage DaemonicModelEvalScorer
Pipeline stage DaemonicModelEvalScorer completed in 0.05s
Running M-Eval for topic stay_in_character
Running pipeline stage DaemonicSafetyScorer
M-Eval Dataset for topic stay_in_character is loaded
Pipeline stage DaemonicSafetyScorer completed in 0.12s
turboderp-cat-llama-3-70_8684_v1 status is now deployed due to DeploymentManager action
turboderp-cat-llama-3-70_8684_v1 status is now inactive due to auto deactivation removed underperforming models
admin requested tearing down of turboderp-cat-llama-3-70_8684_v1
Running pipeline stage ISVCDeleter
Checking if service turboderp-cat-llama-3-70-8684-v1 is running
Skipping teardown as no inference service was found
Pipeline stage ISVCDeleter completed in 3.95s
Running pipeline stage MKMLModelDeleter
Cleaning model data from S3
Cleaning model data from model cache
Deleting key turboderp-cat-llama-3-70-8684-v1/config.json from bucket guanaco-mkml-models
Deleting key turboderp-cat-llama-3-70-8684-v1/flywheel_model.0.safetensors from bucket guanaco-mkml-models
Deleting key turboderp-cat-llama-3-70-8684-v1/flywheel_model.1.safetensors from bucket guanaco-mkml-models
Deleting key turboderp-cat-llama-3-70-8684-v1/flywheel_model.2.safetensors from bucket guanaco-mkml-models
Deleting key turboderp-cat-llama-3-70-8684-v1/flywheel_model.3.safetensors from bucket guanaco-mkml-models
Deleting key turboderp-cat-llama-3-70-8684-v1/flywheel_model.4.safetensors from bucket guanaco-mkml-models
Deleting key turboderp-cat-llama-3-70-8684-v1/flywheel_model.5.safetensors from bucket guanaco-mkml-models
Deleting key turboderp-cat-llama-3-70-8684-v1/special_tokens_map.json from bucket guanaco-mkml-models
Deleting key turboderp-cat-llama-3-70-8684-v1/tokenizer.json from bucket guanaco-mkml-models
Deleting key turboderp-cat-llama-3-70-8684-v1/tokenizer_config.json from bucket guanaco-mkml-models
Cleaning model data from model cache
Deleting key turboderp-cat-llama-3-70-8684-v1_reward/config.json from bucket guanaco-reward-models
Deleting key turboderp-cat-llama-3-70-8684-v1_reward/merges.txt from bucket guanaco-reward-models
Deleting key turboderp-cat-llama-3-70-8684-v1_reward/reward.tensors from bucket guanaco-reward-models
Deleting key turboderp-cat-llama-3-70-8684-v1_reward/special_tokens_map.json from bucket guanaco-reward-models
Deleting key turboderp-cat-llama-3-70-8684-v1_reward/tokenizer.json from bucket guanaco-reward-models
Deleting key turboderp-cat-llama-3-70-8684-v1_reward/tokenizer_config.json from bucket guanaco-reward-models
Deleting key turboderp-cat-llama-3-70-8684-v1_reward/vocab.json from bucket guanaco-reward-models
Pipeline stage MKMLModelDeleter completed in 13.83s
turboderp-cat-llama-3-70_8684_v1 status is now torndown due to DeploymentManager action