Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer
Waiting for job on chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer to finish
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ Version: 0.30.2 ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ https://mk1.ai ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ belonging to: ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ Chai Research Corp. ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Job chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer completed after 41.84s with status: failed
Stopping job with name chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer
%s, retrying in %s seconds...
Starting job with name chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer
Waiting for job on chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer to finish
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ Version: 0.30.2 ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ https://mk1.ai ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ belonging to: ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ Chai Research Corp. ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Failed to get response for submission mistralai-mistral-nem_93303_v569: ('http://mistralai-mistral-nem-93303-v569-predictor.tenant-chaiml-guanaco.k2.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Retrying (%r) after connection broken by '%r': %s
Failed to get response for submission mistralai-mistral-nem_93303_v569: ('http://mistralai-mistral-nem-93303-v569-predictor.tenant-chaiml-guanaco.k2.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission mistralai-mistral-nem_93303_v569: ('http://mistralai-mistral-nem-93303-v569-predictor.tenant-chaiml-guanaco.k2.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-cai-v1-dpo-instr_73326_v5: HTTPConnectionPool(host='chaiml-cai-v1-dpo-instr-73326-v5-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-cai-v1-1-dpo-bas_56329_v5: HTTPConnectionPool(host='chaiml-cai-v1-1-dpo-bas-56329-v5-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-cai-v1-2-dpo-bas_75789_v5: HTTPConnectionPool(host='chaiml-cai-v1-2-dpo-bas-75789-v5-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-cai-v1-1-dpo-bas_42726_v4: HTTPConnectionPool(host='chaiml-cai-v1-1-dpo-bas-42726-v4-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-cai-v1-1-dpo-ins_70786_v4: HTTPConnectionPool(host='chaiml-cai-v1-1-dpo-ins-70786-v4-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-cai-users-v0-fu_42497_v78: HTTPConnectionPool(host='chaiml-cai-users-v0-fu-42497-v78-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission mistralai-mistral-nem_93303_v569: ('http://mistralai-mistral-nem-93303-v569-predictor.tenant-chaiml-guanaco.k2.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-cai-users-v0-fu_42497_v78: HTTPConnectionPool(host='chaiml-cai-users-v0-fu-42497-v78-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Stopping job with name chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer
%s, retrying in %s seconds...
Starting job with name chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer
Waiting for job on chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer to finish
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ Version: 0.30.2 ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ https://mk1.ai ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ belonging to: ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ Chai Research Corp. ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ║ ║
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Failed to get response for submission mistralai-mistral-nem_93303_v569: ('http://mistralai-mistral-nem-93303-v569-predictor.tenant-chaiml-guanaco.k2.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: Downloaded to shared memory in 47.358s
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: Checking if ChaiML/cai-v1-dpo_basev0-lr5e6b01 already exists in ChaiML
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: quantizing model to /dev/shm/model_cache, profile:q4, folder:/tmp/tmpajbzfn_z, device:0
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Failed to get response for submission chaiml-prm-v0-2-pair-ll_21380_v1: ('http://chaiml-prm-v0-2-pair-ll-21380-v1-predictor.creator-studio.kchai-coreweave-us-east-04a.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'read tcp 127.0.0.1:37128->127.0.0.1:8080: read: connection reset by peer\n')
Failed to get response for submission mistralai-mistral-nem_93303_v569: ('http://mistralai-mistral-nem-93303-v569-predictor.tenant-chaiml-guanaco.k2.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-cai-v1-1-dpo-bas_42726_v4: HTTPConnectionPool(host='chaiml-cai-v1-1-dpo-bas-42726-v4-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-cai-v1-1-dpo-ins_70786_v4: HTTPConnectionPool(host='chaiml-cai-v1-1-dpo-ins-70786-v4-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Unable to record family friendly update due to error: Invalid JSON input: Expecting value: line 1 column 1 (char 0)
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: quantized model in 283.653s
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: Processed model ChaiML/cai-v1-dpo_basev0-lr5e6b01 in 331.012s
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: creating bucket guanaco-mkml-models
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-cai-v1-dpo-basev-43236-v5/nvidia
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-cai-v1-dpo-basev-43236-v5/nvidia/config.json
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-cai-v1-dpo-basev-43236-v5/nvidia/special_tokens_map.json
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-cai-v1-dpo-basev-43236-v5/nvidia/tokenizer_config.json
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-cai-v1-dpo-basev-43236-v5/nvidia/tokenizer.json
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/chaiml-cai-v1-dpo-basev-43236-v5/nvidia/flywheel_model.1.safetensors
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-cai-v1-dpo-basev-43236-v5/nvidia/flywheel_model.0.safetensors
chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%| | 3/363 [00:02<04:19, 1.38it/s]
Loading 0: 1%| | 4/363 [00:04<06:41, 1.12s/it]
Loading 0: 1%|▏ | 5/363 [00:06<08:23, 1.41s/it]
Loading 0: 2%|▏ | 8/363 [00:06<03:53, 1.52it/s]
Loading 0: 2%|▏ | 9/363 [00:06<03:24, 1.73it/s]
Loading 0: 3%|▎ | 10/363 [00:06<02:45, 2.13it/s]
Loading 0: 3%|▎ | 12/363 [00:08<03:56, 1.49it/s]
Loading 0: 4%|▎ | 13/363 [00:10<05:35, 1.04it/s]
Loading 0: 4%|▍ | 14/363 [00:12<07:03, 1.21s/it]
Loading 0: 5%|▍ | 17/363 [00:13<03:43, 1.55it/s]
Loading 0: 5%|▍ | 18/363 [00:13<03:16, 1.76it/s]
Loading 0: 5%|▌ | 19/363 [00:13<02:42, 2.12it/s]
Loading 0: 6%|▌ | 21/363 [00:15<03:47, 1.50it/s]
Loading 0: 6%|▌ | 22/363 [00:17<05:21, 1.06it/s]
Loading 0: 6%|▋ | 23/363 [00:19<06:47, 1.20s/it]
Loading 0: 7%|▋ | 26/363 [00:19<03:38, 1.54it/s]
Loading 0: 7%|▋ | 27/363 [00:19<03:11, 1.75it/s]
Loading 0: 8%|▊ | 28/363 [00:20<02:38, 2.12it/s]
Loading 0: 8%|▊ | 30/363 [00:20<01:56, 2.86it/s]
Loading 0: 9%|▊ | 31/363 [00:20<01:48, 3.05it/s]
Loading 0: 9%|▉ | 32/363 [00:20<01:32, 3.60it/s]
Loading 0: 9%|▉ | 33/363 [00:20<01:19, 4.13it/s]
Loading 0: 9%|▉ | 34/363 [00:22<03:48, 1.44it/s]
Loading 0: 10%|▉ | 35/363 [00:24<05:40, 1.04s/it]
Loading 0: 10%|▉ | 36/363 [00:26<07:10, 1.32s/it]
Loading 0: 11%|█ | 39/363 [00:28<05:13, 1.03it/s]
Loading 0: 11%|█ | 40/363 [00:30<06:17, 1.17s/it]
Loading 0: 11%|█▏ | 41/363 [00:32<07:20, 1.37s/it]
Loading 0: 12%|█▏ | 44/363 [00:33<03:56, 1.35it/s]
Loading 0: 12%|█▏ | 45/363 [00:33<03:26, 1.54it/s]
Loading 0: 13%|█▎ | 46/363 [00:33<02:49, 1.87it/s]
Loading 0: 13%|█▎ | 48/363 [00:35<03:42, 1.41it/s]
Loading 0: 13%|█▎ | 49/363 [00:37<05:05, 1.03it/s]
Loading 0: 14%|█▍ | 50/363 [00:39<06:21, 1.22s/it]
Loading 0: 15%|█▍ | 53/363 [00:39<03:24, 1.52it/s]
Loading 0: 15%|█▍ | 54/363 [00:39<02:59, 1.73it/s]
Loading 0: 15%|█▌ | 55/363 [00:40<02:27, 2.09it/s]
Loading 0: 16%|█▌ | 57/363 [00:42<03:24, 1.50it/s]
Loading 0: 16%|█▌ | 58/363 [00:44<04:47, 1.06it/s]
Loading 0: 16%|█▋ | 59/363 [00:46<06:03, 1.20s/it]
Loading 0: 17%|█▋ | 62/363 [00:46<03:14, 1.54it/s]
Loading 0: 17%|█▋ | 63/363 [00:46<02:51, 1.75it/s]
Loading 0: 18%|█▊ | 64/363 [00:46<02:20, 2.12it/s]
Loading 0: 18%|█▊ | 65/363 [00:48<04:04, 1.22it/s]
Loading 0: 18%|█▊ | 67/363 [00:48<02:42, 1.82it/s]
Loading 0: 19%|█▊ | 68/363 [00:49<02:22, 2.07it/s]
Loading 0: 19%|█▉ | 69/363 [00:49<01:56, 2.52it/s]
Loading 0: 19%|█▉ | 70/363 [00:49<01:35, 3.08it/s]
Loading 0: 20%|█▉ | 71/363 [00:51<03:42, 1.31it/s]
Loading 0: 20%|█▉ | 72/363 [00:53<05:17, 1.09s/it]
Loading 0: 20%|██ | 73/363 [00:55<06:33, 1.36s/it]
Loading 0: 21%|██ | 76/363 [00:55<03:11, 1.50it/s]
Loading 0: 21%|██ | 77/363 [00:55<02:44, 1.73it/s]
Loading 0: 21%|██▏ | 78/363 [00:56<02:14, 2.12it/s]
Loading 0: 22%|██▏ | 79/363 [00:57<03:56, 1.20it/s]
Loading 0: 22%|██▏ | 80/363 [00:59<05:23, 1.14s/it]
Loading 0: 23%|██▎ | 82/363 [01:00<03:21, 1.40it/s]
Loading 0: 23%|██▎ | 83/363 [01:00<02:49, 1.65it/s]
Loading 0: 23%|██▎ | 84/363 [01:00<02:14, 2.07it/s]
Loading 0: 24%|██▎ | 86/363 [01:02<03:11, 1.45it/s]
Loading 0: 24%|██▍ | 87/363 [01:04<04:34, 1.00it/s]
Loading 0: 25%|██▍ | 90/363 [01:06<03:45, 1.21it/s]
Loading 0: 25%|██▌ | 91/363 [01:08<04:42, 1.04s/it]
Loading 0: 25%|██▌ | 92/363 [01:10<05:38, 1.25s/it]
Loading 0: 26%|██▌ | 95/363 [01:10<03:06, 1.44it/s]
Loading 0: 26%|██▋ | 96/363 [01:11<02:43, 1.63it/s]
Loading 0: 27%|██▋ | 97/363 [01:11<02:15, 1.97it/s]
Loading 0: 27%|██▋ | 99/363 [01:13<03:00, 1.46it/s]
Loading 0: 28%|██▊ | 100/363 [01:15<04:09, 1.05it/s]
Loading 0: 28%|██▊ | 101/363 [01:17<05:13, 1.20s/it]
Loading 0: 29%|██▊ | 104/363 [01:17<02:47, 1.54it/s]
Loading 0: 29%|██▉ | 105/363 [01:17<02:27, 1.75it/s]
Loading 0: 29%|██▉ | 106/363 [01:17<02:02, 2.10it/s]
Loading 0: 29%|██▉ | 107/363 [01:17<01:41, 2.53it/s]
Loading 0: 30%|██▉ | 108/363 [01:19<03:20, 1.27it/s]
Loading 0: 31%|███ | 111/363 [01:21<03:00, 1.40it/s]
Loading 0: 31%|███ | 112/363 [01:23<04:00, 1.04it/s]
Loading 0: 31%|███ | 113/363 [01:25<04:59, 1.20s/it]
Loading 0: 32%|███▏ | 116/363 [01:26<02:43, 1.51it/s]
Loading 0: 32%|███▏ | 117/363 [01:26<02:23, 1.71it/s]
Loading 0: 33%|███▎ | 118/363 [01:26<01:58, 2.06it/s]
Loading 0: 33%|███▎ | 120/363 [01:28<02:43, 1.49it/s]
Loading 0: 33%|███▎ | 121/363 [01:30<03:47, 1.06it/s]
Loading 0: 34%|███▎ | 122/363 [01:32<04:47, 1.19s/it]
Loading 0: 34%|███▍ | 125/363 [01:32<02:33, 1.55it/s]
Loading 0: 35%|███▍ | 126/363 [01:32<02:14, 1.76it/s]
Loading 0: 35%|███▍ | 127/363 [01:32<01:51, 2.12it/s]
Loading 0: 36%|███▌ | 129/363 [01:34<02:35, 1.50it/s]
Loading 0: 36%|███▌ | 130/363 [01:36<03:38, 1.06it/s]
Loading 0: 36%|███▌ | 131/363 [01:38<04:37, 1.19s/it]
Loading 0: 37%|███▋ | 134/363 [01:39<02:28, 1.55it/s]
Loading 0: 37%|███▋ | 135/363 [01:39<02:09, 1.76it/s]
Loading 0: 37%|███▋ | 136/363 [01:39<01:46, 2.12it/s]
Loading 0: 38%|███▊ | 138/363 [01:41<02:29, 1.50it/s]
Loading 0: 38%|███▊ | 139/363 [01:43<03:30, 1.06it/s]
Loading 0: 39%|███▊ | 140/363 [01:45<04:26, 1.19s/it]
Loading 0: 39%|███▉ | 143/363 [01:45<02:22, 1.55it/s]
Loading 0: 40%|███▉ | 144/363 [01:46<02:04, 1.76it/s]
Loading 0: 40%|███▉ | 145/363 [01:46<01:43, 2.11it/s]
Loading 0: 40%|████ | 147/363 [01:46<01:15, 2.85it/s]
Loading 0: 41%|████ | 148/363 [01:46<01:10, 3.05it/s]
Loading 0: 41%|████ | 149/363 [01:46<00:59, 3.60it/s]
Loading 0: 41%|████▏ | 150/363 [01:46<00:51, 4.15it/s]
Loading 0: 42%|████▏ | 151/363 [01:48<02:26, 1.44it/s]
Loading 0: 42%|████▏ | 152/363 [01:50<03:38, 1.04s/it]
Loading 0: 42%|████▏ | 153/363 [01:52<04:35, 1.31s/it]
Loading 0: 43%|████▎ | 156/363 [01:54<03:16, 1.05it/s]
Loading 0: 43%|████▎ | 157/363 [01:56<03:58, 1.16s/it]
Loading 0: 44%|████▎ | 158/363 [01:58<04:37, 1.36s/it]
Loading 0: 44%|████▍ | 161/363 [01:59<02:28, 1.36it/s]
Loading 0: 45%|████▍ | 162/363 [01:59<02:07, 1.57it/s]
Loading 0: 45%|████▍ | 163/363 [01:59<01:44, 1.90it/s]
Loading 0: 45%|████▌ | 165/363 [02:01<02:18, 1.43it/s]
Loading 0: 46%|████▌ | 166/363 [02:03<03:09, 1.04it/s]
Loading 0: 46%|████▌ | 167/363 [02:05<03:57, 1.21s/it]
Loading 0: 47%|████▋ | 170/363 [02:05<02:06, 1.53it/s]
Loading 0: 47%|████▋ | 171/363 [02:05<01:50, 1.74it/s]
Loading 0: 47%|████▋ | 172/363 [02:06<01:31, 2.10it/s]
Loading 0: 48%|████▊ | 174/363 [02:07<02:06, 1.50it/s]
Loading 0: 48%|████▊ | 175/363 [02:09<02:57, 1.06it/s]
Loading 0: 48%|████▊ | 176/363 [02:11<03:43, 1.19s/it]
Loading 0: 49%|████▉ | 179/363 [02:12<01:58, 1.55it/s]
Loading 0: 50%|████▉ | 180/363 [02:12<01:43, 1.77it/s]
Loading 0: 50%|████▉ | 181/363 [02:12<01:25, 2.12it/s]
Loading 0: 50%|█████ | 182/363 [02:14<02:28, 1.22it/s]
Loading 0: 51%|█████ | 184/363 [02:14<01:38, 1.81it/s]
Loading 0: 51%|█████ | 185/363 [02:15<01:25, 2.07it/s]
Loading 0: 51%|█████ | 186/363 [02:15<01:09, 2.54it/s]
Loading 0: 52%|█████▏ | 187/363 [02:15<00:57, 3.07it/s]
Loading 0: 52%|█████▏ | 188/363 [02:17<02:13, 1.31it/s]
Loading 0: 52%|█████▏ | 189/363 [02:19<03:12, 1.11s/it]
Loading 0: 53%|█████▎ | 192/363 [02:21<02:26, 1.17it/s]
Loading 0: 53%|█████▎ | 193/363 [02:23<03:04, 1.08s/it]
Loading 0: 53%|█████▎ | 194/363 [02:25<03:40, 1.31s/it]
Loading 0: 54%|█████▍ | 197/363 [02:25<01:58, 1.40it/s]
Loading 0: 55%|█████▍ | 198/363 [02:25<01:42, 1.61it/s]
Loading 0: 55%|█████▍ | 199/363 [02:25<01:24, 1.94it/s]
Loading 0: 55%|█████▌ | 201/363 [02:27<01:51, 1.45it/s]
Loading 0: 56%|█████▌ | 202/363 [02:29<02:34, 1.04it/s]
Loading 0: 56%|█████▌ | 203/363 [02:31<03:12, 1.20s/it]
Loading 0: 57%|█████▋ | 206/363 [02:32<01:42, 1.53it/s]
Loading 0: 57%|█████▋ | 207/363 [02:32<01:29, 1.74it/s]
Loading 0: 57%|█████▋ | 208/363 [02:32<01:13, 2.10it/s]
Loading 0: 58%|█████▊ | 210/363 [02:34<01:41, 1.50it/s]
Loading 0: 58%|█████▊ | 211/363 [02:36<02:22, 1.07it/s]
Loading 0: 58%|█████▊ | 212/363 [02:38<02:59, 1.19s/it]
Loading 0: 59%|█████▉ | 215/363 [02:38<01:35, 1.55it/s]
Loading 0: 60%|█████▉ | 216/363 [02:38<01:23, 1.76it/s]
Loading 0: 60%|█████▉ | 217/363 [02:39<01:08, 2.13it/s]
Loading 0: 60%|██████ | 218/363 [02:41<01:58, 1.22it/s]
Loading 0: 60%|██████ | 219/363 [02:43<02:41, 1.12s/it]
Loading 0: 61%|██████ | 221/363 [02:43<01:41, 1.40it/s]
Loading 0: 61%|██████ | 222/363 [02:43<01:25, 1.65it/s]
Loading 0: 61%|██████▏ | 223/363 [02:43<01:08, 2.06it/s]
Loading 0: 62%|██████▏ | 224/363 [02:43<00:54, 2.55it/s]
Loading 0: 62%|██████▏ | 225/363 [02:45<01:52, 1.23it/s]
Loading 0: 63%|██████▎ | 228/363 [02:47<01:37, 1.38it/s]
Loading 0: 63%|██████▎ | 229/363 [02:49<02:10, 1.03it/s]
Loading 0: 63%|██████▎ | 230/363 [02:51<02:41, 1.21s/it]
Loading 0: 64%|██████▍ | 233/363 [02:51<01:26, 1.51it/s]
Loading 0: 64%|██████▍ | 234/363 [02:52<01:14, 1.73it/s]
Loading 0: 65%|██████▍ | 235/363 [02:52<01:01, 2.07it/s]
Loading 0: 65%|██████▌ | 237/363 [02:54<01:24, 1.50it/s]
Loading 0: 66%|██████▌ | 238/363 [02:56<01:57, 1.07it/s]
Loading 0: 66%|██████▌ | 239/363 [02:58<02:27, 1.19s/it]
Loading 0: 67%|██████▋ | 242/363 [02:58<01:17, 1.55it/s]
Loading 0: 67%|██████▋ | 243/363 [02:58<01:08, 1.76it/s]
Loading 0: 67%|██████▋ | 244/363 [02:58<00:56, 2.12it/s]
Loading 0: 68%|██████▊ | 246/363 [03:00<01:17, 1.51it/s]
Loading 0: 68%|██████▊ | 247/363 [03:02<01:48, 1.07it/s]
Loading 0: 68%|██████▊ | 248/363 [03:04<02:17, 1.19s/it]
Loading 0: 69%|██████▉ | 251/363 [03:05<01:12, 1.55it/s]
Loading 0: 69%|██████▉ | 252/363 [03:05<01:03, 1.76it/s]
Loading 0: 70%|██████▉ | 253/363 [03:05<00:51, 2.12it/s]
Loading 0: 70%|███████ | 255/363 [03:07<01:11, 1.51it/s]
Loading 0: 71%|███████ | 256/363 [03:09<01:40, 1.07it/s]
Loading 0: 71%|███████ | 257/363 [03:11<02:05, 1.19s/it]
Loading 0: 72%|███████▏ | 260/363 [03:11<01:06, 1.55it/s]
Loading 0: 72%|███████▏ | 261/363 [03:11<00:57, 1.76it/s]
Loading 0: 72%|███████▏ | 262/363 [03:12<00:47, 2.12it/s]
Loading 0: 73%|███████▎ | 264/363 [03:12<00:34, 2.86it/s]
Loading 0: 73%|███████▎ | 265/363 [03:12<00:32, 3.06it/s]
Loading 0: 73%|███████▎ | 266/363 [03:12<00:26, 3.60it/s]
Loading 0: 74%|███████▎ | 267/363 [03:12<00:23, 4.15it/s]
Loading 0: 74%|███████▍ | 268/363 [03:14<01:05, 1.45it/s]
Loading 0: 74%|███████▍ | 269/363 [03:16<01:37, 1.04s/it]
Loading 0: 74%|███████▍ | 270/363 [03:18<02:02, 1.31s/it]
Loading 0: 75%|███████▌ | 273/363 [03:33<04:56, 3.29s/it]
Loading 0: 75%|███████▌ | 274/363 [03:35<04:28, 3.01s/it]
Loading 0: 76%|███████▌ | 275/363 [03:37<04:06, 2.80s/it]
Loading 0: 77%|███████▋ | 278/363 [03:37<02:04, 1.47s/it]
Loading 0: 77%|███████▋ | 279/363 [03:38<01:43, 1.23s/it]
Loading 0: 77%|███████▋ | 280/363 [03:38<01:22, 1.01it/s]
Loading 0: 78%|███████▊ | 282/363 [03:40<01:19, 1.02it/s]
Loading 0: 78%|███████▊ | 283/363 [03:42<01:34, 1.19s/it]
Loading 0: 78%|███████▊ | 284/363 [03:44<01:49, 1.38s/it]
Loading 0: 79%|███████▉ | 287/363 [03:44<00:55, 1.36it/s]
Loading 0: 79%|███████▉ | 288/363 [03:44<00:47, 1.57it/s]
Loading 0: 80%|███████▉ | 289/363 [03:44<00:38, 1.91it/s]
Loading 0: 80%|████████ | 291/363 [03:46<00:50, 1.44it/s]
Loading 0: 80%|████████ | 292/363 [03:48<01:08, 1.04it/s]
Loading 0: 81%|████████ | 293/363 [03:50<01:24, 1.20s/it]
Loading 0: 82%|████████▏ | 296/363 [03:50<00:43, 1.54it/s]
Loading 0: 82%|████████▏ | 297/363 [03:51<00:37, 1.77it/s]
Loading 0: 82%|████████▏ | 298/363 [03:51<00:30, 2.12it/s]
Loading 0: 82%|████████▏ | 299/363 [03:53<00:52, 1.22it/s]
Loading 0: 83%|████████▎ | 301/363 [03:53<00:33, 1.84it/s]
Loading 0: 83%|████████▎ | 302/363 [03:53<00:28, 2.12it/s]
Loading 0: 83%|████████▎ | 303/363 [03:53<00:23, 2.59it/s]
Loading 0: 84%|████████▎ | 304/363 [03:53<00:18, 3.13it/s]
Loading 0: 84%|████████▍ | 305/363 [03:55<00:43, 1.33it/s]
Loading 0: 84%|████████▍ | 306/363 [03:57<01:02, 1.10s/it]
Loading 0: 85%|████████▌ | 309/363 [03:59<00:46, 1.17it/s]
Loading 0: 85%|████████▌ | 310/363 [04:01<00:57, 1.08s/it]
Loading 0: 86%|████████▌ | 311/363 [04:03<01:07, 1.29s/it]
Loading 0: 87%|████████▋ | 314/363 [04:04<00:34, 1.43it/s]
Loading 0: 87%|████████▋ | 315/363 [04:04<00:29, 1.65it/s]
Loading 0: 87%|████████▋ | 316/363 [04:04<00:23, 1.99it/s]
Loading 0: 88%|████████▊ | 318/363 [04:06<00:30, 1.47it/s]
Loading 0: 88%|████████▊ | 319/363 [04:08<00:41, 1.05it/s]
Loading 0: 88%|████████▊ | 320/363 [04:10<00:51, 1.20s/it]
Loading 0: 89%|████████▉ | 323/363 [04:10<00:25, 1.55it/s]
Loading 0: 89%|████████▉ | 324/363 [04:10<00:21, 1.77it/s]
Loading 0: 90%|████████▉ | 325/363 [04:10<00:17, 2.14it/s]
Loading 0: 90%|█████████ | 327/363 [04:12<00:23, 1.51it/s]
Loading 0: 90%|█████████ | 328/363 [04:14<00:32, 1.07it/s]
Loading 0: 91%|█████████ | 329/363 [04:16<00:40, 1.18s/it]
Loading 0: 91%|█████████▏| 332/363 [04:17<00:19, 1.57it/s]
Loading 0: 92%|█████████▏| 333/363 [04:17<00:16, 1.80it/s]
Loading 0: 92%|█████████▏| 334/363 [04:17<00:13, 2.17it/s]
Loading 0: 92%|█████████▏| 335/363 [04:19<00:22, 1.23it/s]
Loading 0: 93%|█████████▎| 336/363 [04:21<00:30, 1.11s/it]
Loading 0: 93%|█████████▎| 338/363 [04:21<00:17, 1.42it/s]
Loading 0: 93%|█████████▎| 339/363 [04:21<00:14, 1.68it/s]
Loading 0: 94%|█████████▎| 340/363 [04:22<00:11, 2.09it/s]
Loading 0: 94%|█████████▍| 341/363 [04:22<00:09, 2.33it/s]
Loading 0: 94%|█████████▍| 343/363 [04:24<00:13, 1.51it/s]
Loading 0: 95%|█████████▌| 346/363 [04:26<00:11, 1.52it/s]
Loading 0: 96%|█████████▌| 347/363 [04:28<00:14, 1.11it/s]
Loading 0: 96%|█████████▌| 348/363 [04:30<00:16, 1.13s/it]
Loading 0: 97%|█████████▋| 351/363 [04:30<00:07, 1.56it/s]
Loading 0: 97%|█████████▋| 352/363 [04:30<00:06, 1.77it/s]
Loading 0: 97%|█████████▋| 353/363 [04:30<00:04, 2.12it/s]
Loading 0: 98%|█████████▊| 355/363 [04:32<00:05, 1.50it/s]
Loading 0: 98%|█████████▊| 356/363 [04:34<00:06, 1.06it/s]
Loading 0: 98%|█████████▊| 357/363 [04:36<00:07, 1.20s/it]
Loading 0: 99%|█████████▉| 360/363 [04:37<00:01, 1.54it/s]
Loading 0: 99%|█████████▉| 361/363 [04:37<00:01, 1.76it/s]
Loading 0: 100%|█████████▉| 362/363 [04:37<00:00, 2.12it/s]
Job chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer completed after 378.75s with status: succeeded
Stopping job with name chaiml-cai-v1-dpo-basev-43236-v5-mkmlizer
Pipeline stage MKMLizer completed in 3584.55s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.18s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-cai-v1-dpo-basev-43236-v5
Waiting for inference service chaiml-cai-v1-dpo-basev-43236-v5 to be ready
Failed to get response for submission chaiml-prm-v0-2-pair-ll_21380_v1: HTTPConnectionPool(host='chaiml-prm-v0-2-pair-ll-21380-v1-predictor.creator-studio.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-cai-users-v0-fu_42497_v78: HTTPConnectionPool(host='chaiml-cai-users-v0-fu-42497-v78-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Retrying (%r) after connection broken by '%r': %s
Failed to get response for submission chaiml-cai-v1-1-dpo-ins_84951_v5: HTTPConnectionPool(host='chaiml-cai-v1-1-dpo-ins-84951-v5-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission mistralai-mistral-nem_93303_v569: ('http://mistralai-mistral-nem-93303-v569-predictor.tenant-chaiml-guanaco.k2.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-cai-v1-1-dpo-ins_84951_v5: HTTPConnectionPool(host='chaiml-cai-v1-1-dpo-ins-84951-v5-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-cai-users-v0-fu_42497_v78: HTTPConnectionPool(host='chaiml-cai-users-v0-fu-42497-v78-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Tearing down inference service chaiml-cai-v1-dpo-basev-43236-v5
%s, retrying in %s seconds...
Creating inference service chaiml-cai-v1-dpo-basev-43236-v5
Waiting for inference service chaiml-cai-v1-dpo-basev-43236-v5 to be ready
Failed to get response for submission chaiml-cai-v1-1-dpo-bas_42726_v4: HTTPConnectionPool(host='chaiml-cai-v1-1-dpo-bas-42726-v4-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-cai-v1-1-dpo-bas_42726_v4: HTTPConnectionPool(host='chaiml-cai-v1-1-dpo-bas-42726-v4-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-cai-v1-dpo-instr_73326_v5: HTTPConnectionPool(host='chaiml-cai-v1-dpo-instr-73326-v5-predictor.tenant-chaiml-guanaco.kchai-coreweave-us-east-04a.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Inference service chaiml-cai-v1-dpo-basev-43236-v5 ready after 90.45872497558594s
Pipeline stage MKMLDeployer completed in 707.21s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.2875635623931885s
Received healthy response to inference request in 2.9616756439208984s
Failed to get response for submission mistralai-mistral-nem_93303_v569: ('http://mistralai-mistral-nem-93303-v569-predictor.tenant-chaiml-guanaco.k2.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Received healthy response to inference request in 2.9750001430511475s
Received healthy response to inference request in 2.962132215499878s
Received healthy response to inference request in 2.9566426277160645s
5 requests
0 failed requests
5th percentile: 2.9576492309570312
10th percentile: 2.958655834197998
20th percentile: 2.9606690406799316
30th percentile: 2.9617669582366943
40th percentile: 2.961949586868286
50th percentile: 2.962132215499878
60th percentile: 2.967279386520386
70th percentile: 2.9724265575408935
80th percentile: 3.037512826919556
90th percentile: 3.162538194656372
95th percentile: 3.22505087852478
99th percentile: 3.275061025619507
mean time: 3.0286028385162354
Pipeline stage StressChecker completed in 17.42s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.74s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.78s
Shutdown handler de-registered
chaiml-cai-v1-dpo-basev_43236_v5 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 5512.29s
Shutdown handler de-registered
chaiml-cai-v1-dpo-basev_43236_v5 status is now inactive due to auto deactivation removed underperforming models
chaiml-cai-v1-dpo-basev_43236_v5 status is now torndown due to DeploymentManager action