Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-1107-quang-ir-mi-74661-v1-mkmlizer
Waiting for job on chaiml-1107-quang-ir-mi-74661-v1-mkmlizer to finish
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: ║ ║
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: ║ Version: 0.30.2 ║
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: ║ https://mk1.ai ║
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: ║ ║
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: ║ belonging to: ║
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: ║ ║
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: ║ Chai Research Corp. ║
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: ║ ║
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: Downloaded to shared memory in 60.069s
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: Checking if ChaiML/1107-quang-ir-mixall-cld-i8a6-dpohrd-f5000-dpo_mult25k_0164641ep_2e6ct already exists in ChaiML
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpoodljxsm, device:0
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: quantized model in 42.466s
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: Processed model ChaiML/1107-quang-ir-mixall-cld-i8a6-dpohrd-f5000-dpo_mult25k_0164641ep_2e6ct in 102.536s
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: creating bucket guanaco-mkml-models
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-1107-quang-ir-mi-74661-v1/nvidia
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-1107-quang-ir-mi-74661-v1/nvidia/config.json
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-1107-quang-ir-mi-74661-v1/nvidia/special_tokens_map.json
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-1107-quang-ir-mi-74661-v1/nvidia/tokenizer_config.json
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-1107-quang-ir-mi-74661-v1/nvidia/tokenizer.json
Failed to get response for submission mistralai-mistral-nem_93303_v569: ('http://mistralai-mistral-nem-93303-v569-predictor.tenant-chaiml-guanaco.k2.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/chaiml-1107-quang-ir-mi-74661-v1/nvidia/flywheel_model.1.safetensors
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-1107-quang-ir-mi-74661-v1/nvidia/flywheel_model.0.safetensors
chaiml-1107-quang-ir-mi-74661-v1-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%|▏ | 5/363 [00:00<00:16, 21.67it/s]
Loading 0: 3%|▎ | 12/363 [00:00<00:08, 39.62it/s]
Loading 0: 5%|▍ | 17/363 [00:00<00:09, 35.59it/s]
Loading 0: 6%|▌ | 22/363 [00:00<00:09, 35.32it/s]
Loading 0: 7%|▋ | 26/363 [00:00<00:09, 34.70it/s]
Loading 0: 9%|▉ | 32/363 [00:00<00:08, 40.32it/s]
Loading 0: 10%|█ | 37/363 [00:01<00:13, 24.95it/s]
Loading 0: 11%|█▏ | 41/363 [00:01<00:13, 23.18it/s]
Loading 0: 13%|█▎ | 48/363 [00:01<00:10, 30.66it/s]
Loading 0: 14%|█▍ | 52/363 [00:01<00:10, 29.84it/s]
Loading 0: 16%|█▌ | 57/363 [00:01<00:09, 32.79it/s]
Loading 0: 17%|█▋ | 61/363 [00:01<00:09, 30.69it/s]
Loading 0: 18%|█▊ | 65/363 [00:02<00:09, 32.05it/s]
Loading 0: 19%|█▉ | 70/363 [00:02<00:10, 28.33it/s]
Loading 0: 20%|██ | 74/363 [00:02<00:11, 25.59it/s]
Loading 0: 22%|██▏ | 79/363 [00:02<00:09, 29.53it/s]
Loading 0: 23%|██▎ | 83/363 [00:02<00:09, 30.89it/s]
Loading 0: 24%|██▍ | 87/363 [00:02<00:10, 27.32it/s]
Loading 0: 25%|██▌ | 92/363 [00:03<00:10, 26.58it/s]
Loading 0: 27%|██▋ | 99/363 [00:03<00:07, 33.93it/s]
Loading 0: 28%|██▊ | 103/363 [00:03<00:08, 31.82it/s]
Loading 0: 29%|██▉ | 107/363 [00:03<00:09, 27.05it/s]
Loading 0: 31%|███ | 112/363 [00:03<00:08, 29.58it/s]
Loading 0: 32%|███▏ | 116/363 [00:03<00:08, 29.51it/s]
Loading 0: 33%|███▎ | 120/363 [00:03<00:07, 31.75it/s]
Loading 0: 34%|███▍ | 124/363 [00:04<00:07, 29.95it/s]
Loading 0: 36%|███▌ | 129/363 [00:04<00:06, 33.61it/s]
Loading 0: 37%|███▋ | 133/363 [00:04<00:07, 31.83it/s]
Loading 0: 38%|███▊ | 138/363 [00:04<00:06, 35.04it/s]
Loading 0: 39%|███▉ | 142/363 [00:04<00:06, 32.69it/s]
Loading 0: 40%|████ | 146/363 [00:04<00:07, 30.39it/s]
Loading 0: 41%|████▏ | 150/363 [00:04<00:08, 26.30it/s]
Loading 0: 42%|████▏ | 153/363 [00:05<00:09, 22.64it/s]
Loading 0: 44%|████▎ | 158/363 [00:05<00:08, 23.06it/s]
Loading 0: 45%|████▌ | 165/363 [00:05<00:06, 30.74it/s]
Loading 0: 47%|████▋ | 169/363 [00:05<00:06, 29.25it/s]
Loading 0: 48%|████▊ | 174/363 [00:05<00:05, 32.46it/s]
Loading 0: 49%|████▉ | 178/363 [00:05<00:06, 30.83it/s]
Loading 0: 50%|█████ | 182/363 [00:06<00:05, 32.01it/s]
Loading 0: 52%|█████▏ | 187/363 [00:06<00:06, 28.68it/s]
Loading 0: 53%|█████▎ | 191/363 [00:06<00:06, 28.16it/s]
Loading 0: 53%|█████▎ | 194/363 [00:06<00:07, 24.03it/s]
Loading 0: 55%|█████▌ | 201/363 [00:19<02:13, 1.21it/s]
Loading 0: 56%|█████▌ | 203/363 [00:19<01:54, 1.39it/s]
Loading 0: 57%|█████▋ | 208/363 [00:19<01:13, 2.10it/s]
Loading 0: 58%|█████▊ | 211/363 [00:20<00:57, 2.65it/s]
Loading 0: 59%|█████▉ | 214/363 [00:20<00:43, 3.42it/s]
Loading 0: 60%|██████ | 218/363 [00:20<00:30, 4.80it/s]
Loading 0: 61%|██████ | 221/363 [00:20<00:23, 6.08it/s]
Loading 0: 62%|██████▏ | 224/363 [00:20<00:19, 7.31it/s]
Loading 0: 63%|██████▎ | 229/363 [00:20<00:12, 10.65it/s]
Loading 0: 64%|██████▍ | 232/363 [00:20<00:10, 12.58it/s]
Loading 0: 65%|██████▌ | 237/363 [00:20<00:07, 17.19it/s]
Loading 0: 66%|██████▋ | 241/363 [00:21<00:06, 19.38it/s]
Loading 0: 68%|██████▊ | 246/363 [00:21<00:04, 24.11it/s]
Loading 0: 69%|██████▉ | 250/363 [00:21<00:04, 25.15it/s]
Loading 0: 70%|███████ | 255/363 [00:21<00:03, 29.60it/s]
Loading 0: 71%|███████▏ | 259/363 [00:21<00:03, 28.49it/s]
Loading 0: 73%|███████▎ | 266/363 [00:21<00:02, 35.50it/s]
Loading 0: 75%|███████▍ | 271/363 [00:22<00:03, 25.35it/s]
Loading 0: 76%|███████▌ | 275/363 [00:22<00:03, 23.77it/s]
Loading 0: 78%|███████▊ | 282/363 [00:22<00:02, 30.88it/s]
Loading 0: 79%|███████▉ | 286/363 [00:22<00:02, 30.12it/s]
Loading 0: 80%|████████ | 291/363 [00:22<00:02, 33.31it/s]
Loading 0: 81%|████████▏ | 295/363 [00:22<00:02, 32.17it/s]
Loading 0: 82%|████████▏ | 299/363 [00:22<00:01, 32.90it/s]
Loading 0: 84%|████████▎ | 304/363 [00:23<00:01, 29.76it/s]
Loading 0: 85%|████████▍ | 308/363 [00:23<00:01, 29.10it/s]
Loading 0: 86%|████████▌ | 312/363 [00:23<00:01, 26.17it/s]
Loading 0: 88%|████████▊ | 318/363 [00:23<00:01, 31.97it/s]
Loading 0: 89%|████████▊ | 322/363 [00:23<00:01, 30.92it/s]
Loading 0: 90%|█████████ | 327/363 [00:23<00:01, 34.31it/s]
Loading 0: 91%|█████████ | 331/363 [00:23<00:00, 32.30it/s]
Loading 0: 92%|█████████▏| 335/363 [00:24<00:00, 33.06it/s]
Loading 0: 93%|█████████▎| 339/363 [00:24<00:00, 33.83it/s]
Loading 0: 94%|█████████▍| 343/363 [00:24<00:00, 20.27it/s]
Loading 0: 96%|█████████▌| 348/363 [00:24<00:00, 21.81it/s]
Loading 0: 98%|█████████▊| 355/363 [00:24<00:00, 29.43it/s]
Loading 0: 99%|█████████▉| 359/363 [00:25<00:00, 29.18it/s]
Job chaiml-1107-quang-ir-mi-74661-v1-mkmlizer completed after 175.16s with status: succeeded
Stopping job with name chaiml-1107-quang-ir-mi-74661-v1-mkmlizer
Pipeline stage MKMLizer completed in 175.62s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-1107-quang-ir-mi-74661-v1
Waiting for inference service chaiml-1107-quang-ir-mi-74661-v1 to be ready
Inference service chaiml-1107-quang-ir-mi-74661-v1 ready after 160.67651534080505s
Pipeline stage MKMLDeployer completed in 161.09s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.665316104888916s
Received healthy response to inference request in 2.2473385334014893s
Received healthy response to inference request in 2.089768886566162s
Received healthy response to inference request in 2.0834527015686035s
5 requests
1 failed requests
5th percentile: 2.0847159385681153
10th percentile: 2.085979175567627
20th percentile: 2.0885056495666503
30th percentile: 2.1212828159332275
40th percentile: 2.1843106746673584
50th percentile: 2.2473385334014893
60th percentile: 2.41452956199646
70th percentile: 2.5817205905914307
80th percentile: 6.1560926437377965
90th percentile: 13.137645721435549
95th percentile: 16.62842226028442
99th percentile: 19.421043491363523
mean time: 5.841015005111695
%s, retrying in %s seconds...
Received healthy response to inference request in 2.1863977909088135s
Received healthy response to inference request in 2.2312510013580322s
Received healthy response to inference request in 2.168990135192871s
Received healthy response to inference request in 2.2979555130004883s
Received healthy response to inference request in 2.3046517372131348s
5 requests
0 failed requests
5th percentile: 2.1724716663360595
10th percentile: 2.175953197479248
20th percentile: 2.182916259765625
30th percentile: 2.195368432998657
40th percentile: 2.213309717178345
50th percentile: 2.2312510013580322
60th percentile: 2.2579328060150146
70th percentile: 2.284614610671997
80th percentile: 2.2992947578430174
90th percentile: 2.301973247528076
95th percentile: 2.3033124923706056
99th percentile: 2.304383888244629
mean time: 2.237849235534668
Pipeline stage StressChecker completed in 42.58s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.64s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 1.07s
Shutdown handler de-registered
chaiml-1107-quang-ir-mi_74661_v1 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2971.13s
Shutdown handler de-registered
chaiml-1107-quang-ir-mi_74661_v1 status is now inactive due to auto deactivation removed underperforming models
chaiml-1107-quang-ir-mi_74661_v1 status is now torndown due to DeploymentManager action