Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer
Waiting for job on chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer to finish
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ Version: 0.29.15 ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ https://mk1.ai ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ belonging to: ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ Chai Research Corp. ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ║ ║
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: Downloaded to shared memory in 77.761s
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: Checking if ChaiML/gy-exp193-sft-gy-datamix-v1-AR-6kchars-asstonly-plus-imend-loss-ep2 already exists in ChaiML
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp1liae31x, device:0
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: quantized model in 47.764s
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: Processed model ChaiML/gy-exp193-sft-gy-datamix-v1-AR-6kchars-asstonly-plus-imend-loss-ep2 in 125.525s
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: creating bucket guanaco-mkml-models
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-gy-exp193-sft-gy-90650-v1/nvidia
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-gy-exp193-sft-gy-90650-v1/nvidia/config.json
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-gy-exp193-sft-gy-90650-v1/nvidia/special_tokens_map.json
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-gy-exp193-sft-gy-90650-v1/nvidia/tokenizer_config.json
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-gy-exp193-sft-gy-90650-v1/nvidia/tokenizer.json
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/chaiml-gy-exp193-sft-gy-90650-v1/nvidia/flywheel_model.1.safetensors
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-gy-exp193-sft-gy-90650-v1/nvidia/flywheel_model.0.safetensors
chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%| | 4/363 [00:00<00:09, 39.60it/s]
Loading 0: 2%|▏ | 8/363 [00:00<00:11, 30.60it/s]
Loading 0: 3%|▎ | 12/363 [00:00<00:10, 33.37it/s]
Loading 0: 4%|▍ | 16/363 [00:00<00:11, 28.96it/s]
Loading 0: 6%|▌ | 21/363 [00:00<00:10, 33.56it/s]
Loading 0: 7%|▋ | 25/363 [00:00<00:11, 29.29it/s]
Loading 0: 9%|▉ | 32/363 [00:00<00:09, 35.51it/s]
Loading 0: 10%|▉ | 36/363 [00:01<00:15, 20.94it/s]
Loading 0: 11%|█ | 40/363 [00:01<00:13, 23.51it/s]
Loading 0: 12%|█▏ | 44/363 [00:01<00:13, 24.09it/s]
Loading 0: 13%|█▎ | 48/363 [00:01<00:12, 26.11it/s]
Loading 0: 14%|█▍ | 52/363 [00:01<00:12, 25.02it/s]
Loading 0: 16%|█▌ | 57/363 [00:02<00:10, 28.20it/s]
Loading 0: 17%|█▋ | 61/363 [00:02<00:11, 26.55it/s]
Loading 0: 18%|█▊ | 65/363 [00:02<00:10, 27.14it/s]
Loading 0: 19%|█▉ | 70/363 [00:02<00:12, 24.15it/s]
Loading 0: 20%|██ | 73/363 [00:02<00:14, 20.41it/s]
Loading 0: 22%|██▏ | 79/363 [00:03<00:11, 25.27it/s]
Loading 0: 23%|██▎ | 82/363 [00:03<00:11, 24.44it/s]
Loading 0: 24%|██▎ | 86/363 [00:03<00:10, 25.72it/s]
Loading 0: 25%|██▍ | 89/363 [00:03<00:10, 25.18it/s]
Loading 0: 25%|██▌ | 92/363 [00:03<00:13, 20.18it/s]
Loading 0: 27%|██▋ | 99/363 [00:03<00:09, 26.70it/s]
Loading 0: 28%|██▊ | 102/363 [00:03<00:10, 24.18it/s]
Loading 0: 29%|██▉ | 107/363 [00:04<00:11, 22.43it/s]
Loading 0: 31%|███ | 111/363 [00:04<00:09, 25.49it/s]
Loading 0: 31%|███▏ | 114/363 [00:04<00:10, 23.13it/s]
Loading 0: 33%|███▎ | 120/363 [00:04<00:08, 27.94it/s]
Loading 0: 34%|███▍ | 123/363 [00:04<00:09, 25.05it/s]
Loading 0: 36%|███▌ | 129/363 [00:04<00:07, 29.43it/s]
Loading 0: 37%|███▋ | 133/363 [00:05<00:08, 27.07it/s]
Loading 0: 38%|███▊ | 138/363 [00:05<00:07, 29.12it/s]
Loading 0: 39%|███▉ | 142/363 [00:05<00:08, 27.62it/s]
Loading 0: 41%|████ | 148/363 [00:05<00:06, 34.30it/s]
Loading 0: 42%|████▏ | 152/363 [00:05<00:08, 23.48it/s]
Loading 0: 43%|████▎ | 155/363 [00:05<00:08, 23.90it/s]
Loading 0: 44%|████▎ | 158/363 [00:06<00:10, 20.38it/s]
Loading 0: 45%|████▌ | 165/363 [00:06<00:07, 27.16it/s]
Loading 0: 47%|████▋ | 169/363 [00:06<00:07, 26.36it/s]
Loading 0: 48%|████▊ | 174/363 [00:06<00:06, 29.21it/s]
Loading 0: 49%|████▉ | 178/363 [00:06<00:06, 27.85it/s]
Loading 0: 50%|█████ | 182/363 [00:06<00:06, 27.87it/s]
Loading 0: 52%|█████▏ | 187/363 [00:07<00:06, 25.36it/s]
Loading 0: 52%|█████▏ | 190/363 [00:07<00:07, 23.01it/s]
Loading 0: 53%|█████▎ | 193/363 [00:07<00:07, 23.73it/s]
Loading 0: 54%|█████▍ | 196/363 [00:07<00:06, 24.02it/s]
Loading 0: 55%|█████▌ | 200/363 [00:22<00:06, 24.02it/s]
Loading 0: 55%|█████▌ | 201/363 [00:22<03:01, 1.12s/it]
Loading 0: 56%|█████▌ | 203/363 [00:22<02:30, 1.07it/s]
Loading 0: 57%|█████▋ | 208/363 [00:22<01:30, 1.72it/s]
Loading 0: 58%|█████▊ | 211/363 [00:22<01:09, 2.18it/s]
Loading 0: 59%|█████▉ | 214/363 [00:22<00:51, 2.87it/s]
Loading 0: 60%|██████ | 218/363 [00:22<00:35, 4.10it/s]
Loading 0: 61%|██████ | 221/363 [00:23<00:27, 5.17it/s]
Loading 0: 62%|██████▏ | 224/363 [00:23<00:22, 6.19it/s]
Loading 0: 63%|██████▎ | 228/363 [00:23<00:15, 8.67it/s]
Loading 0: 64%|██████▎ | 231/363 [00:23<00:13, 10.02it/s]
Loading 0: 65%|██████▌ | 237/363 [00:23<00:08, 14.87it/s]
Loading 0: 66%|██████▌ | 240/363 [00:23<00:07, 15.49it/s]
Loading 0: 68%|██████▊ | 246/363 [00:24<00:05, 20.72it/s]
Loading 0: 69%|██████▉ | 250/363 [00:24<00:05, 21.38it/s]
Loading 0: 70%|███████ | 255/363 [00:24<00:04, 24.76it/s]
Loading 0: 71%|███████▏ | 259/363 [00:24<00:04, 24.38it/s]
Loading 0: 73%|███████▎ | 264/363 [00:24<00:03, 29.21it/s]
Loading 0: 74%|███████▍ | 268/363 [00:24<00:04, 22.85it/s]
Loading 0: 75%|███████▍ | 271/363 [00:25<00:04, 21.42it/s]
Loading 0: 75%|███████▌ | 274/363 [00:25<00:03, 22.58it/s]
Loading 0: 76%|███████▋ | 277/363 [00:25<00:03, 23.39it/s]
Loading 0: 78%|███████▊ | 282/363 [00:25<00:02, 27.06it/s]
Loading 0: 79%|███████▊ | 285/363 [00:25<00:03, 24.36it/s]
Loading 0: 80%|███████▉ | 289/363 [00:25<00:02, 27.61it/s]
Loading 0: 80%|████████ | 292/363 [00:25<00:02, 27.28it/s]
Loading 0: 81%|████████▏ | 295/363 [00:25<00:02, 25.35it/s]
Loading 0: 82%|████████▏ | 299/363 [00:26<00:02, 26.18it/s]
Loading 0: 84%|████████▎ | 304/363 [00:26<00:02, 24.13it/s]
Loading 0: 85%|████████▍ | 307/363 [00:26<00:02, 22.09it/s]
Loading 0: 85%|████████▌ | 310/363 [00:26<00:02, 23.26it/s]
Loading 0: 86%|████████▌ | 313/363 [00:26<00:02, 23.90it/s]
Loading 0: 88%|████████▊ | 318/363 [00:26<00:01, 27.27it/s]
Loading 0: 88%|████████▊ | 321/363 [00:27<00:01, 24.67it/s]
Loading 0: 90%|█████████ | 327/363 [00:27<00:01, 30.03it/s]
Loading 0: 91%|█████████ | 331/363 [00:27<00:01, 28.35it/s]
Loading 0: 92%|█████████▏| 335/363 [00:27<00:00, 28.76it/s]
Loading 0: 93%|█████████▎| 338/363 [00:27<00:00, 27.00it/s]
Loading 0: 94%|█████████▍| 341/363 [00:27<00:01, 15.75it/s]
Loading 0: 96%|█████████▌| 347/363 [00:28<00:00, 21.52it/s]
Loading 0: 96%|█████████▋| 350/363 [00:28<00:00, 22.16it/s]
Loading 0: 98%|█████████▊| 355/363 [00:28<00:00, 25.64it/s]
Loading 0: 99%|█████████▊| 358/363 [00:28<00:00, 23.44it/s]
Job chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer completed after 158.36s with status: succeeded
Stopping job with name chaiml-gy-exp193-sft-gy-90650-v1-mkmlizer
Pipeline stage MKMLizer completed in 158.93s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-gy-exp193-sft-gy-90650-v1
Waiting for inference service chaiml-gy-exp193-sft-gy-90650-v1 to be ready
Failed to get response for submission chaiml-nis-qwen32b-sim_98336_v34: HTTPConnectionPool(host='chaiml-nis-qwen32b-sim-98336-v34-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-nis-qwen32b-sim_98336_v34: HTTPConnectionPool(host='chaiml-nis-qwen32b-sim-98336-v34-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-nis-qwen32b-sim_98336_v34: HTTPConnectionPool(host='chaiml-nis-qwen32b-sim-98336-v34-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Inference service chaiml-gy-exp193-sft-gy-90650-v1 ready after 331.48401069641113s
Pipeline stage MKMLDeployer completed in 331.95s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.7891082763671875s
Received healthy response to inference request in 1.8141698837280273s
Received healthy response to inference request in 1.9035897254943848s
Received healthy response to inference request in 1.8488621711730957s
5 requests
1 failed requests
5th percentile: 1.821108341217041
10th percentile: 1.8280467987060547
20th percentile: 1.841923713684082
30th percentile: 1.8598076820373535
40th percentile: 1.8816987037658692
50th percentile: 1.9035897254943848
60th percentile: 2.2577971458435058
70th percentile: 2.6120045661926268
80th percentile: 6.255879211425784
90th percentile: 13.18942108154297
95th percentile: 16.65619201660156
99th percentile: 19.429608764648435
mean time: 5.69573860168457
%s, retrying in %s seconds...
Received healthy response to inference request in 2.6431491374969482s
Received healthy response to inference request in 2.356041669845581s
Received healthy response to inference request in 2.1967568397521973s
Received healthy response to inference request in 1.916513442993164s
Received healthy response to inference request in 1.8919761180877686s
5 requests
0 failed requests
5th percentile: 1.8968835830688477
10th percentile: 1.9017910480499267
20th percentile: 1.9116059780120849
30th percentile: 1.9725621223449707
40th percentile: 2.084659481048584
50th percentile: 2.1967568397521973
60th percentile: 2.2604707717895507
70th percentile: 2.324184703826904
80th percentile: 2.4134631633758548
90th percentile: 2.5283061504364013
95th percentile: 2.5857276439666745
99th percentile: 2.6316648387908934
mean time: 2.200887441635132
Pipeline stage StressChecker completed in 41.81s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.70s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
Failed to get response for submission chaiml-nis-qwen32b-sim_98336_v34: HTTPConnectionPool(host='chaiml-nis-qwen32b-sim-98336-v34-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.71s
Shutdown handler de-registered
chaiml-gy-exp193-sft-gy_90650_v1 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
clean up pipeline due to error=DeploymentChecksError('None: None')
Shutdown handler de-registered
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 5146.05s
Shutdown handler de-registered
chaiml-gy-exp193-sft-gy_90650_v1 status is now inactive due to auto deactivation removed underperforming models
chaiml-gy-exp193-sft-gy_90650_v1 status is now torndown due to DeploymentManager action