Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name rica40325-fliter65kv1-v3-mkmlizer
Waiting for job on rica40325-fliter65kv1-v3-mkmlizer to finish
rica40325-fliter65kv1-v3-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
rica40325-fliter65kv1-v3-mkmlizer: ║ ║
rica40325-fliter65kv1-v3-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
rica40325-fliter65kv1-v3-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
rica40325-fliter65kv1-v3-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
rica40325-fliter65kv1-v3-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
rica40325-fliter65kv1-v3-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
rica40325-fliter65kv1-v3-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
rica40325-fliter65kv1-v3-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
rica40325-fliter65kv1-v3-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
rica40325-fliter65kv1-v3-mkmlizer: ║ ║
rica40325-fliter65kv1-v3-mkmlizer: ║ Version: 0.27.1+vampire_v3 ║
rica40325-fliter65kv1-v3-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
rica40325-fliter65kv1-v3-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
rica40325-fliter65kv1-v3-mkmlizer: ║ https://mk1.ai ║
rica40325-fliter65kv1-v3-mkmlizer: ║ ║
rica40325-fliter65kv1-v3-mkmlizer: ║ The license key for the current software has been verified as ║
rica40325-fliter65kv1-v3-mkmlizer: ║ belonging to: ║
rica40325-fliter65kv1-v3-mkmlizer: ║ ║
rica40325-fliter65kv1-v3-mkmlizer: ║ Chai Research Corp. ║
rica40325-fliter65kv1-v3-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
rica40325-fliter65kv1-v3-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
rica40325-fliter65kv1-v3-mkmlizer: ║ ║
rica40325-fliter65kv1-v3-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
rica40325-fliter65kv1-v3-mkmlizer: Downloaded to shared memory in 36.350s
rica40325-fliter65kv1-v3-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpyjox_tbs, device:0
rica40325-fliter65kv1-v3-mkmlizer: Saving flywheel model at /dev/shm/model_cache
rica40325-fliter65kv1-v3-mkmlizer: quantized model in 31.158s
rica40325-fliter65kv1-v3-mkmlizer: Processed model rica40325/fliter65kv1 in 67.509s
rica40325-fliter65kv1-v3-mkmlizer: creating bucket guanaco-mkml-models
rica40325-fliter65kv1-v3-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
rica40325-fliter65kv1-v3-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/rica40325-fliter65kv1-v3
rica40325-fliter65kv1-v3-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/rica40325-fliter65kv1-v3/config.json
rica40325-fliter65kv1-v3-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/rica40325-fliter65kv1-v3/special_tokens_map.json
rica40325-fliter65kv1-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/rica40325-fliter65kv1-v3/tokenizer_config.json
rica40325-fliter65kv1-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/rica40325-fliter65kv1-v3/tokenizer.json
rica40325-fliter65kv1-v3-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/rica40325-fliter65kv1-v3/flywheel_model.0.safetensors
rica40325-fliter65kv1-v3-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%|▏ | 5/363 [00:00<00:11, 29.94it/s]
Loading 0: 4%|▎ | 13/363 [00:00<00:07, 48.32it/s]
Loading 0: 5%|▌ | 19/363 [00:00<00:07, 44.32it/s]
Loading 0: 7%|▋ | 24/363 [00:00<00:07, 43.07it/s]
Loading 0: 9%|▊ | 31/363 [00:00<00:06, 49.03it/s]
Loading 0: 10%|█ | 37/363 [00:00<00:07, 44.92it/s]
Loading 0: 12%|█▏ | 42/363 [00:00<00:07, 42.68it/s]
Loading 0: 13%|█▎ | 48/363 [00:01<00:06, 46.00it/s]
Loading 0: 15%|█▍ | 53/363 [00:01<00:06, 44.56it/s]
Loading 0: 16%|█▌ | 58/363 [00:01<00:06, 45.79it/s]
Loading 0: 17%|█▋ | 63/363 [00:01<00:09, 30.64it/s]
Loading 0: 18%|█▊ | 67/363 [00:01<00:09, 32.53it/s]
Loading 0: 20%|█▉ | 72/363 [00:01<00:08, 35.08it/s]
Loading 0: 21%|██ | 76/363 [00:01<00:08, 35.01it/s]
Loading 0: 22%|██▏ | 81/363 [00:02<00:07, 37.12it/s]
Loading 0: 23%|██▎ | 85/363 [00:02<00:07, 36.41it/s]
Loading 0: 25%|██▍ | 89/363 [00:02<00:07, 36.86it/s]
Loading 0: 26%|██▌ | 93/363 [00:02<00:07, 35.59it/s]
Loading 0: 27%|██▋ | 98/363 [00:02<00:06, 38.84it/s]
Loading 0: 28%|██▊ | 102/363 [00:02<00:07, 36.89it/s]
Loading 0: 29%|██▉ | 106/363 [00:02<00:06, 37.67it/s]
Loading 0: 31%|███ | 112/363 [00:02<00:06, 40.99it/s]
Loading 0: 32%|███▏ | 117/363 [00:02<00:06, 38.92it/s]
Loading 0: 33%|███▎ | 121/363 [00:03<00:06, 38.76it/s]
Loading 0: 34%|███▍ | 125/363 [00:03<00:06, 39.03it/s]
Loading 0: 36%|███▌ | 129/363 [00:03<00:06, 37.08it/s]
Loading 0: 37%|███▋ | 134/363 [00:03<00:05, 39.51it/s]
Loading 0: 38%|███▊ | 138/363 [00:03<00:05, 37.74it/s]
Loading 0: 39%|███▉ | 142/363 [00:03<00:08, 25.14it/s]
Loading 0: 40%|████ | 146/363 [00:03<00:07, 27.40it/s]
Loading 0: 41%|████▏ | 150/363 [00:04<00:07, 27.92it/s]
Loading 0: 43%|████▎ | 156/363 [00:04<00:06, 33.54it/s]
Loading 0: 44%|████▍ | 160/363 [00:04<00:06, 32.95it/s]
Loading 0: 45%|████▌ | 165/363 [00:04<00:05, 36.68it/s]
Loading 0: 47%|████▋ | 169/363 [00:04<00:05, 35.86it/s]
Loading 0: 48%|████▊ | 174/363 [00:04<00:05, 37.73it/s]
Loading 0: 49%|████▉ | 178/363 [00:04<00:05, 35.66it/s]
Loading 0: 50%|█████ | 183/363 [00:04<00:04, 37.88it/s]
Loading 0: 52%|█████▏ | 187/363 [00:05<00:04, 35.63it/s]
Loading 0: 53%|█████▎ | 192/363 [00:05<00:04, 36.75it/s]
Loading 0: 54%|█████▍ | 196/363 [00:05<00:04, 34.86it/s]
Loading 0: 55%|█████▌ | 201/363 [00:05<00:04, 37.22it/s]
Loading 0: 56%|█████▋ | 205/363 [00:05<00:04, 35.47it/s]
Loading 0: 58%|█████▊ | 210/363 [00:05<00:04, 36.51it/s]
Loading 0: 59%|█████▉ | 214/363 [00:05<00:04, 35.09it/s]
Loading 0: 60%|██████ | 218/363 [00:05<00:04, 35.72it/s]
Loading 0: 61%|██████▏ | 223/363 [00:06<00:05, 26.98it/s]
Loading 0: 63%|██████▎ | 227/363 [00:06<00:04, 28.60it/s]
Loading 0: 64%|██████▎ | 231/363 [00:06<00:04, 28.64it/s]
Loading 0: 65%|██████▌ | 237/363 [00:06<00:03, 34.03it/s]
Loading 0: 66%|██████▋ | 241/363 [00:06<00:03, 33.18it/s]
Loading 0: 68%|██████▊ | 246/363 [00:06<00:03, 35.99it/s]
Loading 0: 69%|██████▉ | 250/363 [00:06<00:03, 35.00it/s]
Loading 0: 70%|██████▉ | 254/363 [00:07<00:03, 33.48it/s]
Loading 0: 71%|███████ | 258/363 [00:07<00:03, 29.76it/s]
Loading 0: 73%|███████▎ | 264/363 [00:07<00:02, 35.75it/s]
Loading 0: 74%|███████▍ | 268/363 [00:07<00:02, 35.31it/s]
Loading 0: 75%|███████▌ | 273/363 [00:07<00:02, 38.15it/s]
Loading 0: 76%|███████▋ | 277/363 [00:07<00:02, 36.28it/s]
Loading 0: 78%|███████▊ | 282/363 [00:07<00:02, 39.15it/s]
Loading 0: 79%|███████▉ | 287/363 [00:07<00:01, 39.43it/s]
Loading 0: 80%|████████ | 292/363 [00:08<00:01, 40.37it/s]
Loading 0: 82%|████████▏ | 297/363 [00:08<00:01, 41.96it/s]
Loading 0: 83%|████████▎ | 302/363 [00:08<00:01, 42.74it/s]
Loading 0: 85%|████████▍ | 307/363 [00:08<00:02, 19.95it/s]
Loading 0: 86%|████████▌ | 312/363 [00:08<00:02, 22.58it/s]
Loading 0: 88%|████████▊ | 320/363 [00:09<00:01, 31.50it/s]
Loading 0: 90%|████████▉ | 326/363 [00:09<00:01, 33.80it/s]
Loading 0: 91%|█████████ | 331/363 [00:09<00:00, 35.59it/s]
Loading 0: 93%|█████████▎| 338/363 [00:09<00:00, 41.79it/s]
Loading 0: 95%|█████████▍| 344/363 [00:09<00:00, 40.20it/s]
Loading 0: 96%|█████████▌| 349/363 [00:09<00:00, 39.49it/s]
Loading 0: 98%|█████████▊| 356/363 [00:09<00:00, 44.35it/s]
Loading 0: 100%|█████████▉| 362/363 [00:10<00:00, 43.36it/s]
Job rica40325-fliter65kv1-v3-mkmlizer completed after 95.09s with status: succeeded
Stopping job with name rica40325-fliter65kv1-v3-mkmlizer
Pipeline stage MKMLizer completed in 95.65s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service rica40325-fliter65kv1-v3
Waiting for inference service rica40325-fliter65kv1-v3 to be ready
Inference service rica40325-fliter65kv1-v3 ready after 110.47611856460571s
Pipeline stage MKMLDeployer completed in 111.06s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.3893771171569824s
Received healthy response to inference request in 1.6751854419708252s
Received healthy response to inference request in 1.7430613040924072s
Received healthy response to inference request in 1.5749402046203613s
Received healthy response to inference request in 2.0334417819976807s
5 requests
0 failed requests
5th percentile: 1.594989252090454
10th percentile: 1.6150382995605468
20th percentile: 1.6551363945007325
30th percentile: 1.6887606143951417
40th percentile: 1.7159109592437745
50th percentile: 1.7430613040924072
60th percentile: 1.8592134952545165
70th percentile: 1.975365686416626
80th percentile: 2.1046288490295413
90th percentile: 2.2470029830932616
95th percentile: 2.318190050125122
99th percentile: 2.37513970375061
mean time: 1.8832011699676514
Pipeline stage StressChecker completed in 10.83s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.73s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.75s
Shutdown handler de-registered
rica40325-fliter65kv1_v3 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4856.36s
Shutdown handler de-registered
rica40325-fliter65kv1_v3 status is now inactive due to auto deactivation removed underperforming models