Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name rirv938-prefgrok-cp312-76270-v1-mkmlizer
Waiting for job on rirv938-prefgrok-cp312-76270-v1-mkmlizer to finish
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ Version: 0.29.15 ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ https://mk1.ai ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ The license key for the current software has been verified as ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ belonging to: ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ Chai Research Corp. ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ║ ║
rirv938-prefgrok-cp312-76270-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
rirv938-prefgrok-cp312-76270-v1-mkmlizer: Downloaded to shared memory in 160.169s
rirv938-prefgrok-cp312-76270-v1-mkmlizer: Checking if rirv938/prefgrok_cp312_98ff_b35_r1_reformat_merged already exists in ChaiML
rirv938-prefgrok-cp312-76270-v1-mkmlizer: Creating repo ChaiML/prefgrok_cp312_98ff_b35_r1_reformat_merged and uploading /tmp/tmpenykn2xp to it
rirv938-prefgrok-cp312-76270-v1-mkmlizer:
0%| | 0/22 [00:00<?, ?it/s]
5%|▍ | 1/22 [00:06<02:18, 6.60s/it]
9%|▉ | 2/22 [00:14<02:31, 7.57s/it]
14%|█▎ | 3/22 [00:22<02:28, 7.81s/it]
18%|█▊ | 4/22 [00:27<01:59, 6.61s/it]
23%|██▎ | 5/22 [00:32<01:40, 5.91s/it]
27%|██▋ | 6/22 [00:40<01:47, 6.73s/it]
32%|███▏ | 7/22 [00:47<01:41, 6.77s/it]
36%|███▋ | 8/22 [00:55<01:40, 7.20s/it]
41%|████ | 9/22 [01:00<01:22, 6.33s/it]
45%|████▌ | 10/22 [01:08<01:23, 6.93s/it]
50%|█████ | 11/22 [01:16<01:19, 7.26s/it]
55%|█████▍ | 12/22 [01:20<01:02, 6.27s/it]
59%|█████▉ | 13/22 [01:38<01:29, 9.93s/it]
64%|██████▎ | 14/22 [01:42<01:03, 7.99s/it]
68%|██████▊ | 15/22 [01:50<00:55, 7.95s/it]
73%|███████▎ | 16/22 [01:53<00:40, 6.72s/it]
77%|███████▋ | 17/22 [02:02<00:35, 7.17s/it]
82%|████████▏ | 18/22 [02:09<00:28, 7.11s/it]
86%|████████▋ | 19/22 [02:12<00:18, 6.04s/it]
91%|█████████ | 20/22 [02:19<00:12, 6.27s/it]
95%|█████████▌| 21/22 [02:25<00:06, 6.12s/it]
100%|██████████| 22/22 [02:26<00:00, 4.68s/it]
100%|██████████| 22/22 [02:26<00:00, 6.66s/it]
rirv938-prefgrok-cp312-76270-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpenykn2xp, device:0
rirv938-prefgrok-cp312-76270-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
rirv938-prefgrok-cp312-76270-v1-mkmlizer: quantized model in 58.723s
rirv938-prefgrok-cp312-76270-v1-mkmlizer: Processed model rirv938/prefgrok_cp312_98ff_b35_r1_reformat_merged in 458.825s
rirv938-prefgrok-cp312-76270-v1-mkmlizer: creating bucket guanaco-mkml-models
rirv938-prefgrok-cp312-76270-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
rirv938-prefgrok-cp312-76270-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/rirv938-prefgrok-cp312-76270-v1/nvidia
rirv938-prefgrok-cp312-76270-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/rirv938-prefgrok-cp312-76270-v1/nvidia/config.json
rirv938-prefgrok-cp312-76270-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/rirv938-prefgrok-cp312-76270-v1/nvidia/special_tokens_map.json
rirv938-prefgrok-cp312-76270-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/rirv938-prefgrok-cp312-76270-v1/nvidia/tokenizer_config.json
rirv938-prefgrok-cp312-76270-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/rirv938-prefgrok-cp312-76270-v1/nvidia/tokenizer.json
rirv938-prefgrok-cp312-76270-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/rirv938-prefgrok-cp312-76270-v1/nvidia/flywheel_model.1.safetensors
rirv938-prefgrok-cp312-76270-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/rirv938-prefgrok-cp312-76270-v1/nvidia/flywheel_model.0.safetensors
rirv938-prefgrok-cp312-76270-v1-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%| | 3/363 [00:00<00:12, 28.90it/s]
Loading 0: 2%|▏ | 6/363 [00:00<00:25, 14.04it/s]
Loading 0: 3%|▎ | 10/363 [00:00<00:16, 21.14it/s]
Loading 0: 4%|▎ | 13/363 [00:00<00:25, 13.48it/s]
Loading 0: 4%|▍ | 15/363 [00:01<00:31, 11.19it/s]
Loading 0: 6%|▌ | 21/363 [00:01<00:21, 16.14it/s]
Loading 0: 6%|▋ | 23/363 [00:01<00:26, 13.05it/s]
Loading 0: 8%|▊ | 28/363 [00:01<00:18, 18.49it/s]
Loading 0: 9%|▉ | 32/363 [00:01<00:15, 21.80it/s]
Loading 0: 10%|▉ | 35/363 [00:02<00:22, 14.76it/s]
Loading 0: 10%|█ | 38/363 [00:02<00:21, 15.29it/s]
Loading 0: 11%|█▏ | 41/363 [00:02<00:26, 12.12it/s]
Loading 0: 13%|█▎ | 46/363 [00:02<00:18, 16.95it/s]
Loading 0: 14%|█▍ | 50/363 [00:03<00:15, 20.08it/s]
Loading 0: 15%|█▍ | 53/363 [00:03<00:21, 14.38it/s]
Loading 0: 15%|█▌ | 56/363 [00:03<00:20, 15.19it/s]
Loading 0: 16%|█▋ | 59/363 [00:03<00:25, 12.13it/s]
Loading 0: 18%|█▊ | 64/363 [00:04<00:17, 16.82it/s]
Loading 0: 19%|█▊ | 68/363 [00:04<00:14, 19.98it/s]
Loading 0: 20%|█▉ | 71/363 [00:04<00:19, 14.65it/s]
Loading 0: 20%|██ | 74/363 [00:04<00:19, 15.21it/s]
Loading 0: 21%|██ | 77/363 [00:05<00:23, 12.20it/s]
Loading 0: 23%|██▎ | 82/363 [00:05<00:16, 17.12it/s]
Loading 0: 24%|██▎ | 86/363 [00:05<00:13, 20.43it/s]
Loading 0: 25%|██▍ | 89/363 [00:05<00:18, 14.76it/s]
Loading 0: 25%|██▌ | 92/363 [00:05<00:17, 15.25it/s]
Loading 0: 26%|██▋ | 96/363 [00:06<00:14, 18.97it/s]
Loading 0: 27%|██▋ | 99/363 [00:06<00:12, 20.88it/s]
Loading 0: 28%|██▊ | 102/363 [00:06<00:13, 19.04it/s]
Loading 0: 29%|██▉ | 105/363 [00:06<00:17, 14.40it/s]
Loading 0: 29%|██▉ | 107/363 [00:06<00:19, 13.10it/s]
Loading 0: 30%|███ | 109/363 [00:07<00:19, 12.75it/s]
Loading 0: 31%|███ | 111/363 [00:07<00:18, 13.55it/s]
Loading 0: 31%|███ | 113/363 [00:07<00:22, 11.11it/s]
Loading 0: 33%|███▎ | 118/363 [00:07<00:14, 17.48it/s]
Loading 0: 34%|███▎ | 122/363 [00:07<00:11, 21.40it/s]
Loading 0: 34%|███▍ | 125/363 [00:08<00:16, 14.59it/s]
Loading 0: 35%|███▌ | 128/363 [00:08<00:15, 15.52it/s]
Loading 0: 36%|███▌ | 131/363 [00:08<00:18, 12.34it/s]
Loading 0: 37%|███▋ | 136/363 [00:08<00:13, 17.31it/s]
Loading 0: 39%|███▊ | 140/363 [00:08<00:10, 20.54it/s]
Loading 0: 39%|███▉ | 143/363 [00:09<00:14, 14.91it/s]
Loading 0: 40%|████ | 146/363 [00:09<00:13, 15.71it/s]
Loading 0: 41%|████ | 149/363 [00:09<00:16, 12.71it/s]
Loading 0: 42%|████▏ | 154/363 [00:09<00:11, 17.66it/s]
Loading 0: 44%|████▎ | 158/363 [00:09<00:09, 20.84it/s]
Loading 0: 44%|████▍ | 161/363 [00:10<00:13, 15.16it/s]
Loading 0: 45%|████▌ | 164/363 [00:10<00:12, 15.87it/s]
Loading 0: 46%|████▌ | 167/363 [00:10<00:15, 12.42it/s]
Loading 0: 47%|████▋ | 172/363 [00:10<00:11, 17.23it/s]
Loading 0: 48%|████▊ | 176/363 [00:11<00:09, 20.28it/s]
Loading 0: 49%|████▉ | 179/363 [00:11<00:12, 14.82it/s]
Loading 0: 50%|█████ | 182/363 [00:11<00:11, 15.50it/s]
Loading 0: 51%|█████ | 185/363 [00:11<00:14, 12.38it/s]
Loading 0: 52%|█████▏ | 190/363 [00:12<00:10, 17.10it/s]
Loading 0: 53%|█████▎ | 194/363 [00:12<00:08, 19.71it/s]
Loading 0: 54%|█████▍ | 197/363 [00:12<00:11, 14.23it/s]
Loading 0: 55%|█████▌ | 200/363 [00:12<00:10, 15.04it/s]
Loading 0: 56%|█████▌ | 202/363 [00:28<04:15, 1.59s/it]
Loading 0: 56%|█████▌ | 203/363 [00:28<03:45, 1.41s/it]
Loading 0: 57%|█████▋ | 207/363 [00:28<02:09, 1.20it/s]
Loading 0: 58%|█████▊ | 210/363 [00:28<01:30, 1.70it/s]
Loading 0: 59%|█████▊ | 213/363 [00:28<01:05, 2.28it/s]
Loading 0: 59%|█████▉ | 215/363 [00:28<00:53, 2.77it/s]
Loading 0: 60%|█████▉ | 217/363 [00:29<00:42, 3.42it/s]
Loading 0: 60%|██████ | 219/363 [00:29<00:33, 4.33it/s]
Loading 0: 61%|██████ | 221/363 [00:29<00:28, 4.97it/s]
Loading 0: 62%|██████▏ | 226/363 [00:29<00:15, 8.82it/s]
Loading 0: 63%|██████▎ | 230/363 [00:29<00:10, 12.15it/s]
Loading 0: 64%|██████▍ | 233/363 [00:30<00:11, 10.99it/s]
Loading 0: 65%|██████▌ | 236/363 [00:30<00:10, 12.47it/s]
Loading 0: 66%|██████▌ | 239/363 [00:30<00:11, 10.93it/s]
Loading 0: 67%|██████▋ | 244/363 [00:30<00:07, 15.61it/s]
Loading 0: 68%|██████▊ | 248/363 [00:30<00:06, 18.76it/s]
Loading 0: 69%|██████▉ | 251/363 [00:31<00:08, 13.88it/s]
Loading 0: 70%|██████▉ | 254/363 [00:31<00:07, 14.80it/s]
Loading 0: 71%|███████ | 257/363 [00:31<00:08, 11.98it/s]
Loading 0: 72%|███████▏ | 262/363 [00:31<00:06, 16.59it/s]
Loading 0: 73%|███████▎ | 266/363 [00:31<00:04, 19.72it/s]
Loading 0: 74%|███████▍ | 269/363 [00:32<00:06, 14.36it/s]
Loading 0: 75%|███████▍ | 272/363 [00:32<00:06, 15.07it/s]
Loading 0: 75%|███████▌ | 274/363 [00:32<00:06, 13.58it/s]
Loading 0: 76%|███████▌ | 276/363 [00:32<00:06, 13.12it/s]
Loading 0: 77%|███████▋ | 280/363 [00:32<00:04, 17.17it/s]
Loading 0: 78%|███████▊ | 284/363 [00:33<00:03, 20.68it/s]
Loading 0: 79%|███████▉ | 287/363 [00:33<00:05, 14.48it/s]
Loading 0: 80%|███████▉ | 289/363 [00:33<00:05, 13.63it/s]
Loading 0: 80%|████████ | 291/363 [00:33<00:04, 14.58it/s]
Loading 0: 81%|████████ | 293/363 [00:34<00:05, 11.94it/s]
Loading 0: 82%|████████▏ | 298/363 [00:34<00:03, 17.86it/s]
Loading 0: 83%|████████▎ | 302/363 [00:34<00:02, 21.39it/s]
Loading 0: 84%|████████▍ | 305/363 [00:34<00:03, 15.43it/s]
Loading 0: 85%|████████▍ | 308/363 [00:34<00:03, 16.13it/s]
Loading 0: 86%|████████▌ | 311/363 [00:35<00:04, 12.45it/s]
Loading 0: 87%|████████▋ | 316/363 [00:35<00:02, 17.50it/s]
Loading 0: 88%|████████▊ | 320/363 [00:35<00:02, 20.86it/s]
Loading 0: 89%|████████▉ | 323/363 [00:35<00:02, 15.10it/s]
Loading 0: 90%|████████▉ | 326/363 [00:35<00:02, 15.81it/s]
Loading 0: 91%|█████████ | 329/363 [00:36<00:02, 12.52it/s]
Loading 0: 92%|█████████▏| 334/363 [00:36<00:01, 17.11it/s]
Loading 0: 93%|█████████▎| 338/363 [00:36<00:01, 20.50it/s]
Loading 0: 94%|█████████▍| 341/363 [00:36<00:01, 15.18it/s]
Loading 0: 95%|█████████▍| 344/363 [00:37<00:01, 15.90it/s]
Loading 0: 96%|█████████▌| 347/363 [00:37<00:01, 12.52it/s]
Loading 0: 97%|█████████▋| 352/363 [00:37<00:00, 17.31it/s]
Loading 0: 98%|█████████▊| 356/363 [00:37<00:00, 20.17it/s]
Loading 0: 99%|█████████▉| 359/363 [00:38<00:00, 11.12it/s]
Loading 0: 100%|█████████▉| 362/363 [00:38<00:00, 11.20it/s]
Job rirv938-prefgrok-cp312-76270-v1-mkmlizer completed after 495.84s with status: succeeded
Stopping job with name rirv938-prefgrok-cp312-76270-v1-mkmlizer
Pipeline stage MKMLizer completed in 496.36s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service rirv938-prefgrok-cp312-76270-v1
Waiting for inference service rirv938-prefgrok-cp312-76270-v1 to be ready
Inference service rirv938-prefgrok-cp312-76270-v1 ready after 191.07095789909363s
Pipeline stage MKMLDeployer completed in 191.51s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.453672409057617s
Received healthy response to inference request in 2.1246337890625s
Received healthy response to inference request in 2.2482149600982666s
Received healthy response to inference request in 2.02565860748291s
5 requests
1 failed requests
5th percentile: 2.045453643798828
10th percentile: 2.0652486801147463
20th percentile: 2.104838752746582
30th percentile: 2.1493500232696534
40th percentile: 2.19878249168396
50th percentile: 2.2482149600982666
60th percentile: 2.330397939682007
70th percentile: 2.4125809192657472
80th percentile: 5.992031717300418
90th percentile: 13.068750333786014
95th percentile: 16.607109642028806
99th percentile: 19.437797088623046
mean time: 5.79952974319458
%s, retrying in %s seconds...
Received healthy response to inference request in 2.0068037509918213s
Received healthy response to inference request in 2.0887176990509033s
Received healthy response to inference request in 2.244619131088257s
Received healthy response to inference request in 1.8819801807403564s
Received healthy response to inference request in 2.2253355979919434s
5 requests
0 failed requests
5th percentile: 1.9069448947906493
10th percentile: 1.9319096088409424
20th percentile: 1.9818390369415284
30th percentile: 2.023186540603638
40th percentile: 2.0559521198272703
50th percentile: 2.0887176990509033
60th percentile: 2.1433648586273195
70th percentile: 2.1980120182037353
80th percentile: 2.229192304611206
90th percentile: 2.2369057178497314
95th percentile: 2.2407624244689943
99th percentile: 2.2438477897644042
mean time: 2.089491271972656
Pipeline stage StressChecker completed in 42.14s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.71s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.67s
Shutdown handler de-registered
rirv938-prefgrok-cp312-_76270_v1 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3162.22s
Shutdown handler de-registered
rirv938-prefgrok-cp312-_76270_v1 status is now inactive due to auto deactivation removed underperforming models
rirv938-prefgrok-cp312-_76270_v1 status is now torndown due to DeploymentManager action