Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name rirv938-96p-4ff-rirv938-62112-v1-mkmlizer
Waiting for job on rirv938-96p-4ff-rirv938-62112-v1-mkmlizer to finish
Failed to get response for submission chaiml-sft-gemma2-28b-v_83370_v5: HTTPConnectionPool(host='chaiml-sft-gemma2-28b-v-83370-v5-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: ║ _____ __ __ ║
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: ║ /___/ ║
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: ║ ║
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: ║ Version: 0.12.8 ║
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: ║ https://mk1.ai ║
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: ║ ║
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: ║ The license key for the current software has been verified as ║
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: ║ belonging to: ║
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: ║ ║
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: ║ Chai Research Corp. ║
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: ║ Expiration: 2025-04-15 23:59:59 ║
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: ║ ║
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
Failed to get response for submission chaiml-sft-gemma2-28b-v_83370_v5: HTTPConnectionPool(host='chaiml-sft-gemma2-28b-v-83370-v5-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-sft-gemma2-28b-v_83370_v5: HTTPConnectionPool(host='chaiml-sft-gemma2-28b-v-83370-v5-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: Downloaded to shared memory in 185.352s
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpsknk8ixc, device:0
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: quantized model in 64.824s
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: Processed model rirv938/96p_4ff_rirv938_20k_100p_0ff_ri_19485_v1_cp375_v3 in 250.177s
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: creating bucket guanaco-mkml-models
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/rirv938-96p-4ff-rirv938-62112-v1
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/rirv938-96p-4ff-rirv938-62112-v1/special_tokens_map.json
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/rirv938-96p-4ff-rirv938-62112-v1/config.json
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/rirv938-96p-4ff-rirv938-62112-v1/tokenizer_config.json
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/rirv938-96p-4ff-rirv938-62112-v1/tokenizer.json
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/rirv938-96p-4ff-rirv938-62112-v1/flywheel_model.0.safetensors
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/rirv938-96p-4ff-rirv938-62112-v1/flywheel_model.1.safetensors
rirv938-96p-4ff-rirv938-62112-v1-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%| | 3/363 [00:00<00:12, 29.51it/s]
Loading 0: 2%|▏ | 6/363 [00:00<00:25, 13.93it/s]
Loading 0: 3%|▎ | 11/363 [00:00<00:16, 21.23it/s]
Loading 0: 4%|▍ | 14/363 [00:00<00:25, 13.76it/s]
Loading 0: 4%|▍ | 16/363 [00:01<00:26, 13.12it/s]
Loading 0: 5%|▌ | 19/363 [00:01<00:21, 16.07it/s]
Loading 0: 6%|▌ | 22/363 [00:01<00:22, 15.20it/s]
Loading 0: 7%|▋ | 24/363 [00:01<00:24, 13.97it/s]
Loading 0: 8%|▊ | 28/363 [00:01<00:18, 18.61it/s]
Loading 0: 9%|▉ | 32/363 [00:01<00:14, 22.85it/s]
Loading 0: 10%|▉ | 35/363 [00:02<00:21, 15.09it/s]
Loading 0: 10%|█ | 38/363 [00:02<00:20, 15.77it/s]
Loading 0: 11%|█▏ | 41/363 [00:02<00:26, 12.17it/s]
Loading 0: 13%|█▎ | 46/363 [00:02<00:18, 17.24it/s]
Loading 0: 14%|█▍ | 50/363 [00:02<00:15, 20.79it/s]
Loading 0: 15%|█▍ | 53/363 [00:03<00:20, 14.82it/s]
Loading 0: 15%|█▌ | 56/363 [00:03<00:20, 14.89it/s]
Loading 0: 16%|█▋ | 59/363 [00:03<00:26, 11.65it/s]
Loading 0: 18%|█▊ | 64/363 [00:04<00:18, 16.26it/s]
Loading 0: 19%|█▊ | 68/363 [00:04<00:15, 19.45it/s]
Loading 0: 20%|█▉ | 71/363 [00:04<00:20, 14.44it/s]
Loading 0: 20%|██ | 74/363 [00:04<00:19, 14.85it/s]
Loading 0: 21%|██ | 76/363 [00:04<00:21, 13.32it/s]
Loading 0: 21%|██▏ | 78/363 [00:05<00:22, 12.63it/s]
Loading 0: 23%|██▎ | 82/363 [00:05<00:16, 16.66it/s]
Loading 0: 24%|██▎ | 86/363 [00:05<00:13, 20.30it/s]
Loading 0: 25%|██▍ | 89/363 [00:05<00:19, 14.30it/s]
Loading 0: 25%|██▌ | 91/363 [00:05<00:19, 13.65it/s]
Loading 0: 26%|██▌ | 95/363 [00:06<00:15, 17.64it/s]
Loading 0: 27%|██▋ | 99/363 [00:06<00:12, 21.25it/s]
Loading 0: 28%|██▊ | 102/363 [00:06<00:13, 19.15it/s]
Loading 0: 29%|██▉ | 105/363 [00:06<00:18, 14.26it/s]
Loading 0: 29%|██▉ | 107/363 [00:06<00:19, 13.12it/s]
Loading 0: 30%|███ | 109/363 [00:07<00:20, 12.48it/s]
Loading 0: 31%|███ | 111/363 [00:07<00:18, 13.42it/s]
Loading 0: 31%|███ | 113/363 [00:07<00:22, 10.88it/s]
Loading 0: 33%|███▎ | 118/363 [00:07<00:14, 16.86it/s]
Loading 0: 34%|███▎ | 122/363 [00:07<00:11, 20.67it/s]
Loading 0: 34%|███▍ | 125/363 [00:08<00:16, 14.42it/s]
Loading 0: 35%|███▌ | 128/363 [00:08<00:15, 15.18it/s]
Loading 0: 36%|███▌ | 130/363 [00:08<00:17, 13.39it/s]
Loading 0: 36%|███▋ | 132/363 [00:08<00:17, 12.93it/s]
Loading 0: 37%|███▋ | 136/363 [00:08<00:13, 17.35it/s]
Loading 0: 39%|███▊ | 140/363 [00:08<00:10, 21.41it/s]
Loading 0: 39%|███▉ | 143/363 [00:09<00:14, 14.96it/s]
Loading 0: 40%|████ | 146/363 [00:09<00:13, 15.86it/s]
Loading 0: 41%|████ | 148/363 [00:09<00:15, 13.81it/s]
Loading 0: 41%|████▏ | 150/363 [00:09<00:16, 13.21it/s]
Loading 0: 42%|████▏ | 154/363 [00:09<00:12, 17.41it/s]
Loading 0: 44%|████▎ | 158/363 [00:10<00:09, 20.98it/s]
Loading 0: 44%|████▍ | 161/363 [00:10<00:13, 14.84it/s]
Loading 0: 45%|████▍ | 163/363 [00:10<00:14, 13.92it/s]
Loading 0: 45%|████▌ | 165/363 [00:10<00:13, 14.66it/s]
Loading 0: 46%|████▌ | 167/363 [00:10<00:17, 11.45it/s]
Loading 0: 47%|████▋ | 172/363 [00:11<00:10, 17.41it/s]
Loading 0: 48%|████▊ | 176/363 [00:11<00:08, 21.15it/s]
Loading 0: 49%|████▉ | 179/363 [00:11<00:12, 14.93it/s]
Loading 0: 50%|█████ | 182/363 [00:11<00:11, 15.51it/s]
Loading 0: 51%|█████ | 184/363 [00:11<00:13, 13.65it/s]
Loading 0: 51%|█████ | 186/363 [00:12<00:13, 13.14it/s]
Loading 0: 52%|█████▏ | 190/363 [00:12<00:10, 17.28it/s]
Loading 0: 53%|█████▎ | 194/363 [00:12<00:08, 21.09it/s]
Loading 0: 54%|█████▍ | 197/363 [00:12<00:11, 14.83it/s]
Loading 0: 55%|█████▍ | 199/363 [00:12<00:11, 13.82it/s]
Loading 0: 55%|█████▌ | 200/363 [00:23<00:11, 13.82it/s]
Loading 0: 55%|█████▌ | 201/363 [00:27<04:49, 1.79s/it]
Loading 0: 56%|█████▌ | 203/363 [00:28<03:40, 1.38s/it]
Loading 0: 57%|█████▋ | 207/363 [00:28<02:07, 1.22it/s]
Loading 0: 58%|█████▊ | 210/363 [00:28<01:29, 1.72it/s]
Loading 0: 59%|█████▊ | 213/363 [00:28<01:05, 2.30it/s]
Loading 0: 59%|█████▉ | 215/363 [00:28<00:53, 2.78it/s]
Loading 0: 60%|█████▉ | 217/363 [00:29<00:42, 3.40it/s]
Loading 0: 60%|██████ | 219/363 [00:29<00:33, 4.28it/s]
Loading 0: 61%|██████ | 221/363 [00:29<00:29, 4.83it/s]
Loading 0: 62%|██████▏ | 226/363 [00:29<00:16, 8.55it/s]
Loading 0: 63%|██████▎ | 230/363 [00:29<00:11, 11.69it/s]
Loading 0: 64%|██████▍ | 233/363 [00:30<00:12, 10.50it/s]
Loading 0: 65%|██████▌ | 236/363 [00:30<00:10, 11.65it/s]
Loading 0: 66%|██████▌ | 238/363 [00:30<00:11, 11.05it/s]
Loading 0: 66%|██████▌ | 240/363 [00:30<00:10, 11.26it/s]
Loading 0: 67%|██████▋ | 244/363 [00:30<00:07, 15.35it/s]
Loading 0: 68%|██████▊ | 248/363 [00:30<00:05, 19.31it/s]
Loading 0: 69%|██████▉ | 251/363 [00:31<00:07, 14.02it/s]
Loading 0: 70%|██████▉ | 253/363 [00:31<00:08, 12.66it/s]
Loading 0: 70%|███████ | 255/363 [00:31<00:07, 13.53it/s]
Loading 0: 71%|███████ | 257/363 [00:31<00:09, 11.22it/s]
Loading 0: 72%|███████▏ | 262/363 [00:31<00:05, 17.22it/s]
Loading 0: 73%|███████▎ | 266/363 [00:32<00:04, 21.02it/s]
Loading 0: 74%|███████▍ | 269/363 [00:32<00:06, 14.86it/s]
Loading 0: 75%|███████▍ | 272/363 [00:32<00:05, 15.60it/s]
Loading 0: 76%|███████▌ | 275/363 [00:32<00:07, 12.40it/s]
Loading 0: 77%|███████▋ | 280/363 [00:33<00:04, 17.55it/s]
Loading 0: 78%|███████▊ | 284/363 [00:33<00:03, 20.92it/s]
Loading 0: 79%|███████▉ | 287/363 [00:33<00:04, 15.60it/s]
Loading 0: 80%|███████▉ | 290/363 [00:33<00:04, 15.97it/s]
Loading 0: 81%|████████ | 293/363 [00:34<00:05, 12.56it/s]
Loading 0: 82%|████████▏ | 298/363 [00:34<00:03, 17.38it/s]
Loading 0: 83%|████████▎ | 302/363 [00:34<00:02, 20.64it/s]
Loading 0: 84%|████████▍ | 305/363 [00:34<00:03, 15.08it/s]
Loading 0: 85%|████████▍ | 308/363 [00:34<00:03, 15.63it/s]
Loading 0: 86%|████████▌ | 311/363 [00:35<00:04, 12.19it/s]
Loading 0: 87%|████████▋ | 316/363 [00:35<00:02, 17.04it/s]
Loading 0: 88%|████████▊ | 320/363 [00:35<00:02, 20.42it/s]
Loading 0: 89%|████████▉ | 323/363 [00:35<00:02, 14.95it/s]
Loading 0: 90%|████████▉ | 326/363 [00:35<00:02, 15.59it/s]
Loading 0: 91%|█████████ | 329/363 [00:36<00:02, 11.81it/s]
Loading 0: 92%|█████████▏| 334/363 [00:36<00:01, 16.53it/s]
Loading 0: 93%|█████████▎| 338/363 [00:36<00:01, 19.76it/s]
Loading 0: 94%|█████████▍| 341/363 [00:36<00:01, 14.61it/s]
Loading 0: 95%|█████████▍| 344/363 [00:37<00:01, 15.14it/s]
Loading 0: 96%|█████████▌| 347/363 [00:37<00:01, 11.85it/s]
Loading 0: 97%|█████████▋| 352/363 [00:37<00:00, 16.43it/s]
Loading 0: 98%|█████████▊| 356/363 [00:37<00:00, 19.50it/s]
Loading 0: 99%|█████████▉| 359/363 [00:44<00:02, 1.58it/s]
Loading 0: 99%|█████████▉| 361/363 [00:45<00:01, 1.86it/s]
Job rirv938-96p-4ff-rirv938-62112-v1-mkmlizer completed after 288.74s with status: succeeded
Stopping job with name rirv938-96p-4ff-rirv938-62112-v1-mkmlizer
Pipeline stage MKMLizer completed in 289.20s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.16s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service rirv938-96p-4ff-rirv938-62112-v1
Waiting for inference service rirv938-96p-4ff-rirv938-62112-v1 to be ready
Failed to get response for submission chaiml-sft-gemma2-28b-v_83370_v5: HTTPConnectionPool(host='chaiml-sft-gemma2-28b-v-83370-v5-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Failed to get response for submission chaiml-sft-gemma2-28b-v_83370_v5: HTTPConnectionPool(host='chaiml-sft-gemma2-28b-v-83370-v5-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Retrying (%r) after connection broken by '%r': %s
Retrying (%r) after connection broken by '%r': %s
Retrying (%r) after connection broken by '%r': %s
Retrying (%r) after connection broken by '%r': %s
Retrying (%r) after connection broken by '%r': %s
Unable to obtain autoscaler blend_gakas_2025-03-20 due to error: Failed to establish a connection: HTTPSConnectionPool(host='guanaco-autoscalers.firebaseio.com', port=443): Max retries exceeded with url: /autoscalers/blend_gakas_2025-03-20/actions.json (Caused by ProtocolError('Connection aborted.', RemoteDisconnected('Remote end closed connection without response'))). Falling back to minimal defaults: logs=None autoscaler_id='fallback' status='deployed' metric=<function null_metric at 0x7619880be200> observations=[] max_observations_length=1000 target=0.0 actions=[SubmissionAutoscalerParameterAction(generation_parameter='max_input_tokens', current_scale=128, min_scale=512, max_scale=2048)] scale_up_policy=SubmissionAutoscalerPolicy(max_percent_change=0.25, period=0.1, stabilisation_window=0, tolerance=0.1) scale_down_policy=SubmissionAutoscalerPolicy(max_percent_change=0.25, period=0.1, stabilisation_window=0, tolerance=0.1) panic_policy=SubmissionAutoscalerPanicPolicy(max_percent_change=1.0, period=0.1, stabilisation_window=0, tolerance=-1e-06, z_score=3, num_observations=5, num_historical_observations=100)
Retrying (%r) after connection broken by '%r': %s
Inference service rirv938-96p-4ff-rirv938-62112-v1 ready after 210.8758602142334s
Pipeline stage MKMLDeployer completed in 211.36s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.0162765979766846s
Received healthy response to inference request in 2.5766263008117676s
Retrying (%r) after connection broken by '%r': %s
Retrying (%r) after connection broken by '%r': %s
Received healthy response to inference request in 2.6379756927490234s
Retrying (%r) after connection broken by '%r': %s
Received healthy response to inference request in 2.7251741886138916s
Retrying (%r) after connection broken by '%r': %s
Received healthy response to inference request in 2.6308016777038574s
5 requests
0 failed requests
5th percentile: 2.5874613761901855
10th percentile: 2.5982964515686033
20th percentile: 2.6199666023254395
30th percentile: 2.6322364807128906
40th percentile: 2.635106086730957
50th percentile: 2.6379756927490234
60th percentile: 2.672855091094971
70th percentile: 2.707734489440918
80th percentile: 2.7833946704864503
90th percentile: 2.899835634231567
95th percentile: 2.958056116104126
99th percentile: 3.0046325016021727
mean time: 2.717370891571045
Pipeline stage StressChecker completed in 15.00s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.74s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.70s
Shutdown handler de-registered
rirv938-96p-4ff-rirv938_62112_v1 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Skipping teardown as no inference service was successfully deployed
Pipeline stage MKMLProfilerDeleter completed in 0.12s
run pipeline stage %s
Running pipeline stage MKMLProfilerTemplater
Pipeline stage MKMLProfilerTemplater completed in 0.10s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeployer
Creating inference service rirv938-96p-4ff-rirv938-62112-v1-profiler
Waiting for inference service rirv938-96p-4ff-rirv938-62112-v1-profiler to be ready
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2978.32s
Shutdown handler de-registered
rirv938-96p-4ff-rirv938_62112_v1 status is now inactive due to auto deactivation removed underperforming models
rirv938-96p-4ff-rirv938_62112_v1 status is now torndown due to DeploymentManager action