Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name mistralai-mistral-nemo-9330-v226-mkmlizer
Waiting for job on mistralai-mistral-nemo-9330-v226-mkmlizer to finish
mistralai-mistral-nemo-9330-v226-mkmlizer: Downloaded to shared memory in 65.727s
mistralai-mistral-nemo-9330-v226-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpdufko2qe, device:0
mistralai-mistral-nemo-9330-v226-mkmlizer: Saving flywheel model at /dev/shm/model_cache
mistralai-mistral-nemo-9330-v226-mkmlizer: quantized model in 37.389s
mistralai-mistral-nemo-9330-v226-mkmlizer: Processed model mistralai/Mistral-Nemo-Instruct-2407 in 103.117s
mistralai-mistral-nemo-9330-v226-mkmlizer: creating bucket guanaco-mkml-models
mistralai-mistral-nemo-9330-v226-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
mistralai-mistral-nemo-9330-v226-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v226
mistralai-mistral-nemo-9330-v226-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v226/config.json
mistralai-mistral-nemo-9330-v226-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v226/special_tokens_map.json
mistralai-mistral-nemo-9330-v226-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v226/tokenizer_config.json
mistralai-mistral-nemo-9330-v226-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v226/tokenizer.json
mistralai-mistral-nemo-9330-v226-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/mistralai-mistral-nemo-9330-v226/flywheel_model.0.safetensors
mistralai-mistral-nemo-9330-v226-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%|▏ | 5/363 [00:00<00:11, 30.20it/s]
Loading 0: 4%|▎ | 13/363 [00:00<00:06, 52.82it/s]
Loading 0: 5%|▌ | 19/363 [00:00<00:07, 48.65it/s]
Loading 0: 7%|▋ | 25/363 [00:00<00:06, 51.19it/s]
Loading 0: 9%|▊ | 31/363 [00:00<00:06, 53.90it/s]
Loading 0: 10%|█ | 37/363 [00:00<00:07, 44.57it/s]
Loading 0: 12%|█▏ | 42/363 [00:00<00:07, 40.66it/s]
Loading 0: 13%|█▎ | 48/363 [00:01<00:06, 45.36it/s]
Loading 0: 15%|█▍ | 53/363 [00:01<00:06, 45.44it/s]
Loading 0: 17%|█▋ | 60/363 [00:01<00:06, 45.58it/s]
Loading 0: 18%|█▊ | 65/363 [00:01<00:09, 31.72it/s]
Loading 0: 20%|█▉ | 72/363 [00:01<00:07, 38.39it/s]
Loading 0: 21%|██▏ | 78/363 [00:01<00:07, 38.68it/s]
Loading 0: 23%|██▎ | 83/363 [00:02<00:07, 37.54it/s]
Loading 0: 25%|██▍ | 89/363 [00:02<00:06, 41.03it/s]
Loading 0: 26%|██▌ | 94/363 [00:02<00:06, 40.19it/s]
Loading 0: 27%|██▋ | 99/363 [00:02<00:06, 40.54it/s]
Loading 0: 29%|██▊ | 104/363 [00:02<00:06, 42.32it/s]
Loading 0: 30%|███ | 109/363 [00:02<00:05, 43.72it/s]
Loading 0: 31%|███▏ | 114/363 [00:02<00:06, 35.96it/s]
Loading 0: 33%|███▎ | 118/363 [00:02<00:07, 33.58it/s]
Loading 0: 34%|███▍ | 125/363 [00:03<00:05, 40.38it/s]
Loading 0: 36%|███▌ | 130/363 [00:03<00:05, 39.46it/s]
Loading 0: 37%|███▋ | 135/363 [00:03<00:05, 39.33it/s]
Loading 0: 39%|███▊ | 140/363 [00:03<00:05, 41.11it/s]
Loading 0: 40%|███▉ | 145/363 [00:03<00:08, 25.98it/s]
Loading 0: 41%|████ | 149/363 [00:03<00:07, 27.10it/s]
Loading 0: 43%|████▎ | 156/363 [00:04<00:05, 35.04it/s]
Loading 0: 44%|████▍ | 161/363 [00:04<00:05, 36.18it/s]
Loading 0: 46%|████▌ | 166/363 [00:04<00:05, 38.76it/s]
Loading 0: 47%|████▋ | 172/363 [00:04<00:05, 37.88it/s]
Loading 0: 49%|████▉ | 177/363 [00:04<00:04, 37.65it/s]
Loading 0: 50%|█████ | 183/363 [00:04<00:04, 41.12it/s]
Loading 0: 52%|█████▏ | 188/363 [00:04<00:04, 40.10it/s]
Loading 0: 53%|█████▎ | 193/363 [00:04<00:04, 38.73it/s]
Loading 0: 54%|█████▍ | 197/363 [00:05<00:04, 37.51it/s]
Loading 0: 55%|█████▌ | 201/363 [00:05<00:04, 36.77it/s]
Loading 0: 56%|█████▋ | 205/363 [00:05<00:04, 35.29it/s]
Loading 0: 58%|█████▊ | 210/363 [00:05<00:04, 37.77it/s]
Loading 0: 59%|█████▉ | 214/363 [00:05<00:04, 36.47it/s]
Loading 0: 60%|██████ | 218/363 [00:05<00:03, 36.92it/s]
Loading 0: 61%|██████▏ | 223/363 [00:05<00:04, 29.17it/s]
Loading 0: 63%|██████▎ | 227/363 [00:05<00:04, 30.52it/s]
Loading 0: 64%|██████▎ | 231/363 [00:06<00:04, 29.64it/s]
Loading 0: 65%|██████▌ | 237/363 [00:06<00:03, 34.32it/s]
Loading 0: 66%|██████▋ | 241/363 [00:06<00:03, 33.16it/s]
Loading 0: 68%|██████▊ | 246/363 [00:06<00:03, 37.14it/s]
Loading 0: 69%|██████▉ | 250/363 [00:06<00:03, 36.62it/s]
Loading 0: 70%|███████ | 255/363 [00:06<00:02, 39.86it/s]
Loading 0: 72%|███████▏ | 260/363 [00:06<00:02, 40.77it/s]
Loading 0: 73%|███████▎ | 265/363 [00:06<00:02, 41.72it/s]
Loading 0: 74%|███████▍ | 270/363 [00:07<00:02, 43.57it/s]
Loading 0: 76%|███████▌ | 275/363 [00:07<00:02, 35.76it/s]
Loading 0: 78%|███████▊ | 282/363 [00:07<00:01, 42.57it/s]
Loading 0: 79%|███████▉ | 287/363 [00:07<00:01, 41.35it/s]
Loading 0: 80%|████████ | 292/363 [00:07<00:01, 41.66it/s]
Loading 0: 82%|████████▏ | 298/363 [00:07<00:01, 40.95it/s]
Loading 0: 84%|████████▎ | 304/363 [00:14<00:22, 2.66it/s]
Loading 0: 85%|████████▍ | 307/363 [00:14<00:17, 3.19it/s]
Loading 0: 86%|████████▌ | 312/363 [00:14<00:11, 4.45it/s]
Loading 0: 88%|████████▊ | 320/363 [00:14<00:05, 7.24it/s]
Loading 0: 90%|████████▉ | 326/363 [00:15<00:03, 9.68it/s]
Loading 0: 91%|█████████ | 331/363 [00:15<00:02, 12.22it/s]
Loading 0: 93%|█████████▎| 338/363 [00:15<00:01, 16.95it/s]
Loading 0: 94%|█████████▍| 343/363 [00:15<00:00, 20.23it/s]
Loading 0: 96%|█████████▌| 348/363 [00:15<00:00, 20.85it/s]
Loading 0: 97%|█████████▋| 353/363 [00:15<00:00, 24.86it/s]
Loading 0: 99%|█████████▊| 358/363 [00:15<00:00, 26.83it/s]
Job mistralai-mistral-nemo-9330-v226-mkmlizer completed after 144.83s with status: succeeded
Stopping job with name mistralai-mistral-nemo-9330-v226-mkmlizer
Pipeline stage MKMLizer completed in 145.34s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.16s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service mistralai-mistral-nemo-9330-v226
Waiting for inference service mistralai-mistral-nemo-9330-v226 to be ready
Inference service mistralai-mistral-nemo-9330-v226 ready after 281.02163767814636s
Pipeline stage MKMLDeployer completed in 281.55s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2619686126708984s
Received healthy response to inference request in 1.2434213161468506s
Received healthy response to inference request in 1.3946688175201416s
Received healthy response to inference request in 1.5531389713287354s
Received healthy response to inference request in 1.5774142742156982s
5 requests
0 failed requests
5th percentile: 1.2736708164215087
10th percentile: 1.303920316696167
20th percentile: 1.3644193172454835
30th percentile: 1.4263628482818604
40th percentile: 1.489750909805298
50th percentile: 1.5531389713287354
60th percentile: 1.5628490924835206
70th percentile: 1.5725592136383058
80th percentile: 1.7143251419067385
90th percentile: 1.9881468772888184
95th percentile: 2.125057744979858
99th percentile: 2.2345864391326904
mean time: 1.6061223983764648
Pipeline stage StressChecker completed in 9.31s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.63s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.66s
Shutdown handler de-registered
mistralai-mistral-nemo_9330_v226 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2488.58s
Shutdown handler de-registered
mistralai-mistral-nemo_9330_v226 status is now inactive due to auto deactivation removed underperforming models
mistralai-mistral-nemo_9330_v226 status is now torndown due to DeploymentManager action
mistralai-mistral-nemo_9330_v226 status is now torndown due to DeploymentManager action
mistralai-mistral-nemo_9330_v226 status is now torndown due to DeploymentManager action