Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name cgato-nemo-12b-thespice-3970-v4-mkmlizer
Waiting for job on cgato-nemo-12b-thespice-3970-v4-mkmlizer to finish
cgato-nemo-12b-thespice-3970-v4-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
cgato-nemo-12b-thespice-3970-v4-mkmlizer: ║ _____ __ __ ║
cgato-nemo-12b-thespice-3970-v4-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
cgato-nemo-12b-thespice-3970-v4-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
cgato-nemo-12b-thespice-3970-v4-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
cgato-nemo-12b-thespice-3970-v4-mkmlizer: ║ /___/ ║
cgato-nemo-12b-thespice-3970-v4-mkmlizer: ║ ║
cgato-nemo-12b-thespice-3970-v4-mkmlizer: ║ Version: 0.11.12 ║
cgato-nemo-12b-thespice-3970-v4-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
cgato-nemo-12b-thespice-3970-v4-mkmlizer: ║ https://mk1.ai ║
cgato-nemo-12b-thespice-3970-v4-mkmlizer: ║ ║
cgato-nemo-12b-thespice-3970-v4-mkmlizer: ║ The license key for the current software has been verified as ║
cgato-nemo-12b-thespice-3970-v4-mkmlizer: ║ belonging to: ║
cgato-nemo-12b-thespice-3970-v4-mkmlizer: ║ ║
cgato-nemo-12b-thespice-3970-v4-mkmlizer: ║ Chai Research Corp. ║
cgato-nemo-12b-thespice-3970-v4-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
cgato-nemo-12b-thespice-3970-v4-mkmlizer: ║ Expiration: 2025-01-15 23:59:59 ║
cgato-nemo-12b-thespice-3970-v4-mkmlizer: ║ ║
cgato-nemo-12b-thespice-3970-v4-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
cgato-nemo-12b-thespice-3970-v4-mkmlizer: Downloaded to shared memory in 115.291s
cgato-nemo-12b-thespice-3970-v4-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp6chgdkab, device:0
cgato-nemo-12b-thespice-3970-v4-mkmlizer: Saving flywheel model at /dev/shm/model_cache
cgato-nemo-12b-thespice-3970-v4-mkmlizer: quantized model in 37.254s
cgato-nemo-12b-thespice-3970-v4-mkmlizer: Processed model cgato/Nemo-12b-TheSpice-V0.9-All-v2-KTO-v0.3-Quick in 152.545s
cgato-nemo-12b-thespice-3970-v4-mkmlizer: creating bucket guanaco-mkml-models
cgato-nemo-12b-thespice-3970-v4-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
cgato-nemo-12b-thespice-3970-v4-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/cgato-nemo-12b-thespice-3970-v4
cgato-nemo-12b-thespice-3970-v4-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/cgato-nemo-12b-thespice-3970-v4/config.json
cgato-nemo-12b-thespice-3970-v4-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/cgato-nemo-12b-thespice-3970-v4/special_tokens_map.json
cgato-nemo-12b-thespice-3970-v4-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/cgato-nemo-12b-thespice-3970-v4/tokenizer_config.json
cgato-nemo-12b-thespice-3970-v4-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/cgato-nemo-12b-thespice-3970-v4/tokenizer.json
cgato-nemo-12b-thespice-3970-v4-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/cgato-nemo-12b-thespice-3970-v4/flywheel_model.0.safetensors
cgato-nemo-12b-thespice-3970-v4-mkmlizer:
Loading 0: 0%| | 0/363 [00:00<?, ?it/s]
Loading 0: 1%|▏ | 5/363 [00:00<00:10, 33.22it/s]
Loading 0: 4%|▎ | 13/363 [00:00<00:06, 53.87it/s]
Loading 0: 5%|▌ | 19/363 [00:00<00:07, 47.78it/s]
Loading 0: 7%|▋ | 25/363 [00:00<00:06, 48.68it/s]
Loading 0: 9%|▊ | 31/363 [00:00<00:06, 50.21it/s]
Loading 0: 10%|█ | 37/363 [00:00<00:06, 46.61it/s]
Loading 0: 12%|█▏ | 42/363 [00:00<00:07, 45.71it/s]
Loading 0: 13%|█▎ | 49/363 [00:01<00:06, 50.96it/s]
Loading 0: 15%|█▌ | 55/363 [00:01<00:06, 47.49it/s]
Loading 0: 17%|█▋ | 61/363 [00:01<00:08, 36.58it/s]
Loading 0: 18%|█▊ | 66/363 [00:01<00:08, 36.59it/s]
Loading 0: 20%|█▉ | 72/363 [00:01<00:07, 40.44it/s]
Loading 0: 21%|██▏ | 78/363 [00:01<00:07, 39.04it/s]
Loading 0: 23%|██▎ | 83/363 [00:01<00:07, 39.51it/s]
Loading 0: 25%|██▍ | 90/363 [00:02<00:06, 44.67it/s]
Loading 0: 26%|██▋ | 96/363 [00:02<00:06, 43.05it/s]
Loading 0: 28%|██▊ | 101/363 [00:02<00:06, 42.95it/s]
Loading 0: 30%|██▉ | 108/363 [00:02<00:05, 49.31it/s]
Loading 0: 31%|███▏ | 114/363 [00:02<00:05, 44.40it/s]
Loading 0: 33%|███▎ | 119/363 [00:02<00:05, 43.18it/s]
Loading 0: 34%|███▍ | 125/363 [00:02<00:05, 47.25it/s]
Loading 0: 36%|███▌ | 130/363 [00:02<00:05, 46.34it/s]
Loading 0: 37%|███▋ | 135/363 [00:03<00:05, 44.47it/s]
Loading 0: 39%|███▊ | 140/363 [00:03<00:05, 44.46it/s]
Loading 0: 40%|███▉ | 145/363 [00:03<00:08, 27.14it/s]
Loading 0: 41%|████ | 149/363 [00:03<00:07, 27.50it/s]
Loading 0: 43%|████▎ | 156/363 [00:03<00:05, 34.88it/s]
Loading 0: 44%|████▍ | 161/363 [00:03<00:05, 36.10it/s]
Loading 0: 46%|████▌ | 166/363 [00:04<00:05, 36.21it/s]
Loading 0: 47%|████▋ | 170/363 [00:04<00:05, 36.58it/s]
Loading 0: 48%|████▊ | 174/363 [00:04<00:05, 37.28it/s]
Loading 0: 49%|████▉ | 178/363 [00:04<00:05, 36.33it/s]
Loading 0: 50%|█████ | 183/363 [00:04<00:04, 39.41it/s]
Loading 0: 52%|█████▏ | 188/363 [00:04<00:04, 39.75it/s]
Loading 0: 53%|█████▎ | 193/363 [00:04<00:04, 40.69it/s]
Loading 0: 55%|█████▍ | 198/363 [00:04<00:03, 42.13it/s]
Loading 0: 56%|█████▌ | 203/363 [00:05<00:04, 34.46it/s]
Loading 0: 58%|█████▊ | 210/363 [00:05<00:03, 41.08it/s]
Loading 0: 59%|█████▉ | 215/363 [00:05<00:03, 40.73it/s]
Loading 0: 61%|██████ | 220/363 [00:05<00:03, 40.90it/s]
Loading 0: 62%|██████▏ | 225/363 [00:05<00:05, 26.08it/s]
Loading 0: 63%|██████▎ | 230/363 [00:05<00:04, 28.50it/s]
Loading 0: 65%|██████▌ | 237/363 [00:06<00:03, 35.28it/s]
Loading 0: 67%|██████▋ | 242/363 [00:06<00:03, 36.63it/s]
Loading 0: 68%|██████▊ | 247/363 [00:06<00:03, 37.74it/s]
Loading 0: 69%|██████▉ | 252/363 [00:06<00:02, 39.71it/s]
Loading 0: 71%|███████ | 257/363 [00:06<00:03, 33.45it/s]
Loading 0: 73%|███████▎ | 264/363 [00:06<00:02, 40.62it/s]
Loading 0: 74%|███████▍ | 269/363 [00:06<00:02, 40.63it/s]
Loading 0: 75%|███████▌ | 274/363 [00:06<00:02, 40.93it/s]
Loading 0: 77%|███████▋ | 279/363 [00:07<00:01, 42.07it/s]
Loading 0: 78%|███████▊ | 284/363 [00:07<00:02, 34.55it/s]
Loading 0: 80%|████████ | 291/363 [00:07<00:01, 41.37it/s]
Loading 0: 82%|████████▏ | 296/363 [00:07<00:01, 41.67it/s]
Loading 0: 83%|████████▎ | 301/363 [00:07<00:01, 42.84it/s]
Loading 0: 84%|████████▍ | 306/363 [00:14<00:23, 2.47it/s]
Loading 0: 85%|████████▌ | 310/363 [00:14<00:16, 3.18it/s]
Loading 0: 87%|████████▋ | 314/363 [00:14<00:11, 4.17it/s]
Loading 0: 88%|████████▊ | 320/363 [00:14<00:06, 6.23it/s]
Loading 0: 90%|████████▉ | 326/363 [00:14<00:04, 8.71it/s]
Loading 0: 91%|█████████ | 330/363 [00:15<00:03, 10.62it/s]
Loading 0: 93%|█████████▎| 338/363 [00:15<00:01, 16.25it/s]
Loading 0: 95%|█████████▍| 344/363 [00:15<00:00, 19.67it/s]
Loading 0: 96%|█████████▌| 349/363 [00:15<00:00, 22.72it/s]
Loading 0: 98%|█████████▊| 355/363 [00:15<00:00, 27.78it/s]
Loading 0: 99%|█████████▉| 360/363 [00:15<00:00, 31.12it/s]
Job cgato-nemo-12b-thespice-3970-v4-mkmlizer completed after 175.59s with status: succeeded
Stopping job with name cgato-nemo-12b-thespice-3970-v4-mkmlizer
Pipeline stage MKMLizer completed in 176.12s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.17s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service cgato-nemo-12b-thespice-3970-v4
Waiting for inference service cgato-nemo-12b-thespice-3970-v4 to be ready
Connection pool is full, discarding connection: %s. Connection pool size: %s
Inference service cgato-nemo-12b-thespice-3970-v4 ready after 150.61071801185608s
Pipeline stage MKMLDeployer completed in 151.11s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.2111895084381104s
Received healthy response to inference request in 1.696347951889038s
Received healthy response to inference request in 1.6399211883544922s
Received healthy response to inference request in 1.6529431343078613s
5 requests
1 failed requests
5th percentile: 1.642525577545166
10th percentile: 1.6451299667358399
20th percentile: 1.6503387451171876
30th percentile: 1.6616240978240966
40th percentile: 1.6789860248565673
50th percentile: 1.696347951889038
60th percentile: 1.902284574508667
70th percentile: 2.108221197128296
80th percentile: 5.796516656875614
90th percentile: 12.967170953750612
95th percentile: 16.552498102188107
99th percentile: 19.42075982093811
mean time: 5.4676454067230225
%s, retrying in %s seconds...
Received healthy response to inference request in 1.7267656326293945s
Received healthy response to inference request in 2.1254608631134033s
Received healthy response to inference request in 2.049870014190674s
Received healthy response to inference request in 1.6183648109436035s
Received healthy response to inference request in 2.150596857070923s
5 requests
0 failed requests
5th percentile: 1.6400449752807618
10th percentile: 1.66172513961792
20th percentile: 1.7050854682922363
30th percentile: 1.7913865089416503
40th percentile: 1.920628261566162
50th percentile: 2.049870014190674
60th percentile: 2.0801063537597657
70th percentile: 2.1103426933288576
80th percentile: 2.130488061904907
90th percentile: 2.140542459487915
95th percentile: 2.145569658279419
99th percentile: 2.1495914173126223
mean time: 1.9342116355895995
Pipeline stage StressChecker completed in 39.76s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.08s
Shutdown handler de-registered
cgato-nemo-12b-thespice-_3970_v4 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2511.08s
Shutdown handler de-registered
cgato-nemo-12b-thespice-_3970_v4 status is now inactive due to auto deactivation removed underperforming models
cgato-nemo-12b-thespice-_3970_v4 status is now torndown due to DeploymentManager action