Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer
Waiting for job on cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer to finish
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: ║ _____ __ __ ║
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: ║ /___/ ║
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: ║ ║
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: ║ Version: 0.11.12 ║
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: ║ https://mk1.ai ║
Failed to get response for submission rica40325-10-14dpo_v2: ('http://rica40325-10-14dpo-v2-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: ║ ║
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: ║ The license key for the current software has been verified as ║
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: ║ belonging to: ║
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: ║ ║
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: ║ Chai Research Corp. ║
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: ║ Expiration: 2025-01-15 23:59:59 ║
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: ║ ║
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: Downloaded to shared memory in 36.960s
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpgqabqn6x, device:0
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Connection pool is full, discarding connection: %s. Connection pool size: %s
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: quantized model in 26.326s
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: Processed model cgato/L3.1-8b-TheSpice-V0.9-RP-Preview2 in 63.287s
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: creating bucket guanaco-mkml-models
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/cgato-l3-1-8b-thespice-v-7768-v1
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/cgato-l3-1-8b-thespice-v-7768-v1/config.json
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/cgato-l3-1-8b-thespice-v-7768-v1/special_tokens_map.json
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/cgato-l3-1-8b-thespice-v-7768-v1/tokenizer_config.json
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/cgato-l3-1-8b-thespice-v-7768-v1/tokenizer.json
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/cgato-l3-1-8b-thespice-v-7768-v1/flywheel_model.0.safetensors
cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer:
Loading 0: 0%| | 0/291 [00:00<?, ?it/s]
Loading 0: 2%|▏ | 5/291 [00:00<00:08, 34.80it/s]
Loading 0: 4%|▍ | 13/291 [00:00<00:04, 56.62it/s]
Loading 0: 7%|▋ | 20/291 [00:00<00:05, 53.86it/s]
Loading 0: 9%|▉ | 26/291 [00:00<00:04, 53.97it/s]
Loading 0: 11%|█ | 32/291 [00:00<00:05, 45.98it/s]
Loading 0: 14%|█▎ | 40/291 [00:00<00:04, 53.19it/s]
Loading 0: 16%|█▌ | 46/291 [00:00<00:04, 49.68it/s]
Loading 0: 18%|█▊ | 52/291 [00:01<00:04, 51.17it/s]
Loading 0: 20%|██ | 59/291 [00:01<00:04, 46.46it/s]
Loading 0: 23%|██▎ | 67/291 [00:01<00:04, 53.46it/s]
Loading 0: 25%|██▌ | 73/291 [00:01<00:04, 49.47it/s]
Loading 0: 27%|██▋ | 79/291 [00:01<00:04, 50.89it/s]
Loading 0: 29%|██▉ | 85/291 [00:01<00:05, 36.40it/s]
Loading 0: 31%|███▏ | 91/291 [00:02<00:05, 37.19it/s]
Loading 0: 33%|███▎ | 96/291 [00:02<00:05, 38.51it/s]
Loading 0: 35%|███▌ | 103/291 [00:02<00:04, 44.07it/s]
Loading 0: 37%|███▋ | 109/291 [00:02<00:04, 40.74it/s]
Loading 0: 39%|███▉ | 114/291 [00:02<00:04, 41.09it/s]
Loading 0: 42%|████▏ | 121/291 [00:02<00:03, 46.92it/s]
Loading 0: 44%|████▎ | 127/291 [00:02<00:03, 44.71it/s]
Loading 0: 45%|████▌ | 132/291 [00:02<00:03, 44.77it/s]
Loading 0: 48%|████▊ | 139/291 [00:03<00:03, 50.03it/s]
Loading 0: 50%|████▉ | 145/291 [00:03<00:03, 46.83it/s]
Loading 0: 52%|█████▏ | 150/291 [00:03<00:03, 45.09it/s]
Loading 0: 54%|█████▍ | 157/291 [00:03<00:02, 50.46it/s]
Loading 0: 56%|█████▌ | 163/291 [00:03<00:02, 46.93it/s]
Loading 0: 58%|█████▊ | 168/291 [00:03<00:02, 45.94it/s]
Loading 0: 60%|██████ | 175/291 [00:03<00:02, 51.65it/s]
Loading 0: 62%|██████▏ | 181/291 [00:03<00:02, 43.89it/s]
Loading 0: 64%|██████▍ | 187/291 [00:04<00:02, 35.50it/s]
Loading 0: 66%|██████▌ | 192/291 [00:04<00:02, 36.88it/s]
Loading 0: 68%|██████▊ | 197/291 [00:04<00:02, 38.68it/s]
Loading 0: 70%|██████▉ | 203/291 [00:04<00:02, 37.15it/s]
Loading 0: 73%|███████▎ | 211/291 [00:04<00:01, 46.04it/s]
Loading 0: 75%|███████▍ | 217/291 [00:04<00:01, 44.45it/s]
Loading 0: 76%|███████▋ | 222/291 [00:04<00:01, 43.85it/s]
Loading 0: 79%|███████▊ | 229/291 [00:05<00:01, 48.69it/s]
Loading 0: 81%|████████ | 235/291 [00:05<00:01, 45.84it/s]
Loading 0: 82%|████████▏ | 240/291 [00:05<00:01, 45.62it/s]
Loading 0: 85%|████████▍ | 247/291 [00:05<00:00, 50.44it/s]
Loading 0: 87%|████████▋ | 253/291 [00:05<00:00, 46.38it/s]
Loading 0: 89%|████████▊ | 258/291 [00:05<00:00, 44.46it/s]
Loading 0: 91%|█████████ | 264/291 [00:05<00:00, 46.34it/s]
Loading 0: 92%|█████████▏| 269/291 [00:05<00:00, 46.18it/s]
Loading 0: 94%|█████████▍| 274/291 [00:06<00:00, 46.09it/s]
Loading 0: 96%|█████████▌| 279/291 [00:06<00:00, 47.13it/s]
Loading 0: 98%|█████████▊| 284/291 [00:06<00:00, 39.48it/s]
Loading 0: 99%|█████████▉| 289/291 [00:11<00:00, 3.01it/s]
Job cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer completed after 83.78s with status: succeeded
Stopping job with name cgato-l3-1-8b-thespice-v-7768-v1-mkmlizer
Pipeline stage MKMLizer completed in 84.37s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.19s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service cgato-l3-1-8b-thespice-v-7768-v1
Waiting for inference service cgato-l3-1-8b-thespice-v-7768-v1 to be ready
Inference service cgato-l3-1-8b-thespice-v-7768-v1 ready after 140.49457049369812s
Pipeline stage MKMLDeployer completed in 141.05s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.8761682510375977s
Received healthy response to inference request in 1.5438563823699951s
Received healthy response to inference request in 1.3934078216552734s
Received healthy response to inference request in 1.838911771774292s
Received healthy response to inference request in 1.6045055389404297s
5 requests
0 failed requests
5th percentile: 1.4234975337982179
10th percentile: 1.453587245941162
20th percentile: 1.5137666702270507
30th percentile: 1.555986213684082
40th percentile: 1.5802458763122558
50th percentile: 1.6045055389404297
60th percentile: 1.6982680320739747
70th percentile: 1.7920305252075195
80th percentile: 1.8463630676269531
90th percentile: 1.8612656593322754
95th percentile: 1.8687169551849365
99th percentile: 1.8746779918670655
mean time: 1.6513699531555175
Pipeline stage StressChecker completed in 9.63s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.89s
Shutdown handler de-registered
cgato-l3-1-8b-thespice-v_7768_v1 status is now deployed due to DeploymentManager action
cgato-l3-1-8b-thespice-v_7768_v1 status is now inactive due to auto deactivation removed underperforming models
cgato-l3-1-8b-thespice-v_7768_v1 status is now torndown due to DeploymentManager action