Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
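Note on the literal `%s` in lines such as "run pipeline stage %s": one plausible explanation, which this log does not confirm, is that the log collector stored the unformatted message template instead of the rendered message. The sketch below uses Python's standard `logging` module to show the difference between `record.msg` (the template) and `record.getMessage()` (template merged with its arguments). The handler class and logger name are hypothetical and are not taken from the pipeline's code.

```python
import logging


class TemplateCapture(logging.Handler):
    """Hypothetical handler that stores both the raw template and the rendered text."""

    def __init__(self):
        super().__init__()
        self.raw = []
        self.rendered = []

    def emit(self, record):
        # record.msg is the template exactly as passed to logger.info(...)
        self.raw.append(record.msg)
        # record.getMessage() applies record.args to the template
        self.rendered.append(record.getMessage())


logger = logging.getLogger("pipeline_sketch")  # assumed name, for illustration only
logger.setLevel(logging.INFO)
logger.propagate = False
capture = TemplateCapture()
logger.addHandler(capture)

logger.info("run pipeline stage %s", "MKMLizer")

print(capture.raw[0])
print(capture.rendered[0])
```

If a collector behaves like `capture.raw`, the stage name is lost from that line, and only a separate message such as "Running pipeline stage MKMLizer" identifies the stage.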
Running pipeline stage MKMLizer
%s, retrying in %s seconds...
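Note on the retry message: the log shows a retry being announced, but not the retry policy (attempt count, delay, or which errors are retried). The following is a minimal sketch of a fixed-delay retry loop that emits a message of the same shape. The function name `retry`, its parameters, and the logger name are assumptions made for illustration.

```python
import logging
import time

logger = logging.getLogger("retry_sketch")  # assumed name, for illustration only


def retry(func, attempts=3, delay_seconds=1.0, sleep=time.sleep):
    """Call func() up to `attempts` times, sleeping between failed attempts.

    Re-raises the last error if every attempt fails. `sleep` is injectable
    so the loop can be exercised without actually waiting.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            return func()
        except Exception as error:  # a real pipeline would likely narrow this
            last_error = error
            if attempt == attempts:
                break
            logger.warning("%s, retrying in %s seconds...", error, delay_seconds)
            sleep(delay_seconds)
    raise last_error
```

Retrying only helps with transient failures. An error such as a missing attribute on an imported module is deterministic, so a loop like this would exhaust its attempts and then surface the same error.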
clean up pipeline due to error=MKMLizerError("module 'kubernetes.config' has no attribute 'load_kube_config'")
Shutdown handler de-registered
MKMLizerError("module 'kubernetes.config' has no attribute 'load_kube_config'")
chaiml-nemo-community-2_v1 status is now failed due to DeploymentManager action
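Note on the MKMLizerError above: in the official Kubernetes Python client, `load_kube_config` is normally available as `kubernetes.config.load_kube_config`. The log does not say why the attribute was missing. Two commonly seen causes, offered here as assumptions, are an incomplete installation of the package and a different `kubernetes` directory shadowing it on the import path. The helper below is a hypothetical diagnostic sketch that reports where a module is loaded from and whether it exposes a given attribute.

```python
import importlib
import importlib.util


def diagnose_attribute(module_name, attribute):
    """Report where a module resolves from and whether it has `attribute`."""
    try:
        # For dotted names, find_spec imports the parent package first and
        # raises ModuleNotFoundError if that parent cannot be found.
        spec = importlib.util.find_spec(module_name)
    except ModuleNotFoundError:
        spec = None
    if spec is None:
        return {
            "module": module_name,
            "found": False,
            "origin": None,
            "has_attribute": False,
        }
    module = importlib.import_module(module_name)
    return {
        "module": module_name,
        "found": True,
        # origin is the file the module was loaded from; an unexpected path
        # here would point to shadowing.
        "origin": spec.origin,
        "has_attribute": hasattr(module, attribute),
    }


if __name__ == "__main__":
    print(diagnose_attribute("kubernetes.config", "load_kube_config"))
```

Run in the environment that produced the error, a result with `found` true, `has_attribute` false, and an `origin` outside the expected site-packages directory would be consistent with the shadowing hypothesis.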
Starting job with name chaiml-nemo-community-2-v1-mkmlizer
Waiting for job on chaiml-nemo-community-2-v1-mkmlizer to finish
chaiml-nemo-community-2-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-nemo-community-2-v1-mkmlizer: ║ _____ __ __ ║
chaiml-nemo-community-2-v1-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
chaiml-nemo-community-2-v1-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
chaiml-nemo-community-2-v1-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
chaiml-nemo-community-2-v1-mkmlizer: ║ /___/ ║
chaiml-nemo-community-2-v1-mkmlizer: ║ ║
chaiml-nemo-community-2-v1-mkmlizer: ║ Version: 0.11.12 ║
chaiml-nemo-community-2-v1-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
chaiml-nemo-community-2-v1-mkmlizer: ║ https://mk1.ai ║
chaiml-nemo-community-2-v1-mkmlizer: ║ ║
chaiml-nemo-community-2-v1-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-nemo-community-2-v1-mkmlizer: ║ belonging to: ║
chaiml-nemo-community-2-v1-mkmlizer: ║ ║
chaiml-nemo-community-2-v1-mkmlizer: ║ Chai Research Corp. ║
chaiml-nemo-community-2-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-nemo-community-2-v1-mkmlizer: ║ Expiration: 2024-10-15 23:59:59 ║
chaiml-nemo-community-2-v1-mkmlizer: ║ ║
chaiml-nemo-community-2-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
admin requested tearing down of chaiml-nemo-community-2_v1
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLDeleter
Skipping teardown as no inference service was successfully deployed
Pipeline stage MKMLDeleter completed in 0.14s
run pipeline stage %s
Running pipeline stage MKMLModelDeleter
Skipping deletion as no model was successfully uploaded
Pipeline stage MKMLModelDeleter completed in 0.14s
Shutdown handler de-registered
chaiml-nemo-community-2_v1 status is now torndown due to DeploymentManager action
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name arliai-mistral-nemo-12b-9104-v4-mkmlizer
Waiting for job on arliai-mistral-nemo-12b-9104-v4-mkmlizer to finish
chaiml-nemo-community-2-v1-mkmlizer: Downloaded to shared memory in 48.023s
chaiml-nemo-community-2-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpww6q0l1t, device:0
chaiml-nemo-community-2-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name intervitens-mini-magnum-5180-v6-mkmlizer
Waiting for job on intervitens-mini-magnum-5180-v6-mkmlizer to finish
chaiml-nemo-community-2-v1-mkmlizer: quantized model in 35.111s
chaiml-nemo-community-2-v1-mkmlizer: Processed model ChaiML/nemo-community-2 in 83.135s
chaiml-nemo-community-2-v1-mkmlizer: creating bucket guanaco-mkml-models
chaiml-nemo-community-2-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-nemo-community-2-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-nemo-community-2-v1
chaiml-nemo-community-2-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-nemo-community-2-v1/config.json
chaiml-nemo-community-2-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-nemo-community-2-v1/special_tokens_map.json
chaiml-nemo-community-2-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-nemo-community-2-v1/tokenizer_config.json
chaiml-nemo-community-2-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-nemo-community-2-v1/tokenizer.json
chaiml-nemo-community-2-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-nemo-community-2-v1/flywheel_model.0.safetensors
chaiml-nemo-community-2-v1-mkmlizer:
Loading 0: 99%|█████████▊| 358/363 [00:14<00:00, 45.95it/s]
Job chaiml-nemo-community-2-v1-mkmlizer completed after 103.29s with status: succeeded
Stopping job with name chaiml-nemo-community-2-v1-mkmlizer
Pipeline stage MKMLizer completed in 104.29s
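Note on stage timings: lines of the form "Pipeline stage <name> completed in <seconds>s" are regular enough to extract mechanically. The sketch below parses them with a regular expression. The function name and the pattern are illustrative and match only the line shape seen in this log.

```python
import re

# Matches e.g. "Pipeline stage MKMLizer completed in 104.29s"
STAGE_RE = re.compile(
    r"^Pipeline stage (?P<stage>\w+) completed in (?P<seconds>\d+(?:\.\d+)?)s$"
)


def stage_durations(lines):
    """Return (stage, seconds) tuples for every stage-completion line."""
    durations = []
    for line in lines:
        match = STAGE_RE.match(line.strip())
        if match:
            durations.append((match["stage"], float(match["seconds"])))
    return durations


sample = [
    "Running pipeline stage MKMLizer",
    "Pipeline stage MKMLizer completed in 104.29s",
    "Pipeline stage MKMLTemplater completed in 0.12s",
    "Pipeline stage MKMLDeployer completed in 130.84s",
]
print(stage_durations(sample))
```

Because several pipelines write to this log concurrently, the completion lines do not carry a model name, so attributing a duration to a specific model requires following the surrounding lines.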
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.12s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-nemo-community-2-v1
Waiting for inference service chaiml-nemo-community-2-v1 to be ready
intervitens-mini-magnum-5180-v6-mkmlizer: Downloaded to shared memory in 46.686s
intervitens-mini-magnum-5180-v6-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp1qogd2ua, device:0
intervitens-mini-magnum-5180-v6-mkmlizer: Saving flywheel model at /dev/shm/model_cache
arliai-mistral-nemo-12b-9104-v4-mkmlizer: Downloaded to shared memory in 42.130s
arliai-mistral-nemo-12b-9104-v4-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp8eee_gga, device:0
arliai-mistral-nemo-12b-9104-v4-mkmlizer: Saving flywheel model at /dev/shm/model_cache
intervitens-mini-magnum-5180-v6-mkmlizer: quantized model in 35.800s
intervitens-mini-magnum-5180-v6-mkmlizer: Processed model intervitens/mini-magnum-12b-v1.1 in 82.486s
intervitens-mini-magnum-5180-v6-mkmlizer: creating bucket guanaco-mkml-models
intervitens-mini-magnum-5180-v6-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
intervitens-mini-magnum-5180-v6-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/intervitens-mini-magnum-5180-v6
intervitens-mini-magnum-5180-v6-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/intervitens-mini-magnum-5180-v6/tokenizer.json
intervitens-mini-magnum-5180-v6-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/intervitens-mini-magnum-5180-v6/flywheel_model.0.safetensors
Job intervitens-mini-magnum-5180-v6-mkmlizer completed after 114.12s with status: succeeded
arliai-mistral-nemo-12b-9104-v4-mkmlizer: quantized model in 36.040s
Stopping job with name intervitens-mini-magnum-5180-v6-mkmlizer
arliai-mistral-nemo-12b-9104-v4-mkmlizer: Processed model ArliAI/Mistral-Nemo-12B-ArliAI-RPMax-v1.1 in 78.170s
arliai-mistral-nemo-12b-9104-v4-mkmlizer: creating bucket guanaco-mkml-models
Pipeline stage MKMLizer completed in 141.66s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.13s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service intervitens-mini-magnum-5180-v6
Waiting for inference service intervitens-mini-magnum-5180-v6 to be ready
Job arliai-mistral-nemo-12b-9104-v4-mkmlizer completed after 168.13s with status: succeeded
Stopping job with name arliai-mistral-nemo-12b-9104-v4-mkmlizer
Pipeline stage MKMLizer completed in 168.60s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.15s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service arliai-mistral-nemo-12b-9104-v4
Waiting for inference service arliai-mistral-nemo-12b-9104-v4 to be ready
Inference service chaiml-nemo-community-2-v1 ready after 130.30659532546997s
Pipeline stage MKMLDeployer completed in 130.84s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.961472988128662s
Received healthy response to inference request in 1.514986276626587s
Received healthy response to inference request in 1.8367865085601807s
Received healthy response to inference request in 1.6672732830047607s
Received healthy response to inference request in 1.6418750286102295s
5 requests
0 failed requests
5th percentile: 1.5403640270233154
10th percentile: 1.565741777420044
20th percentile: 1.616497278213501
30th percentile: 1.6469546794891357
40th percentile: 1.6571139812469482
50th percentile: 1.6672732830047607
60th percentile: 1.7350785732269287
70th percentile: 1.8028838634490967
80th percentile: 1.8617238044738769
90th percentile: 1.9115983963012695
95th percentile: 1.936535692214966
99th percentile: 1.9564855289459229
mean time: 1.724478816986084
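Note on the StressChecker statistics: the percentiles and mean above are consistent with linear interpolation between sorted samples applied to the five response times logged before them. The sketch below recomputes them with the standard library only. That the StressChecker uses this exact method is an inference from the numbers, not something the log states, and the function name is illustrative.

```python
def percentile(values, q):
    """Percentile q (0 to 100) using linear interpolation between sorted samples."""
    if not values:
        raise ValueError("values must not be empty")
    ordered = sorted(values)
    position = (len(ordered) - 1) * q / 100.0
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = position - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


# The five response times reported in the log, in seconds.
latencies = [
    1.961472988128662,
    1.514986276626587,
    1.8367865085601807,
    1.6672732830047607,
    1.6418750286102295,
]

for q in (5, 50, 90, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples, the tail percentiles are interpolations between the two slowest requests rather than measurements of rare slow responses.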
Pipeline stage StressChecker completed in 10.75s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 3.63s
Shutdown handler de-registered
chaiml-nemo-community-2_v1 status is now deployed due to DeploymentManager action
chaiml-nemo-community-2_v1 status is now inactive due to auto deactivation removed underperforming models
Cleaning model data from S3
Deleting key chaiml-nemo-comm-2alinea-5104-v1/flywheel_model.0.safetensors from bucket guanaco-mkml-models
Deleting key chaiml-nemo-comm-2bbio-m-2877-v1/config.json from bucket guanaco-mkml-models
chaiml-nemo-community-2_v1 status is now torndown due to DeploymentManager action