Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name cgato-nemo-12b-humanize-7413-v3-mkmlizer
Waiting for job on cgato-nemo-12b-humanize-7413-v3-mkmlizer to finish
cgato-nemo-12b-humanize-7413-v3-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
cgato-nemo-12b-humanize-7413-v3-mkmlizer: ║ _____ __ __ ║
cgato-nemo-12b-humanize-7413-v3-mkmlizer: ║ / _/ /_ ___ __/ / ___ ___ / / ║
cgato-nemo-12b-humanize-7413-v3-mkmlizer: ║ / _/ / // / |/|/ / _ \/ -_) -_) / ║
cgato-nemo-12b-humanize-7413-v3-mkmlizer: ║ /_//_/\_, /|__,__/_//_/\__/\__/_/ ║
cgato-nemo-12b-humanize-7413-v3-mkmlizer: ║ /___/ ║
cgato-nemo-12b-humanize-7413-v3-mkmlizer: ║ ║
cgato-nemo-12b-humanize-7413-v3-mkmlizer: ║ Version: 0.11.12 ║
cgato-nemo-12b-humanize-7413-v3-mkmlizer: ║ Copyright 2023 MK ONE TECHNOLOGIES Inc. ║
cgato-nemo-12b-humanize-7413-v3-mkmlizer: ║ https://mk1.ai ║
cgato-nemo-12b-humanize-7413-v3-mkmlizer: ║ ║
cgato-nemo-12b-humanize-7413-v3-mkmlizer: ║ The license key for the current software has been verified as ║
cgato-nemo-12b-humanize-7413-v3-mkmlizer: ║ belonging to: ║
cgato-nemo-12b-humanize-7413-v3-mkmlizer: ║ ║
cgato-nemo-12b-humanize-7413-v3-mkmlizer: ║ Chai Research Corp. ║
cgato-nemo-12b-humanize-7413-v3-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
cgato-nemo-12b-humanize-7413-v3-mkmlizer: ║ Expiration: 2025-01-15 23:59:59 ║
cgato-nemo-12b-humanize-7413-v3-mkmlizer: ║ ║
cgato-nemo-12b-humanize-7413-v3-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
cgato-nemo-12b-humanize-7413-v3-mkmlizer: Downloaded to shared memory in 47.543s
cgato-nemo-12b-humanize-7413-v3-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmp1k16865c, device:0
cgato-nemo-12b-humanize-7413-v3-mkmlizer: Saving flywheel model at /dev/shm/model_cache
cgato-nemo-12b-humanize-7413-v3-mkmlizer: quantized model in 37.109s
cgato-nemo-12b-humanize-7413-v3-mkmlizer: Processed model cgato/Nemo-12b-Humanize-KTO-Experimental-Latest in 84.652s
cgato-nemo-12b-humanize-7413-v3-mkmlizer: creating bucket guanaco-mkml-models
cgato-nemo-12b-humanize-7413-v3-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
cgato-nemo-12b-humanize-7413-v3-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/cgato-nemo-12b-humanize-7413-v3
cgato-nemo-12b-humanize-7413-v3-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/cgato-nemo-12b-humanize-7413-v3/special_tokens_map.json
cgato-nemo-12b-humanize-7413-v3-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/cgato-nemo-12b-humanize-7413-v3/config.json
cgato-nemo-12b-humanize-7413-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/cgato-nemo-12b-humanize-7413-v3/tokenizer_config.json
cgato-nemo-12b-humanize-7413-v3-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/cgato-nemo-12b-humanize-7413-v3/tokenizer.json
cgato-nemo-12b-humanize-7413-v3-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/cgato-nemo-12b-humanize-7413-v3/flywheel_model.0.safetensors
Job cgato-nemo-12b-humanize-7413-v3-mkmlizer completed after 114.49s with status: succeeded
Stopping job with name cgato-nemo-12b-humanize-7413-v3-mkmlizer
Pipeline stage MKMLizer completed in 115.03s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.16s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service cgato-nemo-12b-humanize-7413-v3
Waiting for inference service cgato-nemo-12b-humanize-7413-v3 to be ready
Inference service cgato-nemo-12b-humanize-7413-v3 ready after 180.65508818626404s
Pipeline stage MKMLDeployer completed in 181.21s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.0332319736480713s
Received healthy response to inference request in 1.9812681674957275s
Received healthy response to inference request in 1.4569361209869385s
Received healthy response to inference request in 1.580282211303711s
Received healthy response to inference request in 1.465677261352539s
5 requests
0 failed requests
5th percentile: 1.4586843490600585
10th percentile: 1.4604325771331788
20th percentile: 1.463929033279419
30th percentile: 1.4885982513427733
40th percentile: 1.5344402313232421
50th percentile: 1.580282211303711
60th percentile: 1.7406765937805175
70th percentile: 1.9010709762573241
80th percentile: 1.9916609287261964
90th percentile: 2.0124464511871336
95th percentile: 2.0228392124176025
99th percentile: 2.0311534214019775
mean time: 1.7034791469573975
Pipeline stage StressChecker completed in 9.82s
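
Note on the StressChecker summary above: the reported percentiles and mean appear consistent with linear interpolation between the sorted response times (the default behaviour of numpy.percentile). The log does not show how StressChecker actually computes them, so the following is only a minimal sketch under that assumption. The helper name `percentile` and the variable `latencies` are hypothetical; the five values are copied from the "Received healthy response" lines.

```python
import math


def percentile(samples, q):
    """Return the q-th percentile (0-100) of samples.

    Assumption: linear interpolation between adjacent order statistics,
    which is what the logged figures appear to follow.
    """
    ordered = sorted(samples)
    if not ordered:
        raise ValueError("percentile() needs at least one sample")
    # Fractional position of the requested percentile in the sorted list.
    rank = (q / 100) * (len(ordered) - 1)
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


# Response times in seconds, taken from the five healthy responses in the log.
latencies = [
    2.0332319736480713,
    1.9812681674957275,
    1.4569361209869385,
    1.580282211303711,
    1.465677261352539,
]

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples, every percentile other than the minimum, median and maximum is an interpolated value rather than an observed latency, so the tail figures (95th, 99th) say little beyond the single slowest request.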
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.42s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 2.28s
Shutdown handler de-registered
cgato-nemo-12b-humanize-_7413_v3 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2811.56s
Shutdown handler de-registered
cgato-nemo-12b-humanize-_7413_v3 status is now inactive due to auto deactivation removed underperforming models