developer_uid: richhx
submission_id: chaiml-nis-qwen32b-sim_98336_v40
model_name: chaiml-nis-qwen32b-sim_98336_v40
model_group: ChaiML/nis-qwen32b-simpo
status: torndown
timestamp: 2026-01-14T16:59:36+00:00
num_battles: 15140
num_wins: 7272
celo_rating: 1284.31
family_friendly_score: 0.49219999999999997
family_friendly_standard_error: 0.007070207351980562
submission_type: basic
model_repo: ChaiML/nis-qwen32b-simpoexp1-s1-pref-1295datav1sft-try2
model_architecture: Qwen2ForCausalLM
model_num_parameters: 32759331840.0
best_of: 5
max_input_tokens: 768
max_output_tokens: 64
reward_model: default
latencies: [{'batch_size': 1, 'throughput': 0.42470700033163883, 'latency_mean': 2.354486783742905, 'latency_p50': 2.358576774597168, 'latency_p90': 2.6401683568954466}, {'batch_size': 3, 'throughput': 0.9267260978509363, 'latency_mean': 3.2359594070911406, 'latency_p50': 3.226755142211914, 'latency_p90': 3.532512640953064}, {'batch_size': 5, 'throughput': 1.230139367093474, 'latency_mean': 4.047938539981842, 'latency_p50': 4.014459013938904, 'latency_p90': 4.497701239585877}, {'batch_size': 6, 'throughput': 1.347862611941935, 'latency_mean': 4.425119992494583, 'latency_p50': 4.371065020561218, 'latency_p90': 5.013818526268005}, {'batch_size': 8, 'throughput': 1.5171807214265403, 'latency_mean': 5.252075741291046, 'latency_p50': 5.252602219581604, 'latency_p90': 5.923411965370178}, {'batch_size': 10, 'throughput': 1.625931704680697, 'latency_mean': 6.084514290094376, 'latency_p50': 6.110061764717102, 'latency_p90': 6.7834173202514645}]
gpu_counts: {'NVIDIA A100-SXM4-80GB': 1}
display_name: chaiml-nis-qwen32b-sim_98336_v40
is_internal_developer: True
language_model: ChaiML/nis-qwen32b-simpoexp1-s1-pref-1295datav1sft-try2
model_size: 33B
ranking_group: single
throughput_3p7s: 1.12
us_pacific_date: 2026-01-06
win_ratio: 0.4803170409511229
generation_params: {'temperature': 0.45, 'top_p': 0.95, 'min_p': 0.025, 'top_k': 40, 'presence_penalty': 0.35, 'frequency_penalty': 0.35, 'stopping_words': ['<|im_end|>', '<|im_start|>', '\n'], 'max_input_tokens': 768, 'best_of': 5, 'max_output_tokens': 64}
formatter: {'memory_template': '<|system|>Family Friendly{memory}\n', 'prompt_template': '', 'bot_template': '<|im_start|>assistant\n{message}<|im_end|>\n', 'user_template': '<|im_start|>user\nYou:{message}<|im_end|>\n', 'response_template': '<|im_start|>assistant\n', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name chaiml-nis-qwen32b-sim-98336-v40-mkmlizer
Waiting for job on chaiml-nis-qwen32b-sim-98336-v40-mkmlizer to finish
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: bash: cannot set terminal process group (-1): Inappropriate ioctl for device
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: bash: no job control in this shell
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: /root/miniconda3/envs/nvidia/lib/python3.11/site-packages/mk1/__init__.py:1: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: __import__('pkg_resources').declare_namespace(__name__)
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: ║ ║
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: ║ Version: 0.30.6+torch280 ║
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: ║ https://mk1.ai ║
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: ║ ║
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: ║ The license key for the current software has been verified as ║
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: ║ belonging to: ║
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: ║ ║
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: ║ Chai Research Corp. ║
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: ║ ║
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: Downloaded to shared memory in 67.689s
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: Checking if ChaiML/nis-qwen32b-simpoexp1-s1-pref-1295datav1sft-try2 already exists in ChaiML
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: quantizing model to /dev/shm/model_cache, profile:q4, folder:/tmp/tmpkvfeipue, device:0
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: Saving flywheel model at /dev/shm/model_cache
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: Loading 0: 100%|██████████| 771/771 [06:29<00:00, 1.98it/s]
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: quantized model in 402.523s
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: Processed model ChaiML/nis-qwen32b-simpoexp1-s1-pref-1295datav1sft-try2 in 470.212s
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: creating bucket guanaco-mkml-models
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/chaiml-nis-qwen32b-sim-98336-v40/nvidia
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: cp /dev/shm/model_cache/chat_template.jinja s3://guanaco-mkml-models/chaiml-nis-qwen32b-sim-98336-v40/nvidia/chat_template.jinja
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: cp /dev/shm/model_cache/added_tokens.json s3://guanaco-mkml-models/chaiml-nis-qwen32b-sim-98336-v40/nvidia/added_tokens.json
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/chaiml-nis-qwen32b-sim-98336-v40/nvidia/special_tokens_map.json
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/chaiml-nis-qwen32b-sim-98336-v40/nvidia/tokenizer_config.json
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/chaiml-nis-qwen32b-sim-98336-v40/nvidia/config.json
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: cp /dev/shm/model_cache/merges.txt s3://guanaco-mkml-models/chaiml-nis-qwen32b-sim-98336-v40/nvidia/merges.txt
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: cp /dev/shm/model_cache/vocab.json s3://guanaco-mkml-models/chaiml-nis-qwen32b-sim-98336-v40/nvidia/vocab.json
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/chaiml-nis-qwen32b-sim-98336-v40/nvidia/tokenizer.json
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/chaiml-nis-qwen32b-sim-98336-v40/nvidia/flywheel_model.1.safetensors
chaiml-nis-qwen32b-sim-98336-v40-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/chaiml-nis-qwen32b-sim-98336-v40/nvidia/flywheel_model.0.safetensors
Job chaiml-nis-qwen32b-sim-98336-v40-mkmlizer completed after 522.93s with status: succeeded
Stopping job with name chaiml-nis-qwen32b-sim-98336-v40-mkmlizer
Pipeline stage MKMLizer completed in 524.61s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.16s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service chaiml-nis-qwen32b-sim-98336-v40
Waiting for inference service chaiml-nis-qwen32b-sim-98336-v40 to be ready
Inference service chaiml-nis-qwen32b-sim-98336-v40 ready after 150.95303773880005s
Pipeline stage MKMLDeployer completed in 151.55s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.033907651901245s
Received healthy response to inference request in 2.5939853191375732s
Received healthy response to inference request in 2.321746349334717s
Received healthy response to inference request in 2.7395987510681152s
Received healthy response to inference request in 3.353825569152832s
5 requests
0 failed requests
5th percentile: 2.376194143295288
10th percentile: 2.430641937255859
20th percentile: 2.539537525177002
30th percentile: 2.6231080055236817
40th percentile: 2.6813533782958983
50th percentile: 2.7395987510681152
60th percentile: 2.8573223114013673
70th percentile: 2.975045871734619
80th percentile: 3.0978912353515624
90th percentile: 3.2258584022521974
95th percentile: 3.2898419857025147
99th percentile: 3.3410288524627685
mean time: 2.8086127281188964
Pipeline stage StressChecker completed in 15.46s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.71s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.73s
Shutdown handler de-registered
chaiml-nis-qwen32b-sim_98336_v40 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2715.65s
Shutdown handler de-registered
chaiml-nis-qwen32b-sim_98336_v40 status is now torndown due to DeploymentManager action