developer_uid: rirv938
submission_id: rirv938-slerp-grpo-cp31_93207_v1
model_name: rirv938-slerp-grpo-cp31_93207_v1
model_group: rirv938/slerp_grpo_cp312
status: torndown
timestamp: 2025-06-10T02:28:47+00:00
num_battles: 8335
num_wins: 4396
celo_rating: 1298.04
family_friendly_score: 0.5646
family_friendly_standard_error: 0.0070118020508283035
submission_type: basic
model_repo: rirv938/slerp_grpo_cp312_96ff_b3_r1_merged
model_architecture: MistralForCausalLM
model_num_parameters: 24096691200.0
best_of: 8
max_input_tokens: 1024
max_output_tokens: 64
reward_model: default
display_name: rirv938-slerp-grpo-cp31_93207_v1
is_internal_developer: True
language_model: rirv938/slerp_grpo_cp312_96ff_b3_r1_merged
model_size: 24B
ranking_group: single
us_pacific_date: 2025-06-09
win_ratio: 0.5274145170965807
generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['</s>', 'Bot:', 'You:', '####', '<|im_end|>', 'User:', '\n', '<|eot_id|>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '<|system|>Family Friendly\n', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLizer
Starting job with name rirv938-slerp-grpo-cp31-93207-v1-mkmlizer
Waiting for job on rirv938-slerp-grpo-cp31-93207-v1-mkmlizer to finish
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ╔═════════════════════════════════════════════════════════════════════╗
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ ██████ ██████ █████ ████ ████ ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ ░░██████ ██████ ░░███ ███░ ░░███ ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ ░███░█████░███ ░███ ███ ░███ ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ ░███░░███ ░███ ░███████ ░███ ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ ░███ ░░░ ░███ ░███░░███ ░███ ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ ░███ ░███ ░███ ░░███ ░███ ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ █████ █████ █████ ░░████ █████ ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ ░░░░░ ░░░░░ ░░░░░ ░░░░ ░░░░░ ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ Version: 0.27.1+vampire_v3 ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ Features: FLYWHEEL, CUDA ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ Copyright 2023-2025 MK ONE TECHNOLOGIES Inc. ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ https://mk1.ai ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ The license key for the current software has been verified as ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ belonging to: ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ Chai Research Corp. ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ Expiration: 2028-03-31 23:59:59 ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ║ ║
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: ╚═════════════════════════════════════════════════════════════════════╝
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: Downloaded to shared memory in 185.260s
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: quantizing model to /dev/shm/model_cache, profile:s0, folder:/tmp/tmpn12df16w, device:0
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: Saving flywheel model at /dev/shm/model_cache
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: quantized model in 56.576s
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: Processed model rirv938/slerp_grpo_cp312_96ff_b3_r1_merged in 241.842s
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: creating bucket guanaco-mkml-models
Failed to get response for submission chaiml-cyndonia24b-cpos_11400_v1: ('http://chaiml-cyndonia24b-cpos-11400-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: Bucket 's3://guanaco-mkml-models/' created
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: uploading /dev/shm/model_cache to s3://guanaco-mkml-models/rirv938-slerp-grpo-cp31-93207-v1
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: cp /dev/shm/model_cache/config.json s3://guanaco-mkml-models/rirv938-slerp-grpo-cp31-93207-v1/config.json
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: DEBUG retryable error: RequestError: send request failed
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: caused by: Put "https://object.ord1.coreweave.com/guanaco-mkml-models/rirv938-slerp-grpo-cp31-93207-v1/special_tokens_map.json": read tcp 10.144.224.80:45520->216.153.53.63:443: read: connection reset by peer
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-mkml-models/rirv938-slerp-grpo-cp31-93207-v1/special_tokens_map.json
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-mkml-models/rirv938-slerp-grpo-cp31-93207-v1/tokenizer_config.json
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: cp /dev/shm/model_cache/tokenizer.json s3://guanaco-mkml-models/rirv938-slerp-grpo-cp31-93207-v1/tokenizer.json
Failed to get response for submission chaiml-cyndonia24b-cpos_11400_v1: ('http://chaiml-cyndonia24b-cpos-11400-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.1.safetensors s3://guanaco-mkml-models/rirv938-slerp-grpo-cp31-93207-v1/flywheel_model.1.safetensors
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: cp /dev/shm/model_cache/flywheel_model.0.safetensors s3://guanaco-mkml-models/rirv938-slerp-grpo-cp31-93207-v1/flywheel_model.0.safetensors
rirv938-slerp-grpo-cp31-93207-v1-mkmlizer: Loading 0: 0%| | 0/363 [00:00<?, ?it/s] Loading 0: 1%| | 3/363 [00:00<00:12, 28.49it/s] Loading 0: 2%|▏ | 6/363 [00:00<00:25, 13.91it/s] Loading 0: 3%|▎ | 11/363 [00:00<00:16, 21.86it/s] Loading 0: 4%|▍ | 14/363 [00:00<00:27, 12.89it/s] Loading 0: 4%|▍ | 16/363 [00:01<00:27, 12.64it/s] Loading 0: 6%|▌ | 21/363 [00:01<00:21, 16.27it/s] Loading 0: 6%|▋ | 23/363 [00:01<00:26, 13.02it/s] Loading 0: 8%|▊ | 28/363 [00:01<00:17, 18.83it/s] Loading 0: 9%|▉ | 32/363 [00:01<00:14, 22.44it/s] Loading 0: 10%|▉ | 35/363 [00:02<00:21, 15.36it/s] Loading 0: 10%|█ | 38/363 [00:02<00:20, 16.00it/s] Loading 0: 11%|█▏ | 41/363 [00:02<00:25, 12.56it/s] Loading 0: 13%|█▎ | 46/363 [00:02<00:17, 17.79it/s] Loading 0: 14%|█▍ | 50/363 [00:02<00:14, 21.41it/s] Loading 0: 15%|█▍ | 54/363 [00:03<00:22, 13.66it/s] Loading 0: 16%|█▌ | 57/363 [00:03<00:19, 15.60it/s] Loading 0: 17%|█▋ | 60/363 [00:03<00:21, 14.17it/s] Loading 0: 18%|█▊ | 64/363 [00:03<00:16, 17.61it/s] Loading 0: 19%|█▊ | 68/363 [00:04<00:14, 20.88it/s] Loading 0: 20%|█▉ | 71/363 [00:04<00:19, 14.93it/s] Loading 0: 20%|██ | 74/363 [00:04<00:18, 15.70it/s] Loading 0: 21%|██ | 77/363 [00:04<00:22, 12.46it/s] Loading 0: 23%|██▎ | 82/363 [00:05<00:16, 17.37it/s] Loading 0: 24%|██▎ | 86/363 [00:05<00:13, 20.31it/s] Loading 0: 25%|██▍ | 89/363 [00:05<00:18, 14.80it/s] Loading 0: 25%|██▌ | 92/363 [00:05<00:18, 14.84it/s] Loading 0: 26%|██▌ | 95/363 [00:05<00:15, 17.11it/s] Loading 0: 27%|██▋ | 99/363 [00:05<00:12, 20.84it/s] Loading 0: 28%|██▊ | 102/363 [00:06<00:13, 19.16it/s] Loading 0: 29%|██▉ | 105/363 [00:06<00:17, 14.54it/s] Loading 0: 29%|██▉ | 107/363 [00:06<00:19, 13.22it/s] Loading 0: 30%|███ | 109/363 [00:06<00:19, 13.02it/s] Loading 0: 31%|███ | 111/363 [00:06<00:18, 13.94it/s] Loading 0: 31%|███ | 113/363 [00:07<00:21, 11.50it/s] Loading 0: 33%|███▎ | 118/363 [00:07<00:13, 18.00it/s] Loading 0: 34%|███▎ | 122/363 [00:07<00:11, 21.56it/s] Loading 0: 34%|███▍ | 125/363 [00:07<00:15, 15.12it/s] Loading 0: 35%|███▌ | 128/363 [00:08<00:15, 15.65it/s] Loading 0: 36%|███▌ | 130/363 [00:08<00:16, 13.73it/s] Loading 0: 36%|███▋ | 132/363 [00:08<00:17, 13.35it/s] Loading 0: 37%|███▋ | 136/363 [00:08<00:12, 17.98it/s] Loading 0: 39%|███▊ | 140/363 [00:08<00:10, 21.39it/s] Loading 0: 39%|███▉ | 143/363 [00:08<00:14, 14.73it/s] Loading 0: 40%|███▉ | 145/363 [00:09<00:15, 14.10it/s] Loading 0: 40%|████ | 147/363 [00:09<00:14, 14.96it/s] Loading 0: 41%|████ | 149/363 [00:09<00:17, 12.16it/s] Loading 0: 42%|████▏ | 154/363 [00:09<00:11, 18.63it/s] Loading 0: 44%|████▎ | 158/363 [00:09<00:09, 22.35it/s] Loading 0: 44%|████▍ | 161/363 [00:10<00:12, 15.64it/s] Loading 0: 45%|████▌ | 164/363 [00:10<00:12, 16.43it/s] Loading 0: 46%|████▌ | 167/363 [00:10<00:15, 12.99it/s] Loading 0: 47%|████▋ | 172/363 [00:10<00:10, 18.25it/s] Loading 0: 48%|████▊ | 176/363 [00:10<00:08, 21.65it/s] Loading 0: 49%|████▉ | 179/363 [00:11<00:11, 15.91it/s] Loading 0: 50%|█████ | 182/363 [00:11<00:11, 16.44it/s] Loading 0: 51%|█████ | 185/363 [00:11<00:13, 13.03it/s] Loading 0: 52%|█████▏ | 190/363 [00:11<00:09, 18.25it/s] Loading 0: 53%|█████▎ | 194/363 [00:11<00:07, 21.90it/s] Loading 0: 55%|█████▍ | 198/363 [00:12<00:11, 14.37it/s] Loading 0: 55%|█████▌ | 201/363 [00:26<03:22, 1.25s/it] Loading 0: 56%|█████▌ | 203/363 [00:27<02:46, 1.04s/it] Loading 0: 57%|█████▋ | 207/363 [00:27<01:46, 1.47it/s] Loading 0: 58%|█████▊ | 210/363 [00:27<01:17, 1.97it/s] Loading 0: 59%|█████▊ | 213/363 [00:27<00:58, 2.58it/s] Loading 0: 60%|█████▉ | 216/363 [00:28<00:45, 3.22it/s] Loading 0: 60%|██████ | 219/363 [00:28<00:33, 4.33it/s] Loading 0: 61%|██████ | 222/363 [00:28<00:26, 5.30it/s] Loading 0: 63%|██████▎ | 227/363 [00:28<00:16, 8.33it/s] Loading 0: 64%|██████▎ | 231/363 [00:28<00:13, 9.87it/s] Loading 0: 64%|██████▍ | 234/363 [00:29<00:13, 9.23it/s] Loading 0: 65%|██████▌ | 237/363 [00:29<00:11, 11.16it/s] Loading 0: 66%|██████▌ | 240/363 [00:29<00:11, 11.16it/s] Loading 0: 67%|██████▋ | 244/363 [00:29<00:08, 14.63it/s] Loading 0: 68%|██████▊ | 248/363 [00:29<00:06, 17.97it/s] Loading 0: 69%|██████▉ | 251/363 [00:30<00:07, 14.16it/s] Loading 0: 70%|██████▉ | 254/363 [00:30<00:07, 15.19it/s] Loading 0: 71%|███████ | 257/363 [00:30<00:08, 12.31it/s] Loading 0: 72%|███████▏ | 262/363 [00:30<00:05, 17.24it/s] Loading 0: 73%|███████▎ | 266/363 [00:30<00:04, 20.60it/s] Loading 0: 74%|███████▍ | 269/363 [00:31<00:06, 15.61it/s] Loading 0: 75%|███████▍ | 272/363 [00:31<00:05, 16.20it/s] Loading 0: 76%|███████▌ | 275/363 [00:31<00:06, 12.83it/s] Loading 0: 77%|███████▋ | 280/363 [00:31<00:04, 17.92it/s] Loading 0: 78%|███████▊ | 284/363 [00:31<00:03, 21.48it/s] Loading 0: 79%|███████▉ | 287/363 [00:32<00:04, 15.90it/s] Loading 0: 80%|███████▉ | 290/363 [00:32<00:04, 16.49it/s] Loading 0: 81%|████████ | 293/363 [00:32<00:05, 12.66it/s] Loading 0: 82%|████████▏ | 298/363 [00:32<00:03, 17.75it/s] Loading 0: 83%|████████▎ | 302/363 [00:32<00:02, 21.12it/s] Loading 0: 84%|████████▍ | 305/363 [00:33<00:03, 15.69it/s] Loading 0: 85%|████████▍ | 308/363 [00:33<00:03, 16.34it/s] Loading 0: 86%|████████▌ | 311/363 [00:33<00:04, 12.78it/s] Loading 0: 87%|████████▋ | 316/363 [00:33<00:02, 17.88it/s] Loading 0: 88%|████████▊ | 320/363 [00:34<00:01, 21.54it/s] Loading 0: 89%|████████▉ | 324/363 [00:34<00:02, 14.05it/s] Loading 0: 90%|█████████ | 327/363 [00:34<00:02, 15.97it/s] Loading 0: 91%|█████████ | 330/363 [00:34<00:02, 14.55it/s] Loading 0: 92%|█████████▏| 334/363 [00:35<00:01, 18.38it/s] Loading 0: 93%|█████████▎| 338/363 [00:35<00:01, 21.95it/s] Loading 0: 94%|█████████▍| 341/363 [00:35<00:01, 15.99it/s] Loading 0: 95%|█████████▍| 344/363 [00:35<00:01, 16.47it/s] Loading 0: 96%|█████████▌| 347/363 [00:36<00:01, 12.71it/s] Loading 0: 97%|█████████▋| 352/363 [00:36<00:00, 17.58it/s] Loading 0: 98%|█████████▊| 356/363 [00:36<00:00, 21.09it/s] Loading 0: 99%|█████████▉| 359/363 [00:36<00:00, 11.47it/s] Loading 0: 100%|█████████▉| 362/363 [00:37<00:00, 11.42it/s]
Job rirv938-slerp-grpo-cp31-93207-v1-mkmlizer completed after 282.02s with status: succeeded
Stopping job with name rirv938-slerp-grpo-cp31-93207-v1-mkmlizer
Pipeline stage MKMLizer completed in 282.57s
run pipeline stage %s
Running pipeline stage MKMLTemplater
Pipeline stage MKMLTemplater completed in 0.16s
run pipeline stage %s
Running pipeline stage MKMLDeployer
Creating inference service rirv938-slerp-grpo-cp31-93207-v1
Waiting for inference service rirv938-slerp-grpo-cp31-93207-v1 to be ready
Failed to get response for submission chaiml-cyndonia24b-cpos_11400_v1: ('http://chaiml-cyndonia24b-cpos-11400-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-cyndonia24b-cpos_11400_v1: ('http://chaiml-cyndonia24b-cpos-11400-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Failed to get response for submission chaiml-cyndonia24b-cpos_11400_v1: ('http://chaiml-cyndonia24b-cpos-11400-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', '')
Inference service rirv938-slerp-grpo-cp31-93207-v1 ready after 110.5057282447815s
Pipeline stage MKMLDeployer completed in 111.10s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.7655303478240967s
Received healthy response to inference request in 1.198495626449585s
{"detail":"HTTPConnectionPool(host='rirv938-slerp-grpo-cp31-93207-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Max retries exceeded with url: /v1/models/GPT-J-6B-lit-v2:predict (Caused by ConnectTimeoutError(<urllib3.connection.HTTPConnection object at 0x7a145834cd50>, 'Connection to rirv938-slerp-grpo-cp31-93207-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com timed out. (connect timeout=12.0)'))"}
Received unhealthy response to inference request!
Received healthy response to inference request in 1.464810848236084s
Received healthy response to inference request in 2.232192277908325s
5 requests
1 failed requests
5th percentile: 1.2517586708068849
10th percentile: 1.3050217151641845
20th percentile: 1.411547803878784
30th percentile: 1.5249547481536865
40th percentile: 1.6452425479888917
50th percentile: 1.7655303478240967
60th percentile: 1.952195119857788
70th percentile: 2.1388598918914794
80th percentile: 4.229708909988405
90th percentile: 8.22474217414856
95th percentile: 10.222258806228636
99th percentile: 11.8202721118927
mean time: 3.7761609077453615
%s, retrying in %s seconds...
Received healthy response to inference request in 2.0049617290496826s
Received healthy response to inference request in 1.9109864234924316s
Received healthy response to inference request in 1.3179805278778076s
Received healthy response to inference request in 1.32733154296875s
Received healthy response to inference request in 1.744516372680664s
5 requests
0 failed requests
5th percentile: 1.3198507308959961
10th percentile: 1.3217209339141847
20th percentile: 1.3254613399505615
30th percentile: 1.4107685089111328
40th percentile: 1.5776424407958984
50th percentile: 1.744516372680664
60th percentile: 1.8111043930053712
70th percentile: 1.877692413330078
80th percentile: 1.9297814846038819
90th percentile: 1.9673716068267821
95th percentile: 1.9861666679382324
99th percentile: 2.0012027168273927
mean time: 1.6611553192138673
Pipeline stage StressChecker completed in 29.80s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.75s
run pipeline stage %s
Running pipeline stage TriggerMKMLProfilingPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage TriggerMKMLProfilingPipeline completed in 0.74s
Shutdown handler de-registered
rirv938-slerp-grpo-cp31_93207_v1 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeleter
Skipping teardown as no inference service was successfully deployed
Pipeline stage MKMLProfilerDeleter completed in 0.14s
run pipeline stage %s
Running pipeline stage MKMLProfilerTemplater
Pipeline stage MKMLProfilerTemplater completed in 0.13s
run pipeline stage %s
Running pipeline stage MKMLProfilerDeployer
Creating inference service rirv938-slerp-grpo-cp31-93207-v1-profiler
Waiting for inference service rirv938-slerp-grpo-cp31-93207-v1-profiler to be ready
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4447.45s
Shutdown handler de-registered
rirv938-slerp-grpo-cp31_93207_v1 status is now inactive due to auto deactivation removed underperforming models
rirv938-slerp-grpo-cp31_93207_v1 status is now torndown due to DeploymentManager action