developer_uid: rirv938
submission_id: chaiml-96p-4ff-chaiml-m_45592_v3
model_name: chaiml-96p-4ff-chaiml-m_45592_v3
model_group: ChaiML/96p_4ff_chaiml_mi
status: protected
timestamp: 2026-01-31T01:03:39+00:00
num_battles: 11447
num_wins: 6184
celo_rating: 1334.9
family_friendly_score: 0.5044
family_friendly_standard_error: 0.007070794014818986
submission_type: basic
model_repo: ChaiML/96p_4ff_chaiml_mistral_24b_2048_63507_v3_cp312_merged
model_architecture: MistralForCausalLM
model_num_parameters: 24096691200.0
best_of: 8
max_input_tokens: 2048
max_output_tokens: 96
reward_model: default
display_name: chaiml-96p-4ff-chaiml-m_45592_v3
ineligible_reason: max_output_tokens!=64
is_internal_developer: True
language_model: ChaiML/96p_4ff_chaiml_mistral_24b_2048_63507_v3_cp312_merged
model_size: 24B
ranking_group: single
us_pacific_date: 2026-01-30
win_ratio: 0.5402288809295012
generation_params: {'temperature': 1.0, 'top_p': 0.95, 'min_p': 0.05, 'top_k': 60, 'presence_penalty': 0.1, 'frequency_penalty': 0.0, 'stopping_words': ['</s>', '\n', 'You:', '###', '<|im_start|>', '<|im_end|>'], 'max_input_tokens': 2048, 'best_of': 8, 'max_output_tokens': 96}
formatter: {'memory_template': '', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
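The formatter entry above is a set of string templates. The following is a minimal sketch of how such templates might be assembled into a prompt, assuming messages are rendered in order and response_template is appended last. The build_prompt function, the (speaker, message) turn structure, and the example names are hypothetical; memory_template, prompt_template (both empty here) and truncate_by_message are not handled.

```python
# Template values copied from the formatter entry in the metadata.
FORMATTER = {
    "memory_template": "",
    "prompt_template": "",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
    "truncate_by_message": False,
}


def build_prompt(formatter, bot_name, user_name, turns):
    """Render (speaker, message) turns, then append the response cue.

    Hypothetical helper: the real pipeline's assembly order is an assumption.
    """
    parts = []
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response template leaves the bot's line open for the model to complete.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


prompt = build_prompt(
    FORMATTER, "Bot", "User", [("bot", "Hi there."), ("user", "Hello!")]
)
print(prompt)
```

Since '\n' is among the stopping_words in generation_params, a completion would presumably end at the first newline, which is consistent with the one-message-per-line layout of these templates.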
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-96p-4ff-chaiml-m-45592-v3-uploader
Waiting for job on chaiml-96p-4ff-chaiml-m-45592-v3-uploader to finish
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: /root/miniconda3/envs/nvidia/lib/python3.11/site-packages/mk1/__init__.py:1: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: __import__('pkg_resources').declare_namespace(__name__)
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: Version: 0.30.6+torch280
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: Features: FLYWHEEL, CUDA
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: Copyright 2023-2025 MK ONE TECHNOLOGIES Inc.
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: https://mk1.ai
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: The license key for the current software has been verified as belonging to: Chai Research Corp.
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: Account ID: 7997a29f-0ceb-4cc7-9adf-840c57b4ae6f
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: Expiration: 2028-03-31 23:59:59
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: Downloaded to shared memory in 88.568s
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: Processed model ChaiML/96p_4ff_chaiml_mistral_24b_2048_63507_v3_cp312_merged in 129.704s
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: creating bucket guanaco-vllm-models
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: uploading /dev/shm/model_cache to s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/generation_config.json s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/generation_config.json
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/README.md s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/README.md
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/.gitattributes s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/.gitattributes
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/tokenizer_config.json s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/tokenizer_config.json
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/config.json s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/config.json
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model.safetensors.index.json
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/special_tokens_map.json s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/special_tokens_map.json
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00005-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00005-of-00021.safetensors
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00011-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00011-of-00021.safetensors
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00019-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00019-of-00021.safetensors
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00015-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00015-of-00021.safetensors
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00010-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00010-of-00021.safetensors
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00012-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00012-of-00021.safetensors
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00008-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00008-of-00021.safetensors
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00017-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00017-of-00021.safetensors
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00021-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00021-of-00021.safetensors
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00007-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00007-of-00021.safetensors
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00013-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00013-of-00021.safetensors
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00004-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00004-of-00021.safetensors
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00003-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00003-of-00021.safetensors
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00002-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00002-of-00021.safetensors
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00018-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00018-of-00021.safetensors
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00001-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00001-of-00021.safetensors
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00009-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00009-of-00021.safetensors
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00016-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00016-of-00021.safetensors
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00014-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00014-of-00021.safetensors
chaiml-96p-4ff-chaiml-m-45592-v3-uploader: cp /dev/shm/model_cache/model-00006-of-00021.safetensors s3://guanaco-vllm-models/chaiml-96p-4ff-chaiml-m-45592-v3/model-00006-of-00021.safetensors
Job chaiml-96p-4ff-chaiml-m-45592-v3-uploader completed after 297.51s with status: succeeded
Stopping job with name chaiml-96p-4ff-chaiml-m-45592-v3-uploader
Pipeline stage VLLMUploader completed in 298.49s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 0.26s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-96p-4ff-chaiml-m-45592-v3
Waiting for inference service chaiml-96p-4ff-chaiml-m-45592-v3 to be ready
Inference service chaiml-96p-4ff-chaiml-m-45592-v3 ready after 233.58215832710266s
Pipeline stage VLLMDeployer completed in 234.59s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.5716004371643066s
Received healthy response to inference request in 2.894354820251465s
Received healthy response to inference request in 2.231506586074829s
Received healthy response to inference request in 2.9818785190582275s
Received healthy response to inference request in 3.493274211883545s
Received healthy response to inference request in 2.144045352935791s
Received healthy response to inference request in 2.2343063354492188s
Received healthy response to inference request in 2.1825757026672363s
Received healthy response to inference request in 2.555976390838623s
Received healthy response to inference request in 2.0662784576416016s
Received healthy response to inference request in 2.4461824893951416s
Received healthy response to inference request in 2.2298524379730225s
Received healthy response to inference request in 2.349030017852783s
Received healthy response to inference request in 2.627642869949341s
Received healthy response to inference request in 2.2989697456359863s
Received healthy response to inference request in 2.5523183345794678s
Received healthy response to inference request in 2.935236692428589s
Received healthy response to inference request in 2.5155892372131348s
Received healthy response to inference request in 2.480804920196533s
Received healthy response to inference request in 2.3997130393981934s
Received healthy response to inference request in 2.797119617462158s
Received healthy response to inference request in 2.6109282970428467s
Received healthy response to inference request in 2.464798927307129s
Received healthy response to inference request in 2.5117013454437256s
Received healthy response to inference request in 2.547600030899048s
Received healthy response to inference request in 1.8038289546966553s
Received healthy response to inference request in 2.4362001419067383s
Received healthy response to inference request in 2.188650369644165s
Received healthy response to inference request in 2.2330431938171387s
Received healthy response to inference request in 2.132810115814209s
30 requests
0 failed requests
5th percentile: 2.096217703819275
10th percentile: 2.1429218292236327
20th percentile: 2.221612024307251
30th percentile: 2.2339273929595946
40th percentile: 2.3794398307800293
50th percentile: 2.4554907083511353
60th percentile: 2.5132565021514894
70th percentile: 2.5534157514572144
80th percentile: 2.661538219451905
90th percentile: 2.9399008750915527
95th percentile: 3.2631461501121506
99th percentile: 3.548885831832886
mean time: 2.497260586420695
Pipeline stage StressChecker completed in 80.03s
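The StressChecker summary above can be reproduced from the 30 logged response times. The sketch below assumes the percentiles use linear interpolation between order statistics (the same rule as NumPy's default); that assumption is not stated in the log, but it matches the logged values. The percentile helper is illustrative, not the pipeline's own code.

```python
# Response times in seconds, copied from the "Received healthy response" lines.
latencies = [
    3.5716004371643066, 2.894354820251465, 2.231506586074829,
    2.9818785190582275, 3.493274211883545, 2.144045352935791,
    2.2343063354492188, 2.1825757026672363, 2.555976390838623,
    2.0662784576416016, 2.4461824893951416, 2.2298524379730225,
    2.349030017852783, 2.627642869949341, 2.2989697456359863,
    2.5523183345794678, 2.935236692428589, 2.5155892372131348,
    2.480804920196533, 2.3997130393981934, 2.797119617462158,
    2.6109282970428467, 2.464798927307129, 2.5117013454437256,
    2.547600030899048, 1.8038289546966553, 2.4362001419067383,
    2.188650369644165, 2.2330431938171387, 2.132810115814209,
]


def percentile(values, q):
    """q-th percentile (0-100) by linear interpolation between sorted values."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lo = int(pos)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


mean_time = sum(latencies) / len(latencies)

print(f"{len(latencies)} requests")
for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

The last digits may differ slightly from the log because of floating-point rounding in a different implementation.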
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.17s
Shutdown handler de-registered
chaiml-96p-4ff-chaiml-m_45592_v3 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2248.13s
Shutdown handler de-registered
chaiml-96p-4ff-chaiml-m_45592_v3 status is now inactive due to auto deactivation removed underperforming models
chaiml-96p-4ff-chaiml-m_45592_v3 status is now protected due to ABTestQueueItem