developer_uid: rirv938
submission_id: chaiml-glm-air-4-5-sft-_55881_v1
model_name: chaiml-glm-air-4-5-sft-_55881_v1
model_group: ChaiML/glm_air_4_5_sft_2
status: inactive
timestamp: 2026-02-28T18:17:27+00:00
num_battles: 13913
num_wins: 6612
celo_rating: 1276.23
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/glm_air_4_5_sft_20260227_not_q235_e1
model_architecture: Glm4MoeForCausalLM
model_num_parameters: 9073971200.0
best_of: 8
max_input_tokens: 2048
max_output_tokens: 64
reward_model: default
display_name: chaiml-glm-air-4-5-sft-_55881_v1
is_internal_developer: True
language_model: ChaiML/glm_air_4_5_sft_20260227_not_q235_e1
model_size: 9B
ranking_group: single
us_pacific_date: 2026-02-28
win_ratio: 0.4752389851218285
generation_params: {'temperature': 1.0, 'top_p': 0.95, 'min_p': 0.05, 'top_k': 60, 'presence_penalty': 0.1, 'frequency_penalty': 0.0, 'stopping_words': ['###', '<|im_end|>', '</s>', 'You:', '<|im_start|>'], 'max_input_tokens': 2048, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '[gMASK]<sop><|system|>\nRespond as a high quality storyteller.<|user|>\n', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '<|assistant|>\n<think></think>\n{bot_name}:', 'truncate_by_message': False}
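A minimal sketch of how the formatter templates above might be joined into a single prompt. The template strings are copied from the formatter entry. The concatenation order (memory, prompt, chat turns, response) and the `build_prompt` helper are assumptions for illustration, not the platform's confirmed implementation. Truncation to `max_input_tokens` is not modeled.

```python
# Template strings copied from the formatter entry above.
FORMATTER = {
    "memory_template": "[gMASK]<sop><|system|>\nRespond as a high quality storyteller.<|user|>\n",
    "prompt_template": "",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "<|assistant|>\n<think></think>\n{bot_name}:",
}


def build_prompt(turns, bot_name, user_name, formatter=FORMATTER):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [formatter["memory_template"], formatter["prompt_template"]]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(formatter["bot_template"].format(bot_name=bot_name, message=message))
        else:
            parts.append(formatter["user_template"].format(user_name=user_name, message=message))
    # The response template ends with "{bot_name}:" so the model continues as the bot.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Illustrative names and messages only; they do not come from the log.
prompt = build_prompt([("bot", "Hello."), ("user", "Hi!")], bot_name="Nova", user_name="You")
```

Under these assumptions the prompt ends with the bot's name and a colon, which matches the `'You:'` entry in `stopping_words`: generation would stop once the model starts writing the user's turn.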
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-glm-air-4-5-sft-55881-v1-uploader
Waiting for job on chaiml-glm-air-4-5-sft-55881-v1-uploader to finish
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-glm-air-4-5-sft-40077-v1-uploader
Waiting for job on chaiml-glm-air-4-5-sft-40077-v1-uploader to finish
chaiml-glm-air-4-5-sft-55881-v1-uploader: Using quantization_mode: none
chaiml-glm-air-4-5-sft-55881-v1-uploader: Downloading snapshot of ChaiML/glm_air_4_5_sft_20260227_not_q235_e1...
chaiml-glm-air-4-5-sft-40077-v1-uploader: Using quantization_mode: none
chaiml-glm-air-4-5-sft-40077-v1-uploader: Downloading snapshot of ChaiML/glm_air_4_5_sft_20260227_not_q235_e2...
chaiml-glm-air-4-5-sft-55881-v1-uploader: Downloaded in 71.488s
chaiml-glm-air-4-5-sft-40077-v1-uploader: Downloaded in 70.687s
chaiml-glm-air-4-5-sft-55881-v1-uploader: Processed model ChaiML/glm_air_4_5_sft_20260227_not_q235_e1 in 150.479s
chaiml-glm-air-4-5-sft-55881-v1-uploader: creating bucket guanaco-vllm-models
chaiml-glm-air-4-5-sft-55881-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-glm-air-4-5-sft-55881-v1-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-glm-air-4-5-sft-55881-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-glm-air-4-5-sft-55881-v1-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-glm-air-4-5-sft-55881-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-glm-air-4-5-sft-55881-v1-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-glm-air-4-5-sft-55881-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-glm-air-4-5-sft-55881-v1-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-glm-air-4-5-sft-55881-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-glm-air-4-5-sft-55881-v1-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-glm-air-4-5-sft-55881-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-glm-air-4-5-sft-55881-v1-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-glm-air-4-5-sft-55881-v1-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-glm-air-4-5-sft-55881-v1-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-glm-air-4-5-sft-40077-v1-uploader: Processed model ChaiML/glm_air_4_5_sft_20260227_not_q235_e2 in 151.425s
chaiml-glm-air-4-5-sft-55881-v1-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-glm-air-4-5-sft-55881-v1-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-glm-air-4-5-sft-55881-v1-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-glm-air-4-5-sft-55881-v1-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/README.md s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/README.md
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/args.json s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/args.json
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/config.json
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/chat_template.jinja
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model.safetensors.index.json
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/special_tokens_map.json
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/.gitattributes
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/tokenizer.json
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/tokenizer_config.json
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model.safetensors.index.json
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/special_tokens_map.json
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00043-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00043-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00021-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00021-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00034-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00034-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00001-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00001-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00011-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00011-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00012-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00012-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00008-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00008-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00013-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00013-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00003-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00003-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00007-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00007-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00037-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00037-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00002-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00002-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00017-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00017-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00041-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00041-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00016-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00016-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00004-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00004-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00015-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00015-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00032-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00032-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00028-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00028-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00026-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00026-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00005-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00005-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00009-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00009-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00030-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00030-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00027-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00027-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00025-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00025-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00019-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00019-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00023-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00023-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00018-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00018-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00014-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00014-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00029-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00029-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00040-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00040-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00039-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00039-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00042-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00042-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00024-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00024-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00035-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00035-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00031-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00031-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00010-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00010-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00038-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00038-of-00043.safetensors
Job chaiml-glm-air-4-5-sft-55881-v1-uploader completed after 233.49s with status: succeeded
Stopping job with name chaiml-glm-air-4-5-sft-55881-v1-uploader
Pipeline stage VLLMUploader completed in 235.54s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 1.13s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-glm-air-4-5-sft-55881-v1
Waiting for inference service chaiml-glm-air-4-5-sft-55881-v1 to be ready
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00043-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00043-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00036-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00036-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00025-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00025-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00014-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00014-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00003-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00003-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00033-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00033-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00008-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00008-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00024-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00024-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00007-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00007-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00020-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00020-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00009-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00009-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00006-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00006-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00041-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00041-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00019-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00019-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00010-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00010-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00016-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00016-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00032-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00032-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00017-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00017-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00028-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00028-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00034-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00034-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00023-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00023-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00015-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00015-of-00043.safetensors
Job chaiml-glm-air-4-5-sft-40077-v1-uploader completed after 271.08s with status: succeeded
Stopping job with name chaiml-glm-air-4-5-sft-40077-v1-uploader
Pipeline stage VLLMUploader completed in 272.87s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 1.04s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-glm-air-4-5-sft-40077-v1
Waiting for inference service chaiml-glm-air-4-5-sft-40077-v1 to be ready
Inference service chaiml-glm-air-4-5-sft-55881-v1 ready after 548.2415251731873s
Pipeline stage VLLMDeployer completed in 550.04s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.4504988193511963s
Received healthy response to inference request in 1.9575912952423096s
Received healthy response to inference request in 1.8580811023712158s
Received healthy response to inference request in 1.9933357238769531s
Received healthy response to inference request in 1.8157002925872803s
Received healthy response to inference request in 1.9335134029388428s
Received healthy response to inference request in 1.7962396144866943s
Received healthy response to inference request in 2.298945903778076s
Received healthy response to inference request in 1.8634896278381348s
Received healthy response to inference request in 1.8585729598999023s
Received healthy response to inference request in 2.026310443878174s
Received healthy response to inference request in 1.900045394897461s
Received healthy response to inference request in 1.8919684886932373s
Received healthy response to inference request in 2.034473419189453s
Received healthy response to inference request in 2.083508014678955s
Received healthy response to inference request in 2.2027766704559326s
Received healthy response to inference request in 1.8159234523773193s
Received healthy response to inference request in 2.9188904762268066s
Received healthy response to inference request in 2.9295387268066406s
Received healthy response to inference request in 1.9449183940887451s
Received healthy response to inference request in 2.131909132003784s
Received healthy response to inference request in 1.9095537662506104s
Received healthy response to inference request in 2.5048627853393555s
Received healthy response to inference request in 1.8255126476287842s
Received healthy response to inference request in 2.1036412715911865s
Inference service chaiml-glm-air-4-5-sft-40077-v1 ready after 568.0828440189362s
Pipeline stage VLLMDeployer completed in 570.70s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.896056890487671s
Received healthy response to inference request in 1.941697359085083s
Received healthy response to inference request in 1.8913304805755615s
Received healthy response to inference request in 1.8320143222808838s
Received healthy response to inference request in 2.350023031234741s
Received healthy response to inference request in 2.883476734161377s
Received healthy response to inference request in 1.9336893558502197s
Received healthy response to inference request in 1.8863286972045898s
30 requests
Received healthy response to inference request in 2.118736505508423s
0 failed requests
5th percentile: 1.815800714492798
10th percentile: 1.8245537281036377
20th percentile: 1.8584745883941651
30th percentile: 1.88982994556427
40th percentile: 1.898449993133545
50th percentile: 1.939215898513794
60th percentile: 2.0065256118774415
Received healthy response to inference request in 1.980151891708374s
70th percentile: 2.0895479917526245
80th percentile: 2.2220105171203617
90th percentile: 2.5427241802215583
95th percentile: 2.902954292297363
99th percentile: 2.9264507341384887
mean time: 2.0813002983729043
Pipeline stage StressChecker completed in 78.85s
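The percentile summary above can be reproduced with linearly interpolated percentiles. The reported 5th percentile (1.815800714492798) is consistent with interpolating between the second and third smallest of 30 samples (1.8157002925872803 and 1.8159234523773193). The sketch below assumes that interpolation rule; the function names are hypothetical and this is not the StressChecker's actual code.

```python
import math


def percentile(samples, q):
    """Linearly interpolated percentile (q in [0, 100]) of a non-empty list."""
    ordered = sorted(samples)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


def summarize(latencies, quantiles=(5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99)):
    """Return the same kind of summary the log prints: count, percentiles, mean."""
    return {
        "requests": len(latencies),
        "percentiles": {q: percentile(latencies, q) for q in quantiles},
        "mean": sum(latencies) / len(latencies),
    }


# Toy latencies for illustration only; these are not the values from the log.
summary = summarize([1.0, 2.0, 3.0, 4.0, 5.0])
```

With 30 samples the 5th percentile sits at fractional index 1.45, so it depends only on the second and third fastest responses.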
run pipeline stage %s
Received healthy response to inference request in 1.9766221046447754s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.20s
Received healthy response to inference request in 2.037226676940918s
Shutdown handler de-registered
chaiml-glm-air-4-5-sft-_55881_v1 status is now deployed due to DeploymentManager action
chaiml-glm-air-4-5-sft-_55881_v1 status is now inactive due to auto deactivation removed underperforming models
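The `win_ratio` recorded in the metadata at the top of this log is consistent with `num_wins` divided by `num_battles`. A minimal sketch of that arithmetic follows; the function name is hypothetical, and the log does not show which threshold or metric triggered the auto deactivation.

```python
def win_ratio(num_wins: int, num_battles: int) -> float:
    """Fraction of battles won; raises if no battles were recorded."""
    if num_battles <= 0:
        raise ValueError("num_battles must be positive")
    return num_wins / num_battles


# Values taken from the num_wins and num_battles fields in the metadata.
ratio = win_ratio(6612, 13913)
```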