developer_uid: rirv938
submission_id: chaiml-glm-air-4-5-sft-_40077_v1
model_name: chaiml-glm-air-4-5-sft-_40077_v1
model_group: ChaiML/glm_air_4_5_sft_2
status: inactive
timestamp: 2026-02-28T18:17:27+00:00
num_battles: 12414
num_wins: 6048
celo_rating: 1290.46
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: basic
model_repo: ChaiML/glm_air_4_5_sft_20260227_not_q235_e2
model_architecture: Glm4MoeForCausalLM
model_num_parameters: 9073971200.0
best_of: 8
max_input_tokens: 2048
max_output_tokens: 64
reward_model: default
display_name: chaiml-glm-air-4-5-sft-_40077_v1
is_internal_developer: True
language_model: ChaiML/glm_air_4_5_sft_20260227_not_q235_e2
model_size: 9B
ranking_group: single
us_pacific_date: 2026-02-28
win_ratio: 0.4871918801353311
generation_params: {'temperature': 1.0, 'top_p': 0.95, 'min_p': 0.05, 'top_k': 60, 'presence_penalty': 0.1, 'frequency_penalty': 0.0, 'stopping_words': ['###', '</s>', '<|im_start|>', 'You:', '<|im_end|>'], 'max_input_tokens': 2048, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '[gMASK]<sop><|system|>\nRespond as a high quality storyteller.<|user|>\n', 'prompt_template': '', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '<|assistant|>\n<think></think>\n{bot_name}:', 'truncate_by_message': False}
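The formatter entry above lists five templates but not how they are joined. The sketch below shows one plausible assembly order (memory, prompt, chat turns, response cue). The function name `build_prompt`, the `(speaker, message)` turn format, and the names "Ava" and "Sam" are hypothetical and appear nowhere in the log; the real serving code may differ, for instance in how it truncates to `max_input_tokens`.

```python
# Template strings copied from the formatter entry in the log above.
FORMATTER = {
    "memory_template": "[gMASK]<sop><|system|>\nRespond as a high quality storyteller.<|user|>\n",
    "prompt_template": "",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "<|assistant|>\n<think></think>\n{bot_name}:",
}


def build_prompt(turns, bot_name, user_name, formatter=FORMATTER):
    """Join the templates in an ASSUMED order: memory, prompt, turns, response cue."""
    parts = [formatter["memory_template"], formatter["prompt_template"]]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(formatter["bot_template"].format(bot_name=bot_name, message=message))
        else:
            parts.append(formatter["user_template"].format(user_name=user_name, message=message))
    # The response template ends with "{bot_name}:" so the model continues in character.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Hypothetical two-turn conversation; no truncation is applied in this sketch.
prompt = build_prompt([("user", "Hi"), ("bot", "Hello")], "Ava", "Sam")
print(prompt)
```

Under this assumed order the prompt ends with an empty `<think></think>` block followed by the bot name, which would steer the model toward an immediate in-character reply rather than a reasoning trace.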
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage VLLMUploader
Starting job with name chaiml-glm-air-4-5-sft-40077-v1-uploader
Waiting for job on chaiml-glm-air-4-5-sft-40077-v1-uploader to finish
chaiml-glm-air-4-5-sft-55881-v1-uploader: Using quantization_mode: none
chaiml-glm-air-4-5-sft-55881-v1-uploader: Downloading snapshot of ChaiML/glm_air_4_5_sft_20260227_not_q235_e1...
chaiml-glm-air-4-5-sft-40077-v1-uploader: Using quantization_mode: none
chaiml-glm-air-4-5-sft-40077-v1-uploader: Downloading snapshot of ChaiML/glm_air_4_5_sft_20260227_not_q235_e2...
chaiml-glm-air-4-5-sft-55881-v1-uploader: Downloaded in 71.488s
chaiml-glm-air-4-5-sft-40077-v1-uploader: Downloaded in 70.687s
chaiml-glm-air-4-5-sft-55881-v1-uploader: Processed model ChaiML/glm_air_4_5_sft_20260227_not_q235_e1 in 150.479s
chaiml-glm-air-4-5-sft-55881-v1-uploader: creating bucket guanaco-vllm-models
chaiml-glm-air-4-5-sft-55881-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:56: SyntaxWarning: invalid escape sequence '\.'
chaiml-glm-air-4-5-sft-55881-v1-uploader: RE_S3_DATESTRING = re.compile('\.[0-9]*(?:[Z\\-\\+]*?)')
chaiml-glm-air-4-5-sft-55881-v1-uploader: /usr/lib/python3/dist-packages/S3/BaseUtils.py:57: SyntaxWarning: invalid escape sequence '\s'
chaiml-glm-air-4-5-sft-55881-v1-uploader: RE_XML_NAMESPACE = re.compile(b'^(<?[^>]+?>\s*|\s*)(<\w+) xmlns=[\'"](https?://[^\'"]+)[\'"]', re.MULTILINE)
chaiml-glm-air-4-5-sft-55881-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:240: SyntaxWarning: invalid escape sequence '\.'
chaiml-glm-air-4-5-sft-55881-v1-uploader: invalid = re.search("([^a-z0-9\.-])", bucket, re.UNICODE)
chaiml-glm-air-4-5-sft-55881-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:244: SyntaxWarning: invalid escape sequence '\.'
chaiml-glm-air-4-5-sft-55881-v1-uploader: invalid = re.search("([^A-Za-z0-9\._-])", bucket, re.UNICODE)
chaiml-glm-air-4-5-sft-55881-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:255: SyntaxWarning: invalid escape sequence '\.'
chaiml-glm-air-4-5-sft-55881-v1-uploader: if re.search("-\.", bucket, re.UNICODE):
chaiml-glm-air-4-5-sft-55881-v1-uploader: /usr/lib/python3/dist-packages/S3/Utils.py:257: SyntaxWarning: invalid escape sequence '\.'
chaiml-glm-air-4-5-sft-55881-v1-uploader: if re.search("\.\.", bucket, re.UNICODE):
chaiml-glm-air-4-5-sft-55881-v1-uploader: /usr/lib/python3/dist-packages/S3/S3Uri.py:155: SyntaxWarning: invalid escape sequence '\w'
chaiml-glm-air-4-5-sft-55881-v1-uploader: _re = re.compile("^(\w+://)?(.*)", re.UNICODE)
chaiml-glm-air-4-5-sft-40077-v1-uploader: Processed model ChaiML/glm_air_4_5_sft_20260227_not_q235_e2 in 151.425s
chaiml-glm-air-4-5-sft-55881-v1-uploader: /usr/lib/python3/dist-packages/S3/FileLists.py:480: SyntaxWarning: invalid escape sequence '\*'
chaiml-glm-air-4-5-sft-55881-v1-uploader: wildcard_split_result = re.split("\*|\?", uri_str, maxsplit=1)
chaiml-glm-air-4-5-sft-55881-v1-uploader: Bucket 's3://guanaco-vllm-models/' created
chaiml-glm-air-4-5-sft-55881-v1-uploader: uploading /dev/shm/model_output to s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/README.md s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/README.md
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/args.json s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/args.json
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/config.json s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/config.json
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/chat_template.jinja s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/chat_template.jinja
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model.safetensors.index.json
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/special_tokens_map.json
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/.gitattributes s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/.gitattributes
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/tokenizer.json s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/tokenizer.json
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/tokenizer_config.json s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/tokenizer_config.json
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model.safetensors.index.json s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model.safetensors.index.json
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/special_tokens_map.json s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/special_tokens_map.json
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00043-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00043-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00021-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00021-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00034-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00034-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00001-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00001-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00011-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00011-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00012-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00012-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00008-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00008-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00013-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00013-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00003-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00003-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00007-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00007-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00037-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00037-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00002-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00002-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00017-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00017-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00041-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00041-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00016-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00016-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00004-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00004-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00015-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00015-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00032-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00032-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00028-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00028-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00026-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00026-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00005-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00005-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00009-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00009-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00030-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00030-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00027-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00027-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00025-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00025-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00019-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00019-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00023-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00023-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00018-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00018-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00014-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00014-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00029-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00029-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00040-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00040-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00039-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00039-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00042-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00042-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00024-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00024-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00035-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00035-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00031-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00031-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00010-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00010-of-00043.safetensors
chaiml-glm-air-4-5-sft-55881-v1-uploader: cp /dev/shm/model_output/model-00038-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-55881-v1/default/model-00038-of-00043.safetensors
Job chaiml-glm-air-4-5-sft-55881-v1-uploader completed after 233.49s with status: succeeded
Stopping job with name chaiml-glm-air-4-5-sft-55881-v1-uploader
Pipeline stage VLLMUploader completed in 235.54s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 1.13s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-glm-air-4-5-sft-55881-v1
Waiting for inference service chaiml-glm-air-4-5-sft-55881-v1 to be ready
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00043-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00043-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00036-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00036-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00025-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00025-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00014-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00014-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00003-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00003-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00033-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00033-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00008-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00008-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00024-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00024-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00007-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00007-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00020-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00020-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00009-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00009-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00006-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00006-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00041-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00041-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00019-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00019-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00010-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00010-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00016-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00016-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00032-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00032-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00017-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00017-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00028-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00028-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00034-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00034-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00023-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00023-of-00043.safetensors
chaiml-glm-air-4-5-sft-40077-v1-uploader: cp /dev/shm/model_output/model-00015-of-00043.safetensors s3://guanaco-vllm-models/chaiml-glm-air-4-5-sft-40077-v1/default/model-00015-of-00043.safetensors
Job chaiml-glm-air-4-5-sft-40077-v1-uploader completed after 271.08s with status: succeeded
Stopping job with name chaiml-glm-air-4-5-sft-40077-v1-uploader
Pipeline stage VLLMUploader completed in 272.87s
run pipeline stage %s
Running pipeline stage VLLMTemplater
Pipeline stage VLLMTemplater completed in 1.04s
run pipeline stage %s
Running pipeline stage VLLMDeployer
Creating inference service chaiml-glm-air-4-5-sft-40077-v1
Waiting for inference service chaiml-glm-air-4-5-sft-40077-v1 to be ready
Inference service chaiml-glm-air-4-5-sft-55881-v1 ready after 548.2415251731873s
Pipeline stage VLLMDeployer completed in 550.04s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.4504988193511963s
Received healthy response to inference request in 1.9575912952423096s
Received healthy response to inference request in 1.8580811023712158s
Received healthy response to inference request in 1.9933357238769531s
Received healthy response to inference request in 1.8157002925872803s
Received healthy response to inference request in 1.9335134029388428s
Received healthy response to inference request in 1.7962396144866943s
Received healthy response to inference request in 2.298945903778076s
Received healthy response to inference request in 1.8634896278381348s
Received healthy response to inference request in 1.8585729598999023s
Received healthy response to inference request in 2.026310443878174s
Received healthy response to inference request in 1.900045394897461s
Received healthy response to inference request in 1.8919684886932373s
Received healthy response to inference request in 2.034473419189453s
Received healthy response to inference request in 2.083508014678955s
Received healthy response to inference request in 2.2027766704559326s
Received healthy response to inference request in 1.8159234523773193s
Received healthy response to inference request in 2.9188904762268066s
Received healthy response to inference request in 2.9295387268066406s
Received healthy response to inference request in 1.9449183940887451s
Received healthy response to inference request in 2.131909132003784s
Received healthy response to inference request in 1.9095537662506104s
Received healthy response to inference request in 2.5048627853393555s
Received healthy response to inference request in 1.8255126476287842s
Received healthy response to inference request in 2.1036412715911865s
Inference service chaiml-glm-air-4-5-sft-40077-v1 ready after 568.0828440189362s
Pipeline stage VLLMDeployer completed in 570.70s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.896056890487671s
Received healthy response to inference request in 1.941697359085083s
Received healthy response to inference request in 1.8913304805755615s
Received healthy response to inference request in 1.8320143222808838s
Received healthy response to inference request in 2.350023031234741s
Received healthy response to inference request in 2.883476734161377s
Received healthy response to inference request in 1.9336893558502197s
Received healthy response to inference request in 1.8863286972045898s
30 requests
Received healthy response to inference request in 2.118736505508423s
0 failed requests
5th percentile: 1.815800714492798
10th percentile: 1.8245537281036377
20th percentile: 1.8584745883941651
30th percentile: 1.88982994556427
40th percentile: 1.898449993133545
50th percentile: 1.939215898513794
60th percentile: 2.0065256118774415
Received healthy response to inference request in 1.980151891708374s
70th percentile: 2.0895479917526245
80th percentile: 2.2220105171203617
90th percentile: 2.5427241802215583
95th percentile: 2.902954292297363
99th percentile: 2.9264507341384887
mean time: 2.0813002983729043
Pipeline stage StressChecker completed in 78.85s
run pipeline stage %s
Received healthy response to inference request in 1.9766221046447754s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.20s
Received healthy response to inference request in 2.037226676940918s
Shutdown handler de-registered
chaiml-glm-air-4-5-sft-_55881_v1 status is now deployed due to DeploymentManager action
Received healthy response to inference request in 1.878943681716919s
Received healthy response to inference request in 2.8904991149902344s
Received healthy response to inference request in 1.9922716617584229s
Received healthy response to inference request in 1.847346544265747s
Received healthy response to inference request in 2.96293044090271s
Received healthy response to inference request in 1.9305431842803955s
Received healthy response to inference request in 2.593675374984741s
Received healthy response to inference request in 2.138899087905884s
Received healthy response to inference request in 1.8863465785980225s
Received healthy response to inference request in 1.9323821067810059s
Received healthy response to inference request in 1.828660011291504s
Received healthy response to inference request in 2.015688896179199s
Received healthy response to inference request in 1.8210515975952148s
Received healthy response to inference request in 1.9828503131866455s
Received healthy response to inference request in 1.8493807315826416s
Received healthy response to inference request in 2.1622610092163086s
Received healthy response to inference request in 1.925217866897583s
Received healthy response to inference request in 2.4918441772460938s
Received healthy response to inference request in 2.024233102798462s
Received healthy response to inference request in 2.1953625679016113s
Received healthy response to inference request in 2.133241653442383s
Received healthy response to inference request in 1.8951447010040283s
Received healthy response to inference request in 1.822986125946045s
30 requests
0 failed requests
5th percentile: 1.8255393743515014
10th percentile: 1.8454778909683227
20th percentile: 1.8848659992218018
30th percentile: 1.9289455890655518
40th percentile: 1.9384941577911377
50th percentile: 1.9815011024475098
60th percentile: 2.0191065788269045
70th percentile: 2.1230880498886107
80th percentile: 2.168881320953369
90th percentile: 2.502027297019959
95th percentile: 2.7569284319877614
99th percentile: 2.9419253563880923
mean time: 2.0846635818481447
Pipeline stage StressChecker completed in 90.07s
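The log does not state how the StressChecker percentiles are computed. The reported figures are consistent with linear interpolation at rank p/100 * (n - 1) over the sorted latencies, which is an inference from the numbers and not something the log confirms. A minimal sketch under that assumption, where the function name `percentile` is illustrative:

```python
import math


def percentile(values, p):
    """Linear-interpolation percentile at rank p/100 * (n - 1) of the sorted values."""
    ordered = sorted(values)
    rank = (p / 100) * (len(ordered) - 1)
    lower = math.floor(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])


# The three smallest latencies logged for the second StressChecker run.
# The 27 values of 2.0 are placeholders (NOT from the log) that pad n to 30;
# they do not affect the 5th percentile, which falls between ranks 1 and 2.
sample = [1.8210515975952148, 1.822986125946045, 1.828660011291504] + [2.0] * 27
p5 = percentile(sample, 5)
print(p5)
```

With 30 requests the 5th percentile sits at rank 1.45, between the second and third smallest latencies, and the result here agrees with the logged value of 1.8255393743515014 to within floating-point rounding.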
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.41s
Shutdown handler de-registered
chaiml-glm-air-4-5-sft-_40077_v1 status is now deployed due to DeploymentManager action
chaiml-glm-air-4-5-sft-_40077_v1 status is now inactive due to auto deactivation removed underperforming models