developer_uid: chai_backend_admin
submission_id: function_hofub_2025-12-23
model_name: abtest_blend
model_group:
status: torndown
timestamp: 2025-12-26T03:51:22+00:00
num_battles: 6097
num_wins: 3444
celo_rating: 1338.52
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: abtest_blend
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-25
win_ratio: 0.5648679678530425
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.7804720401763916s
Received healthy response to inference request in 1.914491891860962s
Received healthy response to inference request in 1.7214436531066895s
Received healthy response to inference request in 1.5318107604980469s
Received healthy response to inference request in 2.8167507648468018s
Received healthy response to inference request in 1.7867512702941895s
Received healthy response to inference request in 1.5654237270355225s
Received healthy response to inference request in 1.6332063674926758s
Received healthy response to inference request in 1.5702457427978516s
Received healthy response to inference request in 2.018683433532715s
10 requests
0 failed requests
5th percentile: 1.5469365954399108
10th percentile: 1.562062430381775
20th percentile: 1.5692813396453857
30th percentile: 1.6143181800842286
40th percentile: 1.686148738861084
50th percentile: 1.7509578466415405
60th percentile: 1.7829837322235107
70th percentile: 1.8250734567642213
80th percentile: 1.9353302001953125
90th percentile: 2.0984901666641234
95th percentile: 2.4576204657554617
99th percentile: 2.744924705028534
mean time: 1.8339279651641847
Pipeline stage StressChecker completed in 19.85s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.71s
Shutdown handler de-registered
function_hofub_2025-12-23 status is now deployed due to DeploymentManager action
function_hofub_2025-12-23 status is now inactive due to auto deactivation removed underperforming models
function_hofub_2025-12-23 status is now torndown due to DeploymentManager action