developer_uid: chai_backend_admin
submission_id: function_nudaf_2026-01-10
model_name: abtest_blend
model_group:
status: torndown
timestamp: 2026-01-14T16:59:58+00:00
num_battles: 12544
num_wins: 6343
celo_rating: 1299.84
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: abtest_blend
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-01-14
win_ratio: 0.5056600765306123
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.5792834758758545s
Received healthy response to inference request in 3.417722702026367s
Received healthy response to inference request in 1.8899214267730713s
Received healthy response to inference request in 1.8400452136993408s
Received healthy response to inference request in 2.894944429397583s
Received healthy response to inference request in 2.0738110542297363s
Received healthy response to inference request in 2.3137717247009277s
Received healthy response to inference request in 2.5961103439331055s
Received healthy response to inference request in 1.9780282974243164s
Received healthy response to inference request in 2.2240774631500244s
10 requests
0 failed requests
5th percentile: 1.8624895095825196
10th percentile: 1.8849338054656983
20th percentile: 1.9604069232940673
30th percentile: 2.0450762271881104
40th percentile: 2.1639708995819094
50th percentile: 2.268924593925476
60th percentile: 2.4267071723937987
70th percentile: 2.6857605695724485
80th percentile: 2.99950008392334
90th percentile: 3.5338787794113156
95th percentile: 4.056581127643584
99th percentile: 4.474743006229401
mean time: 2.5807716131210325
Pipeline stage StressChecker completed in 27.23s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_nudaf_2026-01-10 status is now deployed due to DeploymentManager action
function_nudaf_2026-01-10 status is now inactive due to auto deactivation removed underperforming models
function_nudaf_2026-01-10 status is now torndown due to DeploymentManager action