developer_uid: chai_backend_admin
submission_id: function_pabel_2025-12-23
model_name: abtest_blend
model_group:
status: torndown
timestamp: 2025-12-26T05:21:27+00:00
num_battles: 6687
num_wins: 3787
celo_rating: 1339.62
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: abtest_blend
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-25
win_ratio: 0.5663227157170629
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
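The formatter templates above can be read as pieces of a single prompt string. The following is a minimal sketch of one plausible assembly order (memory, prompt, chat turns, then the response header). The function `build_prompt`, its arguments, and the assembly order are assumptions for illustration and are not confirmed by this record; truncation via `max_input_tokens` and `truncate_by_message` is not modeled.

```python
# Templates copied from the `formatter` field of this record.
formatter = {
    'memory_template': '### Instruction:\n{memory}\n',
    'prompt_template': '### Input:\n{prompt}\n',
    'bot_template': '{bot_name}: {message}\n',
    'user_template': '{user_name}: {message}\n',
    'response_template': '### Response:\n{bot_name}:',
}


def build_prompt(formatter, bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates into one prompt string.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either 'bot' or 'user'. The ordering used here is an assumption.
    """
    parts = [
        formatter['memory_template'].format(memory=memory),
        formatter['prompt_template'].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == 'bot':
            parts.append(formatter['bot_template'].format(
                bot_name=bot_name, message=message))
        else:
            parts.append(formatter['user_template'].format(
                user_name=user_name, message=message))
    # The response header ends with "{bot_name}:" so the model continues
    # the bot's next line.
    parts.append(formatter['response_template'].format(bot_name=bot_name))
    return ''.join(parts)


example = build_prompt(
    formatter,
    bot_name='Bot',
    user_name='Alice',
    memory='M',
    prompt='P',
    turns=[('user', 'hi'), ('bot', 'hello')],
)
print(example)
```

Because `stopping_words` in `generation_params` is `['\n']`, generation would presumably stop at the first newline, so each reply stays on a single line after the `{bot_name}:` header.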
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.7272577285766602s
Received healthy response to inference request in 1.5574469566345215s
Received healthy response to inference request in 1.8314361572265625s
Received healthy response to inference request in 2.227487087249756s
Received healthy response to inference request in 1.7119364738464355s
Received healthy response to inference request in 1.5640935897827148s
Received healthy response to inference request in 1.870274543762207s
Received healthy response to inference request in 1.6305465698242188s
Received healthy response to inference request in 1.5525169372558594s
Received healthy response to inference request in 1.8917016983032227s
10 requests
0 failed requests
5th percentile: 1.5547354459762572
10th percentile: 1.5569539546966553
20th percentile: 1.5627642631530763
30th percentile: 1.6106106758117675
40th percentile: 1.679380512237549
50th percentile: 1.7195971012115479
60th percentile: 1.768929100036621
70th percentile: 1.8430876731872559
80th percentile: 1.8745599746704102
90th percentile: 1.9252802371978759
95th percentile: 2.0763836622238157
99th percentile: 2.197266402244568
mean time: 1.7564697742462159
Pipeline stage StressChecker completed in 18.77s
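The StressChecker percentiles above are consistent with linear interpolation between the sorted latencies (the same rule NumPy uses by default). A minimal sketch reproducing them from the ten logged response times follows; the helper `percentile` is illustrative and is not taken from the pipeline's own code.

```python
import math
from statistics import fmean

# The ten response times (seconds) logged by the StressChecker stage.
latencies = [
    1.7272577285766602, 1.5574469566345215, 1.8314361572265625,
    2.227487087249756, 1.7119364738464355, 1.5640935897827148,
    1.870274543762207, 1.6305465698242188, 1.5525169372558594,
    1.8917016983032227,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    rank = (q / 100) * (len(ordered) - 1)   # fractional index into the sorted list
    lower = math.floor(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {fmean(latencies)}")
```

With only ten samples, the 95th and 99th percentiles are interpolated between the two slowest requests, so they are driven almost entirely by the single 2.23 s response.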
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.57s
Shutdown handler de-registered
function_pabel_2025-12-23 status is now deployed due to DeploymentManager action
function_pabel_2025-12-23 status is now inactive due to auto deactivation removed underperforming models
function_pabel_2025-12-23 status is now torndown due to DeploymentManager action