developer_uid: chai_backend_admin
submission_id: function_dikul_2025-12-23
model_name: abtest_blend
model_group:
status: torndown
timestamp: 2025-12-26T02:31:39+00:00
num_battles: 5741
num_wins: 3390
celo_rating: 1348.75
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: abtest_blend
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-22
win_ratio: 0.5904894617662428
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.5636861324310303s
Received healthy response to inference request in 1.744356632232666s
Received healthy response to inference request in 1.8505125045776367s
Received healthy response to inference request in 1.7866981029510498s
Received healthy response to inference request in 2.4549074172973633s
Received healthy response to inference request in 2.371910333633423s
Received healthy response to inference request in 2.3854122161865234s
Received healthy response to inference request in 2.360060214996338s
Received healthy response to inference request in 1.7038209438323975s
Received healthy response to inference request in 2.7918081283569336s
10 requests
0 failed requests
5th percentile: 1.7220620036125183
10th percentile: 1.740303063392639
20th percentile: 1.7782298088073731
30th percentile: 1.8313681840896607
40th percentile: 2.1562411308288576
50th percentile: 2.3659852743148804
60th percentile: 2.377311086654663
70th percentile: 2.406260776519775
80th percentile: 2.4766631603240965
90th percentile: 2.5864983320236203
95th percentile: 2.6891532301902767
99th percentile: 2.7712771487236023
mean time: 2.2013172626495363
Pipeline stage StressChecker completed in 23.25s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.66s
Shutdown handler de-registered
function_dikul_2025-12-23 status is now deployed due to DeploymentManager action
function_dikul_2025-12-23 status is now inactive due to auto deactivation removed underperforming models
function_dikul_2025-12-23 status is now torndown due to DeploymentManager action