developer_uid: chai_backend_admin
submission_id: function_jedam_2025-12-23
model_name: abtest_blend
model_group:
status: torndown
timestamp: 2025-12-26T07:31:46+00:00
num_battles: 6729
num_wins: 4034
celo_rating: 1363.38
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: abtest_blend
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-25
win_ratio: 0.5994947243275375
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.9789881706237793s
Received healthy response to inference request in 2.1296212673187256s
Received healthy response to inference request in 2.915422201156616s
Received healthy response to inference request in 2.1382055282592773s
Received healthy response to inference request in 2.45625376701355s
Received healthy response to inference request in 2.043876886367798s
Received healthy response to inference request in 2.137742757797241s
Received healthy response to inference request in 3.5652413368225098s
Received healthy response to inference request in 2.946877956390381s
Received healthy response to inference request in 2.113227605819702s
10 requests
0 failed requests
5th percentile: 2.0750847101211547
10th percentile: 2.1062925338745115
20th percentile: 2.126342535018921
30th percentile: 2.1353063106536867
40th percentile: 2.138020420074463
50th percentile: 2.2972296476364136
60th percentile: 2.639921140670776
70th percentile: 2.9248589277267456
80th percentile: 2.9532999992370605
90th percentile: 3.037613487243652
95th percentile: 3.3014274120330804
99th percentile: 3.512478551864624
mean time: 2.542545747756958
Pipeline stage StressChecker completed in 26.73s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
function_jedam_2025-12-23 status is now deployed due to DeploymentManager action
function_jedam_2025-12-23 status is now inactive due to auto deactivation removed underperforming models
function_jedam_2025-12-23 status is now torndown due to DeploymentManager action