developer_uid: chai_backend_admin
submission_id: function_jolin_2025-12-23
model_name: abtest_blend
model_group:
status: torndown
timestamp: 2025-12-26T07:31:46+00:00
num_battles: 6634
num_wins: 3866
celo_rating: 1351.24
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: abtest_blend
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-25
win_ratio: 0.582755501959602
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.6492400169372559s
Received healthy response to inference request in 1.6049587726593018s
Received healthy response to inference request in 1.5886542797088623s
Received healthy response to inference request in 1.5318419933319092s
Received healthy response to inference request in 1.6874263286590576s
Received healthy response to inference request in 2.8265013694763184s
Received healthy response to inference request in 2.4551639556884766s
Received healthy response to inference request in 2.183685302734375s
Received healthy response to inference request in 1.6026480197906494s
Received healthy response to inference request in 1.6217067241668701s
10 requests
0 failed requests
5th percentile: 1.557407522201538
10th percentile: 1.582973051071167
20th percentile: 1.599849271774292
30th percentile: 1.6042655467987061
40th percentile: 1.6150075435638427
50th percentile: 1.635473370552063
60th percentile: 1.6645145416259766
70th percentile: 1.8363040208816528
80th percentile: 2.2379810333251955
90th percentile: 2.4922976970672606
95th percentile: 2.659399533271789
99th percentile: 2.7930810022354127
mean time: 1.8751826763153077
Pipeline stage StressChecker completed in 20.40s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.61s
Shutdown handler de-registered
function_jolin_2025-12-23 status is now deployed due to DeploymentManager action
function_jolin_2025-12-23 status is now inactive due to auto deactivation removed underperforming models
function_jolin_2025-12-23 status is now torndown due to DeploymentManager action