developer_uid: chai_backend_admin
submission_id: function_benit_2025-12-17
model_name: function_benit_2025-12-17
model_group:
status: torndown
timestamp: 2025-12-20T22:21:18+00:00
num_battles: 6110
num_wins: 3224
celo_rating: 1312.49
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_benit_2025-12-17
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.5276595744680851
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.8564236164093018s
Received healthy response to inference request in 1.1747064590454102s
Received healthy response to inference request in 2.2559432983398438s
Received healthy response to inference request in 1.8997218608856201s
Received healthy response to inference request in 2.6218316555023193s
Received healthy response to inference request in 4.5463831424713135s
Received healthy response to inference request in 1.9275553226470947s
Received healthy response to inference request in 1.496539831161499s
Received healthy response to inference request in 3.24859356880188s
Received healthy response to inference request in 3.0380733013153076s
10 requests
0 failed requests
5th percentile: 1.3195314764976502
10th percentile: 1.46435649394989
20th percentile: 1.7844468593597411
30th percentile: 1.8867323875427247
40th percentile: 1.916421937942505
50th percentile: 2.0917493104934692
60th percentile: 2.4022986412048337
70th percentile: 2.7467041492462156
80th percentile: 3.080177354812622
90th percentile: 3.378372526168823
95th percentile: 3.962377834320067
99th percentile: 4.429582080841064
mean time: 2.406577205657959
Pipeline stage StressChecker completed in 25.33s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.57s
Shutdown handler de-registered
function_benit_2025-12-17 status is now deployed due to DeploymentManager action
function_benit_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_benit_2025-12-17 status is now torndown due to DeploymentManager action