developer_uid: chai_backend_admin
submission_id: function_surek_2025-12-19
model_name: function_surek_2025-12-19
model_group:
status: torndown
timestamp: 2025-12-22T02:41:11+00:00
num_battles: 5459
num_wins: 3117
celo_rating: 1344.29
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_surek_2025-12-19
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-18
win_ratio: 0.5709836966477376
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.6879100799560547s
Received healthy response to inference request in 2.603616714477539s
Received healthy response to inference request in 2.66166090965271s
Received healthy response to inference request in 2.8050942420959473s
Received healthy response to inference request in 2.5648276805877686s
Received healthy response to inference request in 2.344757080078125s
Received healthy response to inference request in 2.374330997467041s
Received healthy response to inference request in 2.377763271331787s
Received healthy response to inference request in 3.735405921936035s
Received healthy response to inference request in 2.7846076488494873s
10 requests
0 failed requests
5th percentile: 2.3580653429031373
10th percentile: 2.3713736057281496
20th percentile: 2.377076816558838
30th percentile: 2.508708357810974
40th percentile: 2.588101100921631
50th percentile: 2.6326388120651245
60th percentile: 2.672160577774048
70th percentile: 2.7169193506240843
80th percentile: 2.7887049674987794
90th percentile: 2.8981254100799556
95th percentile: 3.3167656660079947
99th percentile: 3.6516778707504276
mean time: 2.6939974546432497
Pipeline stage StressChecker completed in 28.35s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_surek_2025-12-19 status is now deployed due to DeploymentManager action
function_surek_2025-12-19 status is now inactive due to auto deactivation removed underperforming models
function_surek_2025-12-19 status is now torndown due to DeploymentManager action