function_sahen_2026-01-26

developer_uid: chai_backend_admin

submission_id: function_sahen_2026-01-26

model_name: function_sahen_2026-01-26

model_group:

status: inactive

timestamp: 2026-01-26T04:17:23+00:00

num_battles: 11318

num_wins: 4454

celo_rating: 1236.93

family_friendly_score: 0.6434

family_friendly_standard_error: 0.006774015648048061

submission_type: function

display_name: function_sahen_2026-01-26

is_internal_developer: True

ranking_group: single

us_pacific_date: 2026-01-25

win_ratio: 0.3935324262237144

generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}

formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}

Resubmit model

Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.2110099792480469s
Received healthy response to inference request in 1.2785229682922363s
Received healthy response to inference request in 0.8744986057281494s
Received healthy response to inference request in 1.0225722789764404s
Received healthy response to inference request in 1.018669605255127s
Received healthy response to inference request in 1.2457728385925293s
Received healthy response to inference request in 1.6223735809326172s
Received healthy response to inference request in 1.0285515785217285s
Received healthy response to inference request in 2.2454521656036377s
Received healthy response to inference request in 1.2467081546783447s
10 requests
0 failed requests
5th percentile: 0.9393755555152893
10th percentile: 1.0042525053024292
20th percentile: 1.0217917442321778
30th percentile: 1.026757788658142
40th percentile: 1.1380266189575194
50th percentile: 1.228391408920288
60th percentile: 1.2461469650268555
70th percentile: 1.2562525987625122
80th percentile: 1.3472930908203127
90th percentile: 1.684681439399719
95th percentile: 1.9650668025016778
99th percentile: 2.189375092983246
mean time: 1.2794131755828857
Pipeline stage StressChecker completed in 14.89s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_sahen_2026-01-26 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 1163.58s
Shutdown handler de-registered
function_sahen_2026-01-26 status is now inactive due to auto deactivation removed underperforming models