developer_uid: chai_backend_admin
submission_id: function_fisib_2026-01-02
model_name: abtest_blend
model_group:
status: torndown
timestamp: 2026-01-05T08:56:36+00:00
num_battles: 2142
num_wins: 1018
celo_rating: 9999.0
family_friendly_score: 0.5482
family_friendly_standard_error: 0.007038135548566822
submission_type: function
display_name: abtest_blend
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-01-02
win_ratio: 0.4752567693744164
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
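As a quick sanity check on the metadata above, the reported `win_ratio` appears to be `num_wins` divided by `num_battles`. The sketch below only redoes that arithmetic with the values listed; the variable names mirror the metadata keys, and the assumption that the field is computed this way is an inference from the numbers, not something the record states.

```python
# Values copied from the metadata block above.
num_battles = 2142
num_wins = 1018
reported_win_ratio = 0.4752567693744164

# Assumption: win_ratio is the plain fraction of battles won.
win_ratio = num_wins / num_battles

# Difference between the recomputed and the reported value.
difference = abs(win_ratio - reported_win_ratio)
print(f"recomputed win_ratio={win_ratio!r}, difference={difference:.3e}")
```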
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.2772364616394043s
Received healthy response to inference request in 1.4678974151611328s
Received healthy response to inference request in 1.96207594871521s
Received healthy response to inference request in 1.601647138595581s
Received healthy response to inference request in 0.8884875774383545s
Received healthy response to inference request in 1.1863734722137451s
Received healthy response to inference request in 1.8624351024627686s
Received healthy response to inference request in 1.237335443496704s
Received healthy response to inference request in 1.6575379371643066s
Received healthy response to inference request in 1.0785572528839111s
10 requests
0 failed requests
5th percentile: 0.974018931388855
10th percentile: 1.0595502853393555
20th percentile: 1.1648102283477784
30th percentile: 1.2220468521118164
40th percentile: 1.2612760543823243
50th percentile: 1.3725669384002686
60th percentile: 1.521397304534912
70th percentile: 1.6184143781661988
80th percentile: 1.698517370223999
90th percentile: 1.8723991870880126
95th percentile: 1.9172375679016112
99th percentile: 1.9531082725524902
mean time: 1.4219583749771119
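The StressChecker summary above is consistent with percentiles computed by linear interpolation over the ten logged response times. The following is a minimal sketch of that calculation, assuming linear interpolation between order statistics; the `percentile` helper is hypothetical and is not taken from the pipeline's own code.

```python
# Response times (seconds) copied from the ten "healthy response" log lines.
latencies = [
    1.2772364616394043, 1.4678974151611328, 1.96207594871521,
    1.601647138595581, 0.8884875774383545, 1.1863734722137451,
    1.8624351024627686, 1.237335443496704, 1.6575379371643066,
    1.0785572528839111,
]


def percentile(samples, q):
    """Return the q-th percentile (0-100) using linear interpolation.

    Hypothetical helper: assumes the fractional rank (n - 1) * q / 100
    is interpolated between the two neighbouring sorted samples.
    """
    ordered = sorted(samples)
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)                          # index at or below the rank
    upper = min(lower + 1, len(ordered) - 1)   # next index, clamped to the end
    frac = rank - lower                        # distance between the two
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])


mean_time = sum(latencies) / len(latencies)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only ten samples, the high percentiles are interpolated between the two slowest requests, so they say little about tail latency under real load.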
Pipeline stage StressChecker completed in 15.67s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.54s
Shutdown handler de-registered
function_fisib_2026-01-02 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 1586.49s
Shutdown handler de-registered
function_fisib_2026-01-02 status is now inactive due to admin request
function_fisib_2026-01-02 status is now torndown due to DeploymentManager action