developer_uid: chai_backend_admin
submission_id: function_kohir_2026-01-19
model_name: abtest_tai
model_group:
status: torndown
timestamp: 2026-01-22T14:43:15+00:00
num_battles: 8722
num_wins: 4418
celo_rating: 1312.54
family_friendly_score: 0.562
family_friendly_standard_error: 0.00701649485141976
submission_type: function
display_name: abtest_tai
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-01-19
win_ratio: 0.5065351983490025
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
run pipeline %s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
run pipeline stage %s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Running pipeline stage StressChecker
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 1.7366321086883545s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 1.3882036209106445s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 1.270505428314209s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 1.8191354274749756s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 1.2076997756958008s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 1.4447021484375s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 1.3280017375946045s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 1.2792026996612549s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 2.410984992980957s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 1.335472583770752s
10 requests
0 failed requests
5th percentile: 1.2359623193740845
10th percentile: 1.2642248630523683
20th percentile: 1.2774632453918457
30th percentile: 1.3133620262145995
40th percentile: 1.332484245300293
50th percentile: 1.3618381023406982
Falling back to EndpointApi.from_submission implementation
60th percentile: 1.4108030319213867
Falling back to EndpointApi.from_submission implementation
70th percentile: 1.5322811365127562
Falling back to EndpointApi.from_submission implementation
80th percentile: 1.7531327724456787
Falling back to EndpointApi.from_submission implementation
90th percentile: 1.8783203840255736
95th percentile: 2.144652688503265
99th percentile: 2.3577185320854186
mean time: 1.5220540523529054
Falling back to EndpointApi.from_submission implementation
Pipeline stage StressChecker completed in 17.75s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
run pipeline stage %s
Falling back to EndpointApi.from_submission implementation
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
run_pipeline:run_in_cloud %s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
starting trigger_guanaco_pipeline args=%s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.36s
Shutdown handler de-registered
Falling back to EndpointApi.from_submission implementation
function_kohir_2026-01-19 status is now deployed due to DeploymentManager action
Falling back to EndpointApi.from_submission implementation
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2391.72s
Shutdown handler de-registered
Falling back to EndpointApi.from_submission implementation
function_kohir_2026-01-19 status is now inactive due to admin request
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
function_kohir_2026-01-19 status is now torndown due to DeploymentManager action