developer_uid: chai_backend_admin
submission_id: function_bubam_2026-01-11
model_name: function_bubam_2026-01-11
model_group:
status: torndown
timestamp: 2026-01-14T17:18:43+00:00
num_battles: 5
num_wins: 5
family_friendly_score: 0.5167999999999999
family_friendly_standard_error: 0.00706707520831638
submission_type: function
display_name: function_bubam_2026-01-11
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-01-11
win_ratio: 1.0
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
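A minimal sketch of how the formatter templates above might be combined into a single model input. The assembly order (memory, prompt, conversation turns, response header) and the helper name `build_prompt` are assumptions for illustration, not the platform's confirmed behavior; truncation to `max_input_tokens` is not modeled.

```python
# Templates copied from the formatter entry above.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(formatter, memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: concatenate the filled templates in order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues as the bot.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    FORMATTER,
    memory="Be kind.",
    prompt="A chat.",
    turns=[("user", "Hi"), ("bot", "Hello")],
    bot_name="Bot",
    user_name="User",
)
print(example)
```

Because `stopping_words` is `['\n']`, generation would end at the first newline, so under this layout each completion is a single line continuing the final `{bot_name}:` header.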
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.500800132751465s
Received healthy response to inference request in 2.692002773284912s
Received healthy response to inference request in 2.4711267948150635s
Received healthy response to inference request in 2.2765047550201416s
Received healthy response to inference request in 2.480764627456665s
Received healthy response to inference request in 3.0124709606170654s
Received healthy response to inference request in 3.8743135929107666s
Received healthy response to inference request in 2.0828311443328857s
Received healthy response to inference request in 2.8076648712158203s
Received healthy response to inference request in 2.2166907787323s
10 requests
0 failed requests
5th percentile: 2.143067979812622
10th percentile: 2.203304815292358
20th percentile: 2.2645419597625733
30th percentile: 2.4127401828765866
40th percentile: 2.4769094944000245
50th percentile: 2.490782380104065
60th percentile: 2.5772811889648435
70th percentile: 2.7267014026641845
80th percentile: 2.8486260890960695
90th percentile: 3.098655223846435
95th percentile: 3.4864844083786
99th percentile: 3.7967477560043337
mean time: 2.6415170431137085
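The percentile and mean figures above are consistent with linear interpolation between closest ranks over the ten logged response times. The sketch below reproduces them under that assumption; the `percentile` helper is illustrative and is not taken from the StressChecker source.

```python
import math

# The ten response times (seconds) logged by the StressChecker stage.
latencies = [
    2.500800132751465,
    2.692002773284912,
    2.4711267948150635,
    2.2765047550201416,
    2.480764627456665,
    3.0124709606170654,
    3.8743135929107666,
    2.0828311443328857,
    2.8076648712158203,
    2.2166907787323,
]


def percentile(samples, q):
    """Return the q-th percentile (0-100) by linear interpolation
    between the two closest ranks of the sorted samples."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


p5 = percentile(latencies, 5)
p50 = percentile(latencies, 50)
p90 = percentile(latencies, 90)
mean_time = sum(latencies) / len(latencies)

print(f"5th percentile: {p5}")
print(f"50th percentile: {p50}")
print(f"90th percentile: {p90}")
print(f"mean time: {mean_time}")
```

With only ten samples, the upper percentiles are driven almost entirely by the single slowest request (3.87 s), so the 95th and 99th percentile values should be read as rough indicators.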
Pipeline stage StressChecker completed in 27.73s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_bubam_2026-01-11 status is now deployed due to DeploymentManager action
function_bubam_2026-01-11 status is now inactive due to admin request
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 3081.46s
Shutdown handler de-registered
function_bubam_2026-01-11 status is now torndown due to DeploymentManager action