developer_uid: chai_backend_admin
submission_id: function_bonaf_2025-12-03
model_name: function_bonaf_2025-12-03
model_group:
status: inactive
timestamp: 2025-12-03T00:39:32+00:00
num_battles: 9070
num_wins: 4739
celo_rating: 1304.8
family_friendly_score: 0.5452
family_friendly_standard_error: 0.007042115591212629
submission_type: function
display_name: function_bonaf_2025-12-03
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-02
win_ratio: 0.5224917309812569
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.3710126876831055s
Received healthy response to inference request in 1.5436875820159912s
Received healthy response to inference request in 2.4331858158111572s
Received healthy response to inference request in 2.5328054428100586s
Received healthy response to inference request in 2.9306819438934326s
Received healthy response to inference request in 3.0810160636901855s
Received healthy response to inference request in 3.1304447650909424s
Received healthy response to inference request in 3.3381662368774414s
Received healthy response to inference request in 3.3702404499053955s
Received healthy response to inference request in 3.295738697052002s
10 requests
0 failed requests
5th percentile: 1.448716390132904
10th percentile: 1.5264200925827027
20th percentile: 2.255286169052124
30th percentile: 2.502919554710388
40th percentile: 2.771531343460083
50th percentile: 3.005849003791809
60th percentile: 3.100787544250488
70th percentile: 3.1800329446792603
80th percentile: 3.30422420501709
90th percentile: 3.341373658180237
95th percentile: 3.355807054042816
99th percentile: 3.36735377073288
mean time: 2.7026979684829713
Pipeline stage StressChecker completed in 28.49s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.61s
Shutdown handler de-registered
function_bonaf_2025-12-03 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2395.19s
Shutdown handler de-registered