developer_uid: NischayDnk
submission_id: function_bafis_2025-07-09
model_name: function_bafis_2025-07-09
model_group:
status: torndown
timestamp: 2025-07-09T21:06:28+00:00
num_battles: 9841
num_wins: 4911
celo_rating: 1286.6
family_friendly_score: 0.5376000000000001
family_friendly_standard_error: 0.007051045879867752
submission_type: function
display_name: function_bafis_2025-07-09
is_internal_developer: False
ranking_group: single
us_pacific_date: 2025-07-09
win_ratio: 0.4990346509501067
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 5.479592561721802s
Received healthy response to inference request in 4.836116552352905s
Received healthy response to inference request in 4.0309343338012695s
Received healthy response to inference request in 3.3344948291778564s
Received healthy response to inference request in 4.088778495788574s
5 requests
0 failed requests
5th percentile: 3.4737827301025392
10th percentile: 3.6130706310272216
20th percentile: 3.8916464328765867
30th percentile: 4.04250316619873
40th percentile: 4.065640830993653
50th percentile: 4.088778495788574
60th percentile: 4.3877137184143065
70th percentile: 4.686648941040039
80th percentile: 4.964811754226685
90th percentile: 5.222202157974243
95th percentile: 5.350897359848022
99th percentile: 5.453853521347046
mean time: 4.353983354568482
%s, retrying in %s seconds...
Received healthy response to inference request in 2.218885660171509s
Received healthy response to inference request in 3.593377113342285s
Received healthy response to inference request in 3.5868003368377686s
Received healthy response to inference request in 2.6959574222564697s
Received healthy response to inference request in 3.6690475940704346s
5 requests
0 failed requests
5th percentile: 2.314300012588501
10th percentile: 2.409714365005493
20th percentile: 2.6005430698394774
30th percentile: 2.8741260051727293
40th percentile: 3.230463171005249
50th percentile: 3.5868003368377686
60th percentile: 3.589431047439575
70th percentile: 3.592061758041382
80th percentile: 3.608511209487915
90th percentile: 3.638779401779175
95th percentile: 3.6539134979248047
99th percentile: 3.6660207748413085
mean time: 3.1528136253356935
Pipeline stage StressChecker completed in 41.32s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.79s
Shutdown handler de-registered
function_bafis_2025-07-09 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4227.49s
Shutdown handler de-registered
function_bafis_2025-07-09 status is now inactive due to auto deactivation removed underperforming models
function_bafis_2025-07-09 status is now torndown due to DeploymentManager action
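
The StressChecker percentiles logged above are consistent with linear interpolation between sorted samples, and `win_ratio` matches `num_wins / num_battles`. The sketch below recomputes a few of those figures from the first batch of five latencies. It is an illustration only: the helper name `percentile` is hypothetical, and the interpolation rule is inferred from the logged numbers, not taken from the pipeline's source.

```python
def percentile(samples, q):
    """Return the q-th percentile (0-100) of samples.

    Uses linear interpolation between the two nearest ranks.
    Assumption: this is the rule the stress checker uses; it is
    inferred from the logged values, not from the pipeline code.
    """
    ordered = sorted(samples)
    if not ordered:
        raise ValueError("samples must not be empty")
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)                      # index at or below the rank
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower                # distance past the lower index
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


# Latencies (seconds) from the first batch of five healthy responses.
latencies = [
    5.479592561721802,
    4.836116552352905,
    4.0309343338012695,
    3.3344948291778564,
    4.088778495788574,
]

p5 = percentile(latencies, 5)
p50 = percentile(latencies, 50)
p90 = percentile(latencies, 90)
mean_time = sum(latencies) / len(latencies)

# Win ratio from the metadata fields num_wins and num_battles.
win_ratio = 4911 / 9841

print(f"5th percentile:  {p5}")
print(f"50th percentile: {p50}")
print(f"90th percentile: {p90}")
print(f"mean time:       {mean_time}")
print(f"win_ratio:       {win_ratio}")
```

With only five samples per batch, the upper percentiles are interpolations between the two slowest requests, so they are better read as a rough indication than as a stable latency estimate.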