developer_uid: chai_backend_admin
submission_id: function_tebos_2025-07-07
model_name: function_tebos_2025-07-07
model_group:
status: torndown
timestamp: 2025-07-07T22:25:17+00:00
num_battles: 5543
num_wins: 2805
celo_rating: 1294.06
family_friendly_score: 0.5442
family_friendly_standard_error: 0.00704338498166897
submission_type: function
display_name: function_tebos_2025-07-07
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-07-07
win_ratio: 0.506043658668591
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
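The formatter entry above lists templates but not how they are combined. The following is a minimal sketch of one plausible assembly order (memory, then prompt, then chat turns, then the response header). The order, the `build_prompt` helper, and all sample names and messages are assumptions for illustration, not taken from the record. Truncation (`truncate_by_message`, `max_input_tokens`) is not modelled.

```python
# Templates copied from the formatter entry; everything else is hypothetical.
formatter = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(formatter, bot_name, user_name, memory, prompt, turns):
    """Join the templates in an assumed order: memory, prompt, turns, response."""
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


text = build_prompt(
    formatter,
    bot_name="Bot",
    user_name="User",
    memory="Bot is a friendly guide.",
    prompt="A chat in a library.",
    turns=[("user", "Hi"), ("bot", "Hello!"), ("user", "Any book tips?")],
)
print(text)
```

Because `stopping_words` is `['\n']` in the generation parameters, a completion of such a prompt would end at the first newline, which fits the single-line `{bot_name}: {message}` turn format.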
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.1697306632995605s
Received healthy response to inference request in 4.7954864501953125s
Received healthy response to inference request in 3.5258400440216064s
Received healthy response to inference request in 4.522293329238892s
Received healthy response to inference request in 4.876136302947998s
5 requests
0 failed requests
5th percentile: 3.654618167877197
10th percentile: 3.783396291732788
20th percentile: 4.040952539443969
30th percentile: 4.240243196487427
40th percentile: 4.381268262863159
50th percentile: 4.522293329238892
60th percentile: 4.63157057762146
70th percentile: 4.740847826004028
80th percentile: 4.81161642074585
90th percentile: 4.843876361846924
95th percentile: 4.860006332397461
99th percentile: 4.8729103088378904
mean time: 4.3778973579406735
%s, retrying in %s seconds...
Received healthy response to inference request in 2.5601119995117188s
Received healthy response to inference request in 3.777928590774536s
Received healthy response to inference request in 3.3799078464508057s
Received healthy response to inference request in 5.053741931915283s
Received healthy response to inference request in 7.03580904006958s
5 requests
0 failed requests
5th percentile: 2.724071168899536
10th percentile: 2.8880303382873533
20th percentile: 3.2159486770629884
30th percentile: 3.4595119953155518
40th percentile: 3.618720293045044
50th percentile: 3.777928590774536
60th percentile: 4.288253927230835
70th percentile: 4.798579263687134
80th percentile: 5.450155353546143
90th percentile: 6.242982196807861
95th percentile: 6.63939561843872
99th percentile: 6.956526355743408
mean time: 4.361499881744384
%s, retrying in %s seconds...
Received healthy response to inference request in 3.2606325149536133s
Received healthy response to inference request in 3.716628074645996s
Received healthy response to inference request in 1.9679129123687744s
Received healthy response to inference request in 4.365322828292847s
Received healthy response to inference request in 3.1249353885650635s
5 requests
0 failed requests
5th percentile: 2.1993174076080324
10th percentile: 2.43072190284729
20th percentile: 2.8935308933258055
30th percentile: 3.1520748138427734
40th percentile: 3.2063536643981934
50th percentile: 3.2606325149536133
60th percentile: 3.4430307388305663
70th percentile: 3.6254289627075194
80th percentile: 3.846367025375366
90th percentile: 4.105844926834107
95th percentile: 4.235583877563476
99th percentile: 4.339375038146972
mean time: 3.287086343765259
Pipeline stage StressChecker completed in 63.43s
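The log does not say how StressChecker derives its percentile figures. As a sketch, assuming linear interpolation between closest ranks (the default behaviour of `numpy.percentile`), the numbers for the last batch of five requests can be reproduced in plain Python. The `percentile` helper is hypothetical and is not the pipeline's own code.

```python
from statistics import fmean


def percentile(values, q):
    """q-th percentile (0-100) by linear interpolation between closest ranks."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("values must not be empty")
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


# Latencies in seconds, copied from the last batch of healthy responses in the log.
latencies = [
    3.2606325149536133,
    3.716628074645996,
    1.9679129123687744,
    4.365322828292847,
    3.1249353885650635,
]

summary = {
    q: percentile(latencies, q)
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99)
}
mean_time = fmean(latencies)

for q, value in summary.items():
    print(f"{q}th percentile: {value}")
print(f"mean time: {mean_time}")
```

With only five samples per batch, the upper percentiles are interpolated almost entirely from the two slowest requests, so they are best read as rough indicators.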
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.57s
Shutdown handler de-registered
function_tebos_2025-07-07 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 7020.15s
Shutdown handler de-registered
function_tebos_2025-07-07 status is now inactive due to auto deactivation (removed underperforming models)
function_tebos_2025-07-07 status is now torndown due to DeploymentManager action