developer_uid: chai_backend_admin
submission_id: function_resin_2025-07-09
model_name: function_resin_2025-07-09
model_group:
status: torndown
timestamp: 2025-07-09T22:23:30+00:00
num_battles: 7031
num_wins: 3639
celo_rating: 1296.41
family_friendly_score: 0.5382
family_friendly_standard_error: 0.007050400839668621
submission_type: function
display_name: function_resin_2025-07-09
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-07-09
win_ratio: 0.5175650689802304
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.175310850143433s
Received healthy response to inference request in 3.8827013969421387s
Received healthy response to inference request in 3.7747116088867188s
Received healthy response to inference request in 2.5285801887512207s
Received healthy response to inference request in 3.5918431282043457s
5 requests
0 failed requests
5th percentile: 2.7412327766418456
10th percentile: 2.9538853645324705
20th percentile: 3.379190540313721
30th percentile: 3.62841682434082
40th percentile: 3.7015642166137694
50th percentile: 3.7747116088867188
60th percentile: 3.817907524108887
70th percentile: 3.8611034393310546
80th percentile: 3.9412232875823974
90th percentile: 4.058267068862915
95th percentile: 4.1167889595031735
99th percentile: 4.163606472015381
mean time: 3.5906294345855714
%s, retrying in %s seconds...
Received healthy response to inference request in 2.669839382171631s
Received healthy response to inference request in 2.82489275932312s
Received healthy response to inference request in 2.397456169128418s
Received healthy response to inference request in 3.7003040313720703s
Received healthy response to inference request in 3.647254228591919s
5 requests
0 failed requests
5th percentile: 2.4519328117370605
10th percentile: 2.506409454345703
20th percentile: 2.6153627395629884
30th percentile: 2.7008500576019285
40th percentile: 2.7628714084625243
50th percentile: 2.82489275932312
60th percentile: 3.1538373470306396
70th percentile: 3.482781934738159
80th percentile: 3.6578641891479493
90th percentile: 3.6790841102600096
95th percentile: 3.68969407081604
99th percentile: 3.6981820392608644
mean time: 3.0479493141174316
Pipeline stage StressChecker completed in 35.97s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.96s
Shutdown handler de-registered
function_resin_2025-07-09 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3636.21s
Shutdown handler de-registered
function_resin_2025-07-09 status is now inactive due to auto deactivation removed underperforming models
function_resin_2025-07-09 status is now torndown due to DeploymentManager action