developer_uid: chai_backend_admin
submission_id: function_ruras_2025-12-05
model_name: function_ruras_2025-12-05
model_group:
status: torndown
timestamp: 2025-12-12T18:30:21+00:00
num_battles: 28766
num_wins: 15265
celo_rating: 1314.33
family_friendly_score: 0.5846
family_friendly_standard_error: 0.006969115295358515
submission_type: function
display_name: function_ruras_2025-12-05
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-05
win_ratio: 0.5306611972467496
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
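The derived fields in the metadata above can be cross-checked with a little arithmetic. The sketch below recomputes win_ratio from num_wins and num_battles, and then asks what sample size the reported family_friendly_standard_error would imply if it were a plain binomial standard error, sqrt(p * (1 - p) / n). That formula is an assumption: the record does not say how the standard error was computed, and the variable names in the code are illustrative only.

```python
# Values copied from the metadata fields above.
num_battles = 28766
num_wins = 15265
reported_win_ratio = 0.5306611972467496

family_friendly_score = 0.5846
reported_standard_error = 0.006969115295358515

# win_ratio is simply wins divided by battles.
win_ratio = num_wins / num_battles

# ASSUMPTION: the standard error is binomial, se = sqrt(p * (1 - p) / n).
# Solving for n gives the sample size that such a formula would imply.
p = family_friendly_score
implied_sample_size = p * (1 - p) / reported_standard_error ** 2

print(f"win_ratio           = {win_ratio:.10f}")
print(f"implied sample size = {implied_sample_size:.1f}")
```

Under that binomial assumption the implied sample size comes out very close to a round number, which suggests (but does not confirm) that the score was estimated from a fixed-size evaluation set.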
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.7576546669006348s
Received healthy response to inference request in 2.558680534362793s
Received healthy response to inference request in 2.6907031536102295s
Received healthy response to inference request in 0.5378239154815674s
Received healthy response to inference request in 0.48329758644104004s
Received healthy response to inference request in 0.5129501819610596s
Received healthy response to inference request in 0.7017340660095215s
Received healthy response to inference request in 0.6867239475250244s
Received healthy response to inference request in 0.43134570121765137s
Received healthy response to inference request in 0.6501173973083496s
10 requests
0 failed requests
5th percentile: 0.45472404956817625
10th percentile: 0.4781023979187012
20th percentile: 0.5070196628570557
30th percentile: 0.5303617954254151
40th percentile: 0.6052000045776367
50th percentile: 0.668420672416687
60th percentile: 0.6927279949188232
70th percentile: 1.2588180065155026
80th percentile: 2.58508505821228
90th percentile: 2.69739830493927
95th percentile: 2.7275264859199524
99th percentile: 2.7516290307044984
mean time: 1.201103115081787
Pipeline stage StressChecker completed in 14.55s
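The StressChecker summary above can be reproduced from the ten logged response times. The log does not state how the percentiles were computed; the sketch below assumes linear interpolation between the closest ranks of the sorted samples (the default method in many numerical libraries). The helper name `percentile` is hypothetical and not taken from the pipeline.

```python
import math

# The ten response times (seconds) logged by the StressChecker stage.
latencies = [
    2.7576546669006348, 2.558680534362793, 2.6907031536102295,
    0.5378239154815674, 0.48329758644104004, 0.5129501819610596,
    0.7017340660095215, 0.6867239475250244, 0.43134570121765137,
    0.6501173973083496,
]


def percentile(samples, q):
    """Return the q-th percentile using linear interpolation.

    ASSUMPTION: the fractional rank is (n - 1) * q / 100 and the value is
    interpolated linearly between the two neighbouring sorted samples.
    """
    ordered = sorted(samples)
    position = (len(ordered) - 1) * q / 100
    lower = math.floor(position)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = position - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


mean_time = sum(latencies) / len(latencies)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

The first three requests took roughly 2.6 to 2.8 seconds and the remaining seven took under 0.75 seconds, so the mean (about 1.2 s) sits well above the median (about 0.67 s), and the percentiles jump sharply between the 60th and 80th.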
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.64s
Shutdown handler de-registered
function_ruras_2025-12-05 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 3322.38s
Shutdown handler de-registered
function_ruras_2025-12-05 status is now inactive due to auto deactivation removed underperforming models
function_ruras_2025-12-05 status is now torndown due to DeploymentManager action