developer_uid: chai_evaluation_service
submission_id: function_fuden_2025-12-18
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-21T03:51:29+00:00
num_battles: 7872
num_wins: 3966
celo_rating: 1296.04
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.5038109756097561
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.78187894821167s
Received healthy response to inference request in 2.8458614349365234s
Received healthy response to inference request in 1.6640169620513916s
Received healthy response to inference request in 3.6602964401245117s
Received healthy response to inference request in 3.586439847946167s
Received healthy response to inference request in 4.01858115196228s
Received healthy response to inference request in 2.2351646423339844s
Received healthy response to inference request in 3.7444403171539307s
Received healthy response to inference request in 1.9586656093597412s
Received healthy response to inference request in 1.745943546295166s
10 requests
0 failed requests
5th percentile: 1.70088392496109
10th percentile: 1.7377508878707886
20th percentile: 1.7746918678283692
30th percentile: 1.9056296110153197
40th percentile: 2.124565029144287
50th percentile: 2.540513038635254
60th percentile: 3.1420928001403805
70th percentile: 3.6085968255996703
80th percentile: 3.6771252155303955
90th percentile: 3.7718544006347656
95th percentile: 3.8952177762985225
99th percentile: 3.993908476829529
mean time: 2.7241288900375364
Pipeline stage StressChecker completed in 29.32s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_fuden_2025-12-18 status is now deployed due to DeploymentManager action
function_fuden_2025-12-18 status is now inactive due to auto deactivation removed underperforming models
function_fuden_2025-12-18 status is now torndown due to DeploymentManager action