developer_uid: chai_evaluation_service
submission_id: function_fuhik_2025-12-14
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-17T18:41:24+00:00
num_battles: 10584
num_wins: 5318
celo_rating: 1256.39
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.5024565381708239
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.3090786933898926s
Received healthy response to inference request in 3.304800271987915s
Received healthy response to inference request in 2.3261396884918213s
Received healthy response to inference request in 3.019378662109375s
Received healthy response to inference request in 3.4629573822021484s
Received healthy response to inference request in 2.136005401611328s
Received healthy response to inference request in 2.217252016067505s
Received healthy response to inference request in 3.020071506500244s
Received healthy response to inference request in 2.4760468006134033s
Received healthy response to inference request in 3.3193747997283936s
10 requests
0 failed requests
5th percentile: 2.1725663781166076
10th percentile: 2.209127354621887
20th percentile: 2.304362154006958
30th percentile: 2.4310746669769285
40th percentile: 2.8020459175109864
50th percentile: 3.0197250843048096
60th percentile: 3.1339630126953124
70th percentile: 3.3060837984085083
80th percentile: 3.3111379146575928
90th percentile: 3.333733057975769
95th percentile: 3.3983452200889586
99th percentile: 3.4500349497795106
mean time: 2.8591105222702025
Pipeline stage StressChecker completed in 30.56s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_fuhik_2025-12-14 status is now deployed due to DeploymentManager action
function_fuhik_2025-12-14 status is now inactive due to auto deactivation removed underperforming models
function_fuhik_2025-12-14 status is now torndown due to DeploymentManager action