developer_uid: chai_evaluation_service
submission_id: function_dabom_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T10:21:15+00:00
num_battles: 8547
num_wins: 4222
celo_rating: 1289.08
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.493974493974494
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
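The formatter templates above can be read as a recipe for assembling the model input. The sketch below is only an illustration of one plausible assembly order (memory, prompt, chat turns, then the response header). The function name `build_prompt`, the `turns` structure, and the ordering itself are assumptions, not the evaluation service's actual code, and truncation to `max_input_tokens` is not modelled.

```python
# Templates copied from the formatter record above.
formatter = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the filled-in templates in an assumed order.

    `turns` is a list of (speaker, message) pairs where speaker is
    either "bot" or "user".
    """
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    memory="You are Richard.",
    prompt="A chat.",
    turns=[("user", "Hi")],
    bot_name="richard",
    user_name="Sam",
)
print(example)
```

With `stopping_words` set to `['\n']` in the generation parameters, a completion of such a prompt would end at the first newline, which fits a single-line bot reply.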
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.364780902862549s
Received healthy response to inference request in 1.755721092224121s
Received healthy response to inference request in 2.1537814140319824s
Received healthy response to inference request in 2.064760446548462s
Received healthy response to inference request in 1.8341290950775146s
Received healthy response to inference request in 2.091550350189209s
Received healthy response to inference request in 1.764554500579834s
Received healthy response to inference request in 2.310178518295288s
Received healthy response to inference request in 1.699922800064087s
Received healthy response to inference request in 2.291363000869751s
10 requests
0 failed requests
5th percentile: 1.7250320315361023
10th percentile: 1.7501412630081177
20th percentile: 1.7627878189086914
30th percentile: 1.8132567167282105
40th percentile: 1.972507905960083
50th percentile: 2.0781553983688354
60th percentile: 2.1164427757263184
70th percentile: 2.195055890083313
80th percentile: 2.2951261043548583
90th percentile: 2.3156387567520142
95th percentile: 2.3402098298072813
99th percentile: 2.359866688251495
mean time: 2.03307421207428
Pipeline stage StressChecker completed in 21.72s
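The StressChecker summary can be cross-checked from the ten logged response times. The sketch below assumes the percentiles use linear interpolation between order statistics (the default behaviour of common numerical libraries); the log does not say which method was used, and the helper name `percentile` is hypothetical.

```python
import math

# Response times in seconds, copied from the ten "healthy response" lines above.
latencies = [
    2.364780902862549,
    1.755721092224121,
    2.1537814140319824,
    2.064760446548462,
    1.8341290950775146,
    2.091550350189209,
    1.764554500579834,
    2.310178518295288,
    1.699922800064087,
    2.291363000869751,
]


def percentile(values, q):
    """Percentile q (0-100) using linear interpolation between sorted samples."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lo = math.floor(rank)
    hi = math.ceil(rank)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (rank - lo)


mean_time = sum(latencies) / len(latencies)

for q in (5, 50, 90, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

Under that assumption the computed values should agree with the logged figures to within floating-point rounding.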
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.69s
Shutdown handler de-registered
function_dabom_2025-12-17 status is now deployed due to DeploymentManager action
function_dabom_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_dabom_2025-12-17 status is now torndown due to DeploymentManager action