developer_uid: chai_evaluation_service
submission_id: function_nidit_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T23:51:20+00:00
num_battles: 6623
num_wins: 3289
celo_rating: 1290.89
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.496602747999396
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.017035484313965s
Received healthy response to inference request in 2.194091558456421s
Received healthy response to inference request in 2.8491992950439453s
Received healthy response to inference request in 2.3228201866149902s
Received healthy response to inference request in 2.8439009189605713s
Received healthy response to inference request in 2.5721189975738525s
Received healthy response to inference request in 2.249077320098877s
Received healthy response to inference request in 3.0218446254730225s
Received healthy response to inference request in 2.9577813148498535s
Received healthy response to inference request in 2.41733980178833s
10 requests
0 failed requests
5th percentile: 2.09671071767807
10th percentile: 2.1763859510421755
20th percentile: 2.2380801677703857
30th percentile: 2.3006973266601562
40th percentile: 2.3795319557189942
50th percentile: 2.4947293996810913
60th percentile: 2.6808317661285397
70th percentile: 2.8454904317855836
80th percentile: 2.8709156990051268
90th percentile: 2.9641876459121703
95th percentile: 2.993016135692596
99th percentile: 3.0160789275169373
mean time: 2.544520950317383
Pipeline stage StressChecker completed in 27.20s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.61s
Shutdown handler de-registered
function_nidit_2025-12-17 status is now deployed due to DeploymentManager action
function_nidit_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_nidit_2025-12-17 status is now torndown due to DeploymentManager action