developer_uid: chai_evaluation_service
submission_id: function_faden_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T23:06:27+00:00
num_battles: 8229
num_wins: 4154
celo_rating: 1296.66
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.5048000972171588
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.519679069519043s
Received healthy response to inference request in 3.061555862426758s
Received healthy response to inference request in 2.450667381286621s
Received healthy response to inference request in 3.33838152885437s
Received healthy response to inference request in 2.5705032348632812s
Received healthy response to inference request in 2.405578374862671s
Received healthy response to inference request in 2.346282720565796s
Received healthy response to inference request in 2.5537312030792236s
Received healthy response to inference request in 2.73371958732605s
Received healthy response to inference request in 2.083486795425415s
10 requests
0 failed requests
5th percentile: 2.2017449617385862
10th percentile: 2.320003128051758
20th percentile: 2.3937192440032957
30th percentile: 2.437140679359436
40th percentile: 2.492074394226074
50th percentile: 2.5367051362991333
60th percentile: 2.560440015792847
70th percentile: 2.6194681406021116
80th percentile: 2.7992868423461914
90th percentile: 3.089238429069519
95th percentile: 3.2138099789619443
99th percentile: 3.313467218875885
mean time: 2.6063585758209227
Pipeline stage StressChecker completed in 27.30s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.62s
Shutdown handler de-registered
function_faden_2025-12-16 status is now deployed due to DeploymentManager action
function_faden_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_faden_2025-12-16 status is now torndown due to DeploymentManager action