developer_uid: chai_evaluation_service
submission_id: function_kuser_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T17:51:27+00:00
num_battles: 8728
num_wins: 4343
celo_rating: 1291.65
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.49759395050412464
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.585576057434082s
Received healthy response to inference request in 2.169597864151001s
Received healthy response to inference request in 3.4644675254821777s
Received healthy response to inference request in 3.3088207244873047s
Received healthy response to inference request in 2.3510632514953613s
Received healthy response to inference request in 2.6397042274475098s
Received healthy response to inference request in 2.8548972606658936s
Received healthy response to inference request in 3.0836727619171143s
Received healthy response to inference request in 2.1365065574645996s
Received healthy response to inference request in 4.568568229675293s
10 requests
0 failed requests
5th percentile: 2.1513976454734802
10th percentile: 2.166288733482361
20th percentile: 2.3147701740264894
30th percentile: 2.5152222156524657
40th percentile: 2.6180529594421387
50th percentile: 2.7473007440567017
60th percentile: 2.9464074611663817
70th percentile: 3.151217150688171
80th percentile: 3.3399500846862793
90th percentile: 3.574877595901489
95th percentile: 4.07172291278839
99th percentile: 4.469199166297913
mean time: 2.9162874460220336
Pipeline stage StressChecker completed in 30.40s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.61s
Shutdown handler de-registered
function_kuser_2025-12-17 status is now deployed due to DeploymentManager action
function_kuser_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_kuser_2025-12-17 status is now torndown due to DeploymentManager action