developer_uid: chai_evaluation_service
submission_id: function_tisek_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T14:51:18+00:00
num_battles: 9357
num_wins: 4757
celo_rating: 1299.24
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.5083894410601688
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
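The `formatter` entry above is a set of Python format strings. The sketch below shows one way they could be combined into a single model input. The function `build_prompt`, its arguments, and the assembly order (memory, prompt, chat turns, response header) are assumptions made for illustration; the log does not show how the service actually concatenates or truncates these pieces.

```python
# Templates copied verbatim from the `formatter` entry in the metadata.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user". No token-based truncation is applied here.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)
```

Because `stopping_words` is `['\n']` in `generation_params`, a prompt ending in `richard:` would yield a single-line reply, which is consistent with the one-message-per-line chat templates.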
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.38313889503479s
Received healthy response to inference request in 2.1184442043304443s
Received healthy response to inference request in 2.094621419906616s
Received healthy response to inference request in 2.529982328414917s
Received healthy response to inference request in 2.8068559169769287s
Received healthy response to inference request in 2.77811861038208s
Received healthy response to inference request in 1.9512135982513428s
Received healthy response to inference request in 2.621856451034546s
Received healthy response to inference request in 1.9299321174621582s
Received healthy response to inference request in 1.761476993560791s
10 requests
0 failed requests
5th percentile: 1.8372817993164063
10th percentile: 1.9130866050720214
20th percentile: 1.9469573020935058
30th percentile: 2.051599073410034
40th percentile: 2.108915090560913
50th percentile: 2.250791549682617
60th percentile: 2.441876268386841
70th percentile: 2.5575445652008058
80th percentile: 2.6531088829040526
90th percentile: 2.780992341041565
95th percentile: 2.793924129009247
99th percentile: 2.804269559383392
mean time: 2.2975640535354613
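The percentile and mean figures above can be recomputed from the ten logged response times. This is a minimal sketch assuming linear interpolation between order statistics (the default behaviour of `numpy.percentile`); the log does not name the method StressChecker uses, and the helper `percentile` is written here for illustration only.

```python
import math

# Response times in seconds, copied from the StressChecker log lines.
latencies = [
    2.38313889503479,
    2.1184442043304443,
    2.094621419906616,
    2.529982328414917,
    2.8068559169769287,
    2.77811861038208,
    1.9512135982513428,
    2.621856451034546,
    1.9299321174621582,
    1.761476993560791,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

Under this assumption the 5th, 50th, and 99th percentiles and the mean agree with the logged values up to floating-point rounding, which suggests the stage uses the same interpolation rule.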
Pipeline stage StressChecker completed in 24.29s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.63s
Shutdown handler de-registered
function_tisek_2025-12-16 status is now deployed due to DeploymentManager action
function_tisek_2025-12-16 status is now inactive due to auto deactivation: removed underperforming models
function_tisek_2025-12-16 status is now torndown due to DeploymentManager action