developer_uid: chai_evaluation_service
submission_id: function_rehes_2025-12-15
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-18T17:51:26+00:00
num_battles: 9220
num_wins: 4558
celo_rating: 1289.23
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-18
win_ratio: 0.4943600867678959
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.9185357093811035s
Received healthy response to inference request in 3.335568904876709s
Received healthy response to inference request in 4.5363404750823975s
Received healthy response to inference request in 2.771761894226074s
Received healthy response to inference request in 2.525909662246704s
Received healthy response to inference request in 5.269866228103638s
Received healthy response to inference request in 3.321915864944458s
Received healthy response to inference request in 2.752903699874878s
Received healthy response to inference request in 1.9203417301177979s
Received healthy response to inference request in 3.682816743850708s
10 requests
0 failed requests
5th percentile: 1.919348418712616
10th percentile: 1.9201611280441284
20th percentile: 2.404796075820923
30th percentile: 2.684805488586426
40th percentile: 2.7642186164855955
50th percentile: 3.046838879585266
60th percentile: 3.3273770809173584
70th percentile: 3.439743256568909
80th percentile: 3.8535214900970463
90th percentile: 4.6096930503845215
95th percentile: 4.939779639244079
99th percentile: 5.203848910331726
mean time: 3.203596091270447
Pipeline stage StressChecker completed in 33.90s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.55s
Shutdown handler de-registered
function_rehes_2025-12-15 status is now deployed due to DeploymentManager action
function_rehes_2025-12-15 status is now inactive due to auto deactivation removed underperforming models
function_rehes_2025-12-15 status is now torndown due to DeploymentManager action