developer_uid: chai_evaluation_service
submission_id: function_jofet_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T04:21:15+00:00
num_battles: 6796
num_wins: 3337
celo_rating: 1286.81
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.4910241318422601
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2740061283111572s
Received healthy response to inference request in 2.06221079826355s
Received healthy response to inference request in 1.841151475906372s
Received healthy response to inference request in 3.6302666664123535s
Received healthy response to inference request in 3.61557674407959s
Received healthy response to inference request in 1.857158899307251s
Received healthy response to inference request in 2.1219277381896973s
Received healthy response to inference request in 2.421370029449463s
Received healthy response to inference request in 3.502525568008423s
Received healthy response to inference request in 2.2306160926818848s
10 requests
0 failed requests
5th percentile: 1.8483548164367676
10th percentile: 1.855558156967163
20th percentile: 2.02120041847229
30th percentile: 2.104012656211853
40th percentile: 2.1871407508850096
50th percentile: 2.252311110496521
60th percentile: 2.3329516887664794
70th percentile: 2.745716691017151
80th percentile: 3.525135803222656
90th percentile: 3.6170457363128663
95th percentile: 3.6236562013626097
99th percentile: 3.628944573402405
mean time: 2.555681014060974
Pipeline stage StressChecker completed in 26.86s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
function_jofet_2025-12-17 status is now deployed due to DeploymentManager action
function_jofet_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_jofet_2025-12-17 status is now torndown due to DeploymentManager action