developer_uid: chai_evaluation_service
submission_id: function_sejir_2025-12-18
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-21T20:21:17+00:00
num_battles: 7326
num_wins: 3672
celo_rating: 1294.14
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-21
win_ratio: 0.5012285012285013
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.063058614730835s
Received healthy response to inference request in 3.6379482746124268s
Received healthy response to inference request in 3.075554847717285s
Received healthy response to inference request in 3.4329237937927246s
Received healthy response to inference request in 3.7921605110168457s
Received healthy response to inference request in 2.168807029724121s
Received healthy response to inference request in 3.4388153553009033s
Received healthy response to inference request in 3.4542629718780518s
Received healthy response to inference request in 2.170654535293579s
Received healthy response to inference request in 3.4238197803497314s
10 requests
0 failed requests
5th percentile: 2.1106454014778135
10th percentile: 2.1582321882247926
20th percentile: 2.1702850341796873
30th percentile: 2.804084753990173
40th percentile: 3.284513807296753
50th percentile: 3.428371787071228
60th percentile: 3.435280418395996
70th percentile: 3.4434496402740478
80th percentile: 3.491000032424927
90th percentile: 3.6533694982528684
95th percentile: 3.722765004634857
99th percentile: 3.778281409740448
mean time: 3.0658005714416503
Pipeline stage StressChecker completed in 32.03s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_sejir_2025-12-18 status is now deployed due to DeploymentManager action
function_sejir_2025-12-18 status is now inactive due to auto deactivation removed underperforming models
function_sejir_2025-12-18 status is now torndown due to DeploymentManager action