developer_uid: chai_evaluation_service
submission_id: function_dotub_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T16:41:06+00:00
num_battles: 9988
num_wins: 4964
celo_rating: 1291.1
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.4969963956748098
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
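The formatter templates above can be read as pieces of a single prompt string. The following is a minimal sketch of how they might be combined, not the service's actual code: the function name `build_prompt`, the `turns` structure, and the assembly order (memory, prompt, chat turns, response header) are all assumptions made for illustration. Truncation via `max_input_tokens` and `truncate_by_message` is not modeled.

```python
# Templates copied from the formatter field of this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates into one prompt string.

    `turns` is an assumed structure: a list of (speaker, message) pairs,
    where speaker is either "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues
    # the bot's next line; generation stops at the first "\n" per
    # stopping_words in generation_params.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    memory="A friendly assistant.",
    prompt="Casual chat.",
    turns=[("user", "hi"), ("bot", "hello"), ("user", "how are you?")],
    bot_name="richard",
    user_name="Alex",
)
```

Under these assumptions, the string ends with `### Response:\nrichard:`, which is consistent with the single-line stop sequence `'\n'` and the 64-token output cap in `generation_params`.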
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.575917959213257s
Received healthy response to inference request in 2.743086099624634s
Received healthy response to inference request in 1.7450180053710938s
Received healthy response to inference request in 1.8346867561340332s
Received healthy response to inference request in 3.0769898891448975s
Received healthy response to inference request in 3.9287092685699463s
Received healthy response to inference request in 3.801401376724243s
Received healthy response to inference request in 2.3205223083496094s
Received healthy response to inference request in 2.19964337348938s
Received healthy response to inference request in 2.6914684772491455s
10 requests
0 failed requests
5th percentile: 1.7853689432144164
10th percentile: 1.8257198810577393
20th percentile: 2.1266520500183104
30th percentile: 2.2842586278915404
40th percentile: 2.4737596988677977
50th percentile: 2.633693218231201
60th percentile: 2.712115526199341
70th percentile: 2.8432572364807127
80th percentile: 3.2218721866607667
90th percentile: 3.8141321659088137
95th percentile: 3.8714207172393795
99th percentile: 3.917251558303833
mean time: 2.691744351387024
Pipeline stage StressChecker completed in 28.24s
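The StressChecker summary above can be reproduced from the ten logged response times. The sketch below assumes the percentiles use linear interpolation between the closest ranks (the default behaviour of `numpy.percentile`); the log does not state which method the service uses, and the helper name `percentile` is illustrative. Only the standard library is used.

```python
import math

# Response times in seconds, copied from the ten "healthy response" log lines.
LATENCIES = [
    2.575917959213257,
    2.743086099624634,
    1.7450180053710938,
    1.8346867561340332,
    3.0769898891448975,
    3.9287092685699463,
    3.801401376724243,
    2.3205223083496094,
    2.19964337348938,
    2.6914684772491455,
]


def percentile(values, q):
    """Return the q-th percentile (0 <= q <= 100) by linear interpolation.

    The fractional rank is (n - 1) * q / 100; the result is interpolated
    between the two sorted samples that bracket that rank.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(LATENCIES) / len(LATENCIES)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(LATENCIES, q)}")
print(f"mean time: {mean_time}")
```

Under this assumption the computed values agree with the logged percentiles and mean up to floating-point rounding, which suggests, but does not confirm, that the service uses the same interpolation rule.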
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.79s
Shutdown handler de-registered
function_dotub_2025-12-17 status is now deployed due to DeploymentManager action
function_dotub_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_dotub_2025-12-17 status is now torndown due to DeploymentManager action