developer_uid: chai_evaluation_service
submission_id: function_huhaf_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T04:21:19+00:00
num_battles: 11612
num_wins: 5772
celo_rating: 1291.25
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-18
win_ratio: 0.49707199448846023
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.9323501586914062s
Received healthy response to inference request in 2.5728759765625s
Received healthy response to inference request in 2.0534417629241943s
Received healthy response to inference request in 2.567106008529663s
Received healthy response to inference request in 2.2458834648132324s
Received healthy response to inference request in 2.531215190887451s
Received healthy response to inference request in 2.6810855865478516s
Received healthy response to inference request in 2.2609097957611084s
Received healthy response to inference request in 2.949392557144165s
Received healthy response to inference request in 1.9805946350097656s
10 requests
0 failed requests
5th percentile: 2.0133758425712585
10th percentile: 2.0461570501327513
20th percentile: 2.2073951244354246
30th percentile: 2.2564018964767456
40th percentile: 2.4230930328369142
50th percentile: 2.549160599708557
60th percentile: 2.569413995742798
70th percentile: 2.6053388595581053
80th percentile: 2.7313385009765625
90th percentile: 2.934054398536682
95th percentile: 2.9417234778404238
99th percentile: 2.947858741283417
mean time: 2.477485513687134
Pipeline stage StressChecker completed in 26.55s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.61s
Shutdown handler de-registered
function_huhaf_2025-12-16 status is now deployed due to DeploymentManager action
function_huhaf_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_huhaf_2025-12-16 status is now torndown due to DeploymentManager action