developer_uid: chai_evaluation_service
submission_id: function_reher_2025-12-14
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-18T00:21:21+00:00
num_battles: 8233
num_wins: 4146
celo_rating: 1256.41
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.503583141017855
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
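
The formatter templates above can be read as pieces of one prompt string. The following is a minimal sketch of how they might be joined. It assumes the order memory, prompt, chat turns, response header, which this record does not confirm. The function name `build_prompt` and the `(speaker, message)` turn format are hypothetical, and truncation to `max_input_tokens` is not modelled.

```python
# Templates copied from the formatter entry above.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order."""
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    # Each turn is an assumed (speaker, message) pair, speaker is "bot" or "user".
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    print(build_prompt("You are Richard.", "A chat.", [("user", "Hi")], "Richard", "Sam"))
```

Because `stopping_words` is `['\n']`, a prompt built this way would yield a single-line reply of at most 64 output tokens.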
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.4631261825561523s
Received healthy response to inference request in 2.8273022174835205s
Received healthy response to inference request in 4.597491025924683s
Received healthy response to inference request in 5.303937911987305s
Received healthy response to inference request in 4.914802312850952s
Received healthy response to inference request in 2.7242965698242188s
Received healthy response to inference request in 9.102296352386475s
Received healthy response to inference request in 2.991447687149048s
Received healthy response to inference request in 3.07698655128479s
Received healthy response to inference request in 1.6786167621612549s
10 requests
0 failed requests
5th percentile: 2.0316460013389586
10th percentile: 2.3846752405166627
20th percentile: 2.6720624923706056
30th percentile: 2.79640052318573
40th percentile: 2.925789499282837
50th percentile: 3.034217119216919
60th percentile: 3.685188341140746
70th percentile: 4.6926844120025635
80th percentile: 4.992629432678223
90th percentile: 5.68377375602722
95th percentile: 7.393035054206845
99th percentile: 8.76044409275055
mean time: 3.9680303573608398
Pipeline stage StressChecker completed in 41.15s
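
The StressChecker percentiles and mean above are consistent with linear interpolation between the sorted latencies. The sketch below recomputes them from the ten logged response times. The `percentile` helper is illustrative and is not the service's own code, which this log does not show.

```python
import math

# Response times in seconds, copied from the ten healthy responses logged above.
latencies = [
    2.4631261825561523,
    2.8273022174835205,
    4.597491025924683,
    5.303937911987305,
    4.914802312850952,
    2.7242965698242188,
    9.102296352386475,
    2.991447687149048,
    3.07698655128479,
    1.6786167621612549,
]


def percentile(values, q):
    """Return the q-th percentile (0 to 100) using linear interpolation."""
    ordered = sorted(values)
    # Fractional position of the percentile within the sorted list.
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def mean(values):
    """Arithmetic mean of the values."""
    return sum(values) / len(values)


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {mean(latencies)}")
```

With only 10 requests, the 95th and 99th percentiles are interpolated between the two slowest responses (5.30s and 9.10s), so the single 9.10s outlier largely determines them.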
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
function_reher_2025-12-14 status is now deployed due to DeploymentManager action
function_reher_2025-12-14 status is now inactive due to auto deactivation removed underperforming models
function_reher_2025-12-14 status is now torndown due to DeploymentManager action