developer_uid: chai_evaluation_service
submission_id: function_pabar_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T19:21:15+00:00
num_battles: 7413
num_wins: 3677
celo_rating: 1290.53
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.4960205045190881
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 6.06738018989563s
Received healthy response to inference request in 3.309513568878174s
Received healthy response to inference request in 4.021894454956055s
Received healthy response to inference request in 2.0679233074188232s
Received healthy response to inference request in 1.8187618255615234s
Received healthy response to inference request in 2.4340851306915283s
Received healthy response to inference request in 2.6247336864471436s
Received healthy response to inference request in 2.999462604522705s
Received healthy response to inference request in 1.908233404159546s
Received healthy response to inference request in 1.9403371810913086s
10 requests
0 failed requests
5th percentile: 1.8590240359306336
10th percentile: 1.8992862462997437
20th percentile: 1.9339164257049561
30th percentile: 2.029647469520569
40th percentile: 2.2876204013824464
50th percentile: 2.529409408569336
60th percentile: 2.774625253677368
70th percentile: 3.092477893829346
80th percentile: 3.45198974609375
90th percentile: 4.226443028450012
95th percentile: 5.146911609172819
99th percentile: 5.883286473751069
mean time: 2.9192325353622435
Pipeline stage StressChecker completed in 30.50s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.57s
Shutdown handler de-registered
function_pabar_2025-12-17 status is now deployed due to DeploymentManager action
function_pabar_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_pabar_2025-12-17 status is now torndown due to DeploymentManager action