developer_uid: chai_evaluation_service
submission_id: function_dagon_2025-12-18
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-21T14:01:07+00:00
num_battles: 8209
num_wins: 4093
celo_rating: 1292.24
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-21
win_ratio: 0.49859909855037154
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.862327814102173s
Received healthy response to inference request in 3.6458895206451416s
Received healthy response to inference request in 3.5977659225463867s
Received healthy response to inference request in 2.170717239379883s
Received healthy response to inference request in 2.301673650741577s
Received healthy response to inference request in 2.0840137004852295s
Received healthy response to inference request in 2.482208013534546s
Received healthy response to inference request in 1.841667652130127s
Received healthy response to inference request in 1.8753936290740967s
Received healthy response to inference request in 2.1574018001556396s
10 requests
0 failed requests
5th percentile: 1.8568443417549134
10th percentile: 1.8720210313796997
20th percentile: 2.042289686203003
30th percentile: 2.1353853702545167
40th percentile: 2.1653910636901856
50th percentile: 2.23619544506073
60th percentile: 2.3738873958587647
70th percentile: 2.596243953704834
80th percentile: 3.009415435791016
90th percentile: 3.602578282356262
95th percentile: 3.624233901500702
99th percentile: 3.6415583968162535
mean time: 2.50190589427948
Pipeline stage StressChecker completed in 27.01s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.61s
Shutdown handler de-registered
function_dagon_2025-12-18 status is now deployed due to DeploymentManager action
function_dagon_2025-12-18 status is now inactive due to auto deactivation removed underperforming models
function_dagon_2025-12-18 status is now torndown due to DeploymentManager action