developer_uid: chai_evaluation_service
submission_id: function_mopan_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T14:51:18+00:00
num_battles: 8881
num_wins: 4451
celo_rating: 1294.02
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.5011822992906204
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 10.177368640899658s
Received healthy response to inference request in 4.154570579528809s
Received healthy response to inference request in 5.491484880447388s
Received healthy response to inference request in 2.060727834701538s
Received healthy response to inference request in 3.8631627559661865s
Received healthy response to inference request in 3.621347665786743s
Received healthy response to inference request in 3.205368995666504s
Received healthy response to inference request in 3.1835012435913086s
Received healthy response to inference request in 2.600693941116333s
Received healthy response to inference request in 6.613973379135132s
10 requests
0 failed requests
5th percentile: 2.303712582588196
10th percentile: 2.5466973304748537
20th percentile: 3.0669397830963137
30th percentile: 3.1988086700439453
40th percentile: 3.4549561977386474
50th percentile: 3.742255210876465
60th percentile: 3.979725885391235
70th percentile: 4.555644869804382
80th percentile: 5.715982580184937
90th percentile: 6.970312905311583
95th percentile: 8.573840773105617
99th percentile: 9.856663067340852
mean time: 4.49721999168396
Pipeline stage StressChecker completed in 47.09s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.52s
Shutdown handler de-registered
function_mopan_2025-12-16 status is now deployed due to DeploymentManager action
function_mopan_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_mopan_2025-12-16 status is now torndown due to DeploymentManager action