developer_uid: chai_evaluation_service
submission_id: function_rebut_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-13T20:17:23+00:00
num_battles: 7682
num_wins: 3863
celo_rating: 1256.32
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.5028638375423067
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
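The formatter templates above can be read as pieces of a single prompt string. The sketch below is only an illustration of how they might be combined: the ordering (memory, prompt, chat turns, response header), the `build_prompt` helper, and the `(speaker, message)` turn format are all assumptions, not the service's actual implementation, and truncation via `max_input_tokens` or `truncate_by_message` is not modelled.

```python
# Templates copied from the formatter entry in the record above.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". This layout is an assumption for illustration.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues
    # the bot's next line; generation stops at the first "\n" stopping word.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    example = build_prompt(
        memory="A short persona description.",
        prompt="An example scenario.",
        turns=[("user", "hi"), ("bot", "hello")],
        bot_name="richard",
        user_name="User",
    )
    print(example)
```

Because `stopping_words` is `['\n']` and the response template ends at the bot's name, a completion under this layout would be a single line of bot dialogue.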
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.405864953994751s
Received healthy response to inference request in 2.0874886512756348s
Received healthy response to inference request in 2.8362796306610107s
Received healthy response to inference request in 3.8456625938415527s
Received healthy response to inference request in 2.931546449661255s
Received healthy response to inference request in 3.1554250717163086s
Received healthy response to inference request in 2.994753360748291s
Received healthy response to inference request in 2.412194013595581s
Received healthy response to inference request in 2.728384494781494s
Received healthy response to inference request in 4.824756145477295s
10 requests
0 failed requests
5th percentile: 2.2336060643196105
10th percentile: 2.3797234773635862
20th percentile: 2.6651463985443113
30th percentile: 2.8039110898971558
40th percentile: 2.8934397220611574
50th percentile: 2.963149905204773
60th percentile: 3.059022045135498
70th percentile: 3.2305570363998415
80th percentile: 3.4938244819641113
90th percentile: 3.9435719490051264
95th percentile: 4.38416404724121
99th percentile: 4.736637725830079
mean time: 3.1222355365753174
Pipeline stage StressChecker completed in 32.55s
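The percentile and mean figures reported by StressChecker are consistent with linear interpolation between the sorted response times. The sketch below recomputes them from the ten logged latencies; the `percentile` helper is hypothetical, and the interpolation rule is an assumption inferred from the numbers, not taken from the service's code.

```python
import math

# Response times (seconds) copied from the ten healthy inference requests.
LATENCIES = [
    3.405864953994751,
    2.0874886512756348,
    2.8362796306610107,
    3.8456625938415527,
    2.931546449661255,
    3.1554250717163086,
    2.994753360748291,
    2.412194013595581,
    2.728384494781494,
    4.824756145477295,
]


def percentile(values, q):
    """Return the q-th percentile using linear interpolation.

    Assumed method: rank = q/100 * (n - 1) over the sorted values,
    interpolating between the two neighbouring samples.
    """
    ordered = sorted(values)
    rank = (q / 100) * (len(ordered) - 1)
    lower = math.floor(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


def mean(values):
    """Arithmetic mean of the samples."""
    return sum(values) / len(values)


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(LATENCIES, q)}")
    print(f"mean time: {mean(LATENCIES)}")
```

With only ten samples, the upper percentiles are dominated by the single slowest request (about 4.82 s), so the 95th and 99th percentile values are interpolated estimates rather than observed tail latencies.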
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
function_rebut_2025-12-13 status is now deployed due to DeploymentManager action
function_rebut_2025-12-13 status is now inactive due to auto deactivation removed underperforming models