developer_uid: chai_evaluation_service
submission_id: function_sodek_2025-12-14
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-14T10:56:29+00:00
num_battles: 8224
num_wins: 4125
celo_rating: 1256.38
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.5015807392996109
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.6585140228271484s
Received healthy response to inference request in 6.247203826904297s
Received healthy response to inference request in 4.788532257080078s
Received healthy response to inference request in 2.7451322078704834s
Received healthy response to inference request in 3.4906816482543945s
Received healthy response to inference request in 2.302703380584717s
Received healthy response to inference request in 3.965252161026001s
Received healthy response to inference request in 2.668792724609375s
Received healthy response to inference request in 4.434471130371094s
Received healthy response to inference request in 2.3695778846740723s
10 requests
0 failed requests
5th percentile: 2.332796907424927
10th percentile: 2.362890434265137
20th percentile: 2.6089497566223145
30th percentile: 2.7222303628921507
40th percentile: 3.19246187210083
50th percentile: 3.5745978355407715
60th percentile: 3.7812092781066893
70th percentile: 4.106017851829529
80th percentile: 4.505283355712891
90th percentile: 4.9343994140625
95th percentile: 5.590801620483397
99th percentile: 6.115923385620118
mean time: 3.667086124420166
Pipeline stage StressChecker completed in 38.31s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.57s
Shutdown handler de-registered
function_sodek_2025-12-14 status is now deployed due to DeploymentManager action
function_sodek_2025-12-14 status is now inactive due to auto deactivation removed underperforming models