developer_uid: chai_evaluation_service
submission_id: function_tojit_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-13T07:16:57+00:00
num_battles: 9241
num_wins: 4749
celo_rating: 1298.65
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-12
win_ratio: 0.5139054214911806
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.412811040878296s
Received healthy response to inference request in 2.418545722961426s
Received healthy response to inference request in 2.7092299461364746s
Received healthy response to inference request in 3.803253412246704s
Received healthy response to inference request in 2.69691801071167s
Received healthy response to inference request in 3.5744826793670654s
Received healthy response to inference request in 4.801912069320679s
Received healthy response to inference request in 1.817497968673706s
Received healthy response to inference request in 2.254293203353882s
Received healthy response to inference request in 3.3303048610687256s
10 requests
0 failed requests
5th percentile: 2.0140558242797852
10th percentile: 2.2106136798858644
20th percentile: 2.381107473373413
30th percentile: 2.4168253183364867
40th percentile: 2.5855690956115724
50th percentile: 2.7030739784240723
60th percentile: 2.9576599121093747
70th percentile: 3.4035582065582277
80th percentile: 3.6202368259429933
90th percentile: 3.9031192779541013
95th percentile: 4.352515673637389
99th percentile: 4.712032790184021
mean time: 2.981924891471863
Pipeline stage StressChecker completed in 31.73s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.61s
Shutdown handler de-registered
function_tojit_2025-12-13 status is now deployed due to DeploymentManager action
function_tojit_2025-12-13 status is now inactive due to auto deactivation removed underperforming models