developer_uid: chai_evaluation_service
submission_id: function_bugos_2025-12-14
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-14T12:27:34+00:00
num_battles: 8287
num_wins: 4264
celo_rating: 1256.38
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.5145408471099312
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.110833168029785s
Received healthy response to inference request in 2.476424217224121s
Received healthy response to inference request in 2.5783376693725586s
Received healthy response to inference request in 1.7533187866210938s
Received healthy response to inference request in 2.075221061706543s
Received healthy response to inference request in 2.360799789428711s
Received healthy response to inference request in 1.8550224304199219s
Received healthy response to inference request in 2.1143271923065186s
Received healthy response to inference request in 1.689748764038086s
Received healthy response to inference request in 1.9223897457122803s
10 requests
0 failed requests
5th percentile: 1.7183552742004395
10th percentile: 1.746961784362793
20th percentile: 1.8346817016601562
30th percentile: 1.9021795511245727
40th percentile: 2.014088535308838
50th percentile: 2.093027114868164
60th percentile: 2.1122307777404785
70th percentile: 2.1882689714431764
80th percentile: 2.383924674987793
90th percentile: 2.4866155624389648
95th percentile: 2.5324766159057615
99th percentile: 2.5691654586791994
mean time: 2.0936422824859617
Pipeline stage StressChecker completed in 23.17s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.63s
Shutdown handler de-registered
function_bugos_2025-12-14 status is now deployed due to DeploymentManager action
function_bugos_2025-12-14 status is now inactive due to auto deactivation removed underperforming models