developer_uid: chai_evaluation_service
submission_id: function_bikaf_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T14:31:18+00:00
num_battles: 10196
num_wins: 5089
celo_rating: 1292.53
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.49911730090231465
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
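
The formatter templates above can be read as pieces of a single prompt string. The sketch below shows one plausible way they might be combined. The function name `build_prompt`, the ordering of the pieces (memory, prompt, conversation turns, response header), and the sample names and messages are all assumptions for illustration; the record itself only gives the template strings.

```python
# Templates copied from the `formatter` field of the record above.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name, fmt=FORMATTER):
    """Assemble one prompt string from the templates.

    Hypothetical helper: the assembly order used here is an assumption,
    not something the record states.
    `turns` is a list of (speaker, message) pairs, speaker is "bot" or "user".
    """
    parts = [
        fmt["memory_template"].format(memory=memory),
        fmt["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(fmt["bot_template"].format(bot_name=bot_name, message=message))
        else:
            parts.append(fmt["user_template"].format(user_name=user_name, message=message))
    # The response template ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(fmt["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    # Sample values are made up for the demonstration.
    print(build_prompt(
        memory="A friendly assistant persona.",
        prompt="Casual chat.",
        turns=[("user", "hi"), ("bot", "hello")],
        bot_name="Bot",
        user_name="User",
    ))
```

Because `stopping_words` is `['\n']` in the generation parameters, generation would stop at the first newline, which fits the one-line-per-message layout of these templates.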
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.7043139934539795s
Received healthy response to inference request in 2.393906831741333s
Received healthy response to inference request in 1.928403615951538s
Received healthy response to inference request in 2.219675302505493s
Received healthy response to inference request in 2.248433828353882s
Received healthy response to inference request in 2.105020523071289s
Received healthy response to inference request in 2.3415327072143555s
Received healthy response to inference request in 1.9422593116760254s
Received healthy response to inference request in 2.29311203956604s
Received healthy response to inference request in 2.20351505279541s
10 requests
0 failed requests
5th percentile: 1.9346386790275574
10th percentile: 1.9408737421035767
20th percentile: 2.0724682807922363
30th percentile: 2.1739666938781737
40th percentile: 2.21321120262146
50th percentile: 2.2340545654296875
60th percentile: 2.266305112838745
70th percentile: 2.307638239860535
80th percentile: 2.352007532119751
90th percentile: 2.4249475479125975
95th percentile: 2.564630770683288
99th percentile: 2.6763773488998415
mean time: 2.2380173206329346
Pipeline stage StressChecker completed in 23.71s
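
The percentile and mean figures reported by the StressChecker stage are consistent with linear interpolation between the sorted response times. The sketch below recomputes them from the ten logged latencies. The helper name `percentile` is hypothetical, and linear interpolation is an inference from the numbers, not something the log states about the actual implementation.

```python
import math

# Response times in seconds, copied from the ten "healthy response" log lines.
LATENCIES_S = [
    2.7043139934539795, 2.393906831741333, 1.928403615951538,
    2.219675302505493, 2.248433828353882, 2.105020523071289,
    2.3415327072143555, 1.9422593116760254, 2.29311203956604,
    2.20351505279541,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation.

    Hypothetical helper: interpolates between the two nearest ranks of
    the sorted data.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


mean_time = sum(LATENCIES_S) / len(LATENCIES_S)

if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(LATENCIES_S, q)}")
    print(f"mean time: {mean_time}")
```

The last digits may differ slightly from the logged values because of floating-point rounding.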
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.65s
Shutdown handler de-registered
function_bikaf_2025-12-17 status is now deployed due to DeploymentManager action
function_bikaf_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_bikaf_2025-12-17 status is now torndown due to DeploymentManager action