developer_uid: chai_evaluation_service
submission_id: function_kasun_2025-12-15
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-18T13:31:11+00:00
num_battles: 5413
num_wins: 2697
celo_rating: 1291.64
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-18
win_ratio: 0.49824496582301864
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.9293797016143799s
Received healthy response to inference request in 2.2788608074188232s
Received healthy response to inference request in 2.9829061031341553s
Received healthy response to inference request in 2.3816874027252197s
Received healthy response to inference request in 3.0351815223693848s
Received healthy response to inference request in 2.4339630603790283s
Received healthy response to inference request in 5.780808925628662s
Received healthy response to inference request in 1.784928798675537s
Received healthy response to inference request in 4.568122625350952s
Received healthy response to inference request in 3.2173562049865723s
10 requests
0 failed requests
5th percentile: 1.8499317049980164
10th percentile: 1.9149346113204957
20th percentile: 2.2089645862579346
30th percentile: 2.3508394241333006
40th percentile: 2.413052797317505
50th percentile: 2.708434581756592
60th percentile: 3.003816270828247
70th percentile: 3.089833927154541
80th percentile: 3.4875094890594482
90th percentile: 4.689391255378723
95th percentile: 5.235100090503691
99th percentile: 5.671667158603668
mean time: 3.0393195152282715
Pipeline stage StressChecker completed in 32.22s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_kasun_2025-12-15 status is now deployed due to DeploymentManager action
function_kasun_2025-12-15 status is now inactive due to auto deactivation removed underperforming models
function_kasun_2025-12-15 status is now torndown due to DeploymentManager action