developer_uid: chai_evaluation_service
submission_id: function_sedef_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T18:51:15+00:00
num_battles: 10803
num_wins: 5434
celo_rating: 1295.25
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.5030084235860409
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.486915111541748s
Received healthy response to inference request in 2.259540557861328s
Received healthy response to inference request in 2.478801965713501s
Received healthy response to inference request in 2.303980827331543s
Received healthy response to inference request in 1.8123319149017334s
Received healthy response to inference request in 2.999892234802246s
Received healthy response to inference request in 2.097635269165039s
Received healthy response to inference request in 2.7876884937286377s
Received healthy response to inference request in 2.016766309738159s
Received healthy response to inference request in 2.832378625869751s
10 requests
0 failed requests
5th percentile: 1.904327392578125
10th percentile: 1.9963228702545166
20th percentile: 2.081461477279663
30th percentile: 2.2109689712524414
40th percentile: 2.286204719543457
50th percentile: 2.391391396522522
60th percentile: 2.4820472240447997
70th percentile: 2.577147126197815
80th percentile: 2.7966265201568605
90th percentile: 2.849129986763
95th percentile: 2.924511110782623
99th percentile: 2.9848160099983216
mean time: 2.4075931310653687
Pipeline stage StressChecker completed in 25.39s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_sedef_2025-12-17 status is now deployed due to DeploymentManager action
function_sedef_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_sedef_2025-12-17 status is now torndown due to DeploymentManager action