developer_uid: chai_evaluation_service
submission_id: function_fuhuf_2025-12-14
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-17T18:26:29+00:00
num_battles: 11793
num_wins: 5818
celo_rating: 1256.39
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.49334350886118883
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
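The formatter entry above lists the templates used to build the model input. The following is a minimal sketch of how such templates might be combined into a single prompt string. The assembly order (memory, prompt, conversation turns, response header), the `build_prompt` helper, and the sample names and messages are assumptions for illustration only; they are not taken from the evaluation service. The `truncate_by_message` flag and the `max_input_tokens` limit are not modelled here.

```python
# Template strings copied from the formatter entry in this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs where speaker is
    either "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues
    # the line as the bot.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    # Sample values are made up for illustration.
    print(build_prompt(
        memory="richard is a helpful companion.",
        prompt="A casual chat.",
        turns=[("user", "hi"), ("bot", "hello")],
        bot_name="richard",
        user_name="Sam",
    ))
```

Because the response template leaves the bot's line open, a stopping word of `'\n'` (as in generation_params) would plausibly end generation after one line of reply.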
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.197897434234619s
Received healthy response to inference request in 1.8798367977142334s
Received healthy response to inference request in 2.3555870056152344s
Received healthy response to inference request in 3.620537042617798s
Received healthy response to inference request in 3.339240074157715s
Received healthy response to inference request in 2.577420711517334s
Received healthy response to inference request in 2.756882905960083s
Received healthy response to inference request in 3.545297384262085s
Received healthy response to inference request in 1.869276762008667s
Received healthy response to inference request in 2.409428119659424s
10 requests
0 failed requests
5th percentile: 1.874028778076172
10th percentile: 1.8787807941436767
20th percentile: 2.2604369640350344
30th percentile: 2.393275785446167
40th percentile: 2.5102236747741697
50th percentile: 2.6671518087387085
60th percentile: 2.9332887172698974
70th percentile: 3.2403002262115477
80th percentile: 3.380451536178589
90th percentile: 3.5528213500976564
95th percentile: 3.586679196357727
99th percentile: 3.613765473365784
mean time: 2.7551404237747192
Pipeline stage StressChecker completed in 28.92s
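The percentile and mean figures reported by the StressChecker stage can be reproduced from the ten response times logged above. This is a minimal sketch that assumes percentiles are computed with linear interpolation between the sorted samples; the `percentile` helper is written here for illustration and is not the service's own code.

```python
import math

# Response times in seconds, copied from the StressChecker log lines.
latencies = [
    3.197897434234619,
    1.8798367977142334,
    2.3555870056152344,
    3.620537042617798,
    3.339240074157715,
    2.577420711517334,
    2.756882905960083,
    3.545297384262085,
    1.869276762008667,
    2.409428119659424,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted data
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only ten samples, the upper percentiles are interpolated between the two slowest requests, so they say little about tail latency.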
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.91s
Shutdown handler de-registered
function_fuhuf_2025-12-14 status is now deployed due to DeploymentManager action
function_fuhuf_2025-12-14 status is now inactive due to auto deactivation removed underperforming models
function_fuhuf_2025-12-14 status is now torndown due to DeploymentManager action