developer_uid: chai_evaluation_service
submission_id: function_joluk_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T23:51:19+00:00
num_battles: 6609
num_wins: 3195
celo_rating: 1281.53
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.4834316840671811
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.84842848777771s
Received healthy response to inference request in 3.4581120014190674s
Received healthy response to inference request in 2.4761388301849365s
Received healthy response to inference request in 1.6399800777435303s
Received healthy response to inference request in 2.4728667736053467s
Received healthy response to inference request in 2.806300401687622s
Received healthy response to inference request in 2.5480458736419678s
Received healthy response to inference request in 1.917910099029541s
Received healthy response to inference request in 1.8996694087982178s
Received healthy response to inference request in 1.7921817302703857s
10 requests
0 failed requests
5th percentile: 1.7084708213806152
10th percentile: 1.7769615650177002
20th percentile: 1.8781718730926513
30th percentile: 1.9124378919601441
40th percentile: 2.2508841037750242
50th percentile: 2.4745028018951416
60th percentile: 2.504901647567749
70th percentile: 2.625522232055664
80th percentile: 2.8147260189056396
90th percentile: 2.9093968391418454
95th percentile: 3.1837544202804557
99th percentile: 3.403240485191345
mean time: 2.3859633684158323
Pipeline stage StressChecker completed in 25.46s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.86s
Shutdown handler de-registered
function_joluk_2025-12-17 status is now deployed due to DeploymentManager action
function_joluk_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_joluk_2025-12-17 status is now torndown due to DeploymentManager action