developer_uid: chai_evaluation_service
submission_id: function_magor_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T23:41:12+00:00
num_battles: 7628
num_wins: 3810
celo_rating: 1292.94
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.49947561615102254
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2727956771850586s
Received healthy response to inference request in 2.83408260345459s
Received healthy response to inference request in 2.1780953407287598s
Received healthy response to inference request in 2.188028335571289s
Received healthy response to inference request in 2.036208152770996s
Received healthy response to inference request in 2.3208823204040527s
Received healthy response to inference request in 2.1077256202697754s
Received healthy response to inference request in 2.6268744468688965s
Received healthy response to inference request in 2.3900022506713867s
Received healthy response to inference request in 2.8208374977111816s
10 requests
0 failed requests
5th percentile: 2.0683910131454466
10th percentile: 2.1005738735198975
20th percentile: 2.164021396636963
30th percentile: 2.1850484371185304
40th percentile: 2.238888740539551
50th percentile: 2.2968389987945557
60th percentile: 2.3485302925109863
70th percentile: 2.4610639095306395
80th percentile: 2.6656670570373535
90th percentile: 2.8221620082855225
95th percentile: 2.828122305870056
99th percentile: 2.8328905439376832
mean time: 2.3775532245635986
Pipeline stage StressChecker completed in 25.01s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.56s
Shutdown handler de-registered
function_magor_2025-12-16 status is now deployed due to DeploymentManager action
function_magor_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_magor_2025-12-16 status is now torndown due to DeploymentManager action