developer_uid: chai_evaluation_service
submission_id: function_suhok_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T01:21:21+00:00
num_battles: 8512
num_wins: 4226
celo_rating: 1290.83
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.49647556390977443
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.182879686355591s
Received healthy response to inference request in 2.3403162956237793s
Received healthy response to inference request in 2.6537208557128906s
Received healthy response to inference request in 3.482454776763916s
Received healthy response to inference request in 2.382981538772583s
Received healthy response to inference request in 2.3358535766601562s
Received healthy response to inference request in 2.6130266189575195s
Received healthy response to inference request in 2.5612823963165283s
Received healthy response to inference request in 3.4955856800079346s
Received healthy response to inference request in 2.794696807861328s
10 requests
0 failed requests
5th percentile: 2.2517179369926454
10th percentile: 2.3205561876296996
20th percentile: 2.3394237518310548
30th percentile: 2.3701819658279417
40th percentile: 2.4899620532989504
50th percentile: 2.587154507637024
60th percentile: 2.629304313659668
70th percentile: 2.696013641357422
80th percentile: 2.932248401641846
90th percentile: 3.483767867088318
95th percentile: 3.489676773548126
99th percentile: 3.494403898715973
mean time: 2.6842798233032226
Pipeline stage StressChecker completed in 28.14s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.62s
Shutdown handler de-registered
function_suhok_2025-12-17 status is now deployed due to DeploymentManager action
function_suhok_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_suhok_2025-12-17 status is now torndown due to DeploymentManager action