developer_uid: chai_evaluation_service
submission_id: function_kohos_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T06:51:16+00:00
num_battles: 7690
num_wins: 3812
celo_rating: 1256.68
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-15
win_ratio: 0.49570871261378413
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
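The formatter entry above implies how a prompt is assembled from its templates. The following is a minimal sketch only: the assembly order (memory, prompt, conversation turns, then the response header), the `build_prompt` helper, and the `(speaker, message)` turn representation are assumptions for illustration, not the evaluation service's actual implementation. Truncation to `max_input_tokens` and the `truncate_by_message` flag are not modeled.

```python
# Templates copied from the formatter entry in the metadata above.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the filled-in templates into one string.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user". The ordering of sections is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues
    # the line as the bot's next message.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    # Example values are invented for illustration.
    print(build_prompt("You are Richard.", "A chat.",
                       [("user", "Hi"), ("bot", "Hello")], "Richard", "Sam"))
```

Because the response template ends mid-line and `stopping_words` is `['\n']`, generation under this layout would stop at the end of the bot's single reply line.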
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.6104540824890137s
Received healthy response to inference request in 3.7141611576080322s
Received healthy response to inference request in 2.956238269805908s
Received healthy response to inference request in 2.233515977859497s
Received healthy response to inference request in 1.8778367042541504s
Received healthy response to inference request in 2.0450050830841064s
Received healthy response to inference request in 1.8177189826965332s
Received healthy response to inference request in 3.8972201347351074s
Received healthy response to inference request in 5.916613578796387s
Received healthy response to inference request in 3.48868465423584s
10 requests
0 failed requests
5th percentile: 1.844771957397461
10th percentile: 1.8718249320983886
20th percentile: 2.0115714073181152
30th percentile: 2.17696270942688
40th percentile: 2.459678840637207
50th percentile: 2.783346176147461
60th percentile: 3.1692168235778806
70th percentile: 3.5563276052474975
80th percentile: 3.7507729530334473
90th percentile: 4.0991594791412345
95th percentile: 5.007886528968809
99th percentile: 5.734868168830872
mean time: 3.0557448625564576
Pipeline stage StressChecker completed in 33.46s
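The StressChecker percentiles and mean above can be reproduced from the ten logged response times. The sketch below assumes linear interpolation between order statistics, which is consistent with the logged values; the `percentile` helper is hypothetical and is not taken from the pipeline's code.

```python
import math

# Response times in seconds, copied from the StressChecker log lines above.
latencies = [
    2.6104540824890137, 3.7141611576080322, 2.956238269805908,
    2.233515977859497, 1.8778367042541504, 2.0450050830841064,
    1.8177189826965332, 3.8972201347351074, 5.916613578796387,
    3.48868465423584,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    # Fractional position of the percentile within the sorted sample.
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only 10 requests, the upper percentiles are dominated by the single slowest response (5.92 s), so the 95th and 99th percentile figures are interpolations rather than observed tail latencies.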
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.67s
Shutdown handler de-registered
function_kohos_2025-12-16 status is now deployed due to DeploymentManager action
function_kohos_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_kohos_2025-12-16 status is now torndown due to DeploymentManager action