developer_uid: chai_evaluation_service
submission_id: function_naheb_2025-12-15
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-18T13:31:12+00:00
num_battles: 5467
num_wins: 2767
celo_rating: 1297.54
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-18
win_ratio: 0.5061276751417596
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.9012107849121094s
Received healthy response to inference request in 1.6922495365142822s
Received healthy response to inference request in 1.8259093761444092s
Received healthy response to inference request in 2.415699005126953s
Received healthy response to inference request in 2.0829036235809326s
Received healthy response to inference request in 2.8708784580230713s
Received healthy response to inference request in 1.8224866390228271s
Received healthy response to inference request in 1.9550158977508545s
Received healthy response to inference request in 2.6172475814819336s
Received healthy response to inference request in 2.4322495460510254s
10 requests
0 failed requests
5th percentile: 1.7508562326431274
10th percentile: 1.8094629287719726
20th percentile: 1.8252248287200927
30th percentile: 1.8786203622817994
40th percentile: 1.9334938526153564
50th percentile: 2.0189597606658936
60th percentile: 2.216021776199341
70th percentile: 2.420664167404175
80th percentile: 2.469249153137207
90th percentile: 2.6426106691360474
95th percentile: 2.756744563579559
99th percentile: 2.848051679134369
mean time: 2.16158504486084
Pipeline stage StressChecker completed in 23.48s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.57s
Shutdown handler de-registered
function_naheb_2025-12-15 status is now deployed due to DeploymentManager action
function_naheb_2025-12-15 status is now inactive due to auto deactivation removed underperforming models
function_naheb_2025-12-15 status is now torndown due to DeploymentManager action