developer_uid: chai_evaluation_service
submission_id: function_nebeb_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T03:41:45+00:00
num_battles: 8187
num_wins: 4143
celo_rating: 1297.28
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-18
win_ratio: 0.5060461707585197
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
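The formatter entry above only lists the templates; it does not say how they are combined. As a rough sketch, assuming the pieces are concatenated in the order memory, prompt, conversation turns, response header (that ordering and the helper name `build_prompt` are assumptions, not something this record confirms), the model input might be assembled like this:

```python
# Templates copied from the formatter entry in this record.
formatter = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The real service may order or truncate differently.
    """
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    memory="You are a friendly assistant.",
    prompt="A casual chat.",
    turns=[("user", "hi"), ("bot", "hello")],
    bot_name="Bot",
    user_name="User",
)
print(example)
```

Since generation_params lists '\n' as a stopping word, a completion under this layout would presumably end at the first newline, giving a single-line bot message.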
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.7451395988464355s
Received healthy response to inference request in 2.4674534797668457s
Received healthy response to inference request in 2.3699018955230713s
Received healthy response to inference request in 3.344698667526245s
Received healthy response to inference request in 2.6063060760498047s
Received healthy response to inference request in 2.4592154026031494s
Received healthy response to inference request in 2.4969804286956787s
Received healthy response to inference request in 1.8897128105163574s
Received healthy response to inference request in 2.623957872390747s
Received healthy response to inference request in 3.6324567794799805s
10 requests
0 failed requests
5th percentile: 1.8101975440979003
10th percentile: 1.8752554893493651
20th percentile: 2.2738640785217283
30th percentile: 2.432421350479126
40th percentile: 2.4641582489013674
50th percentile: 2.482216954231262
60th percentile: 2.540710687637329
70th percentile: 2.6116016149520873
80th percentile: 2.768106031417847
90th percentile: 3.3734744787216187
95th percentile: 3.502965629100799
99th percentile: 3.606558549404144
mean time: 2.5635823011398315
Pipeline stage StressChecker completed in 28.03s
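The StressChecker percentiles above are consistent with linear interpolation between the sorted response times. The log does not state which routine produced them, so the `percentile` function below is an illustrative assumption, not the checker's actual code. A minimal sketch that recomputes the summary from the ten logged latencies:

```python
# Response times (seconds) copied from the ten "healthy response" log lines.
latencies = [
    1.7451395988464355,
    2.4674534797668457,
    2.3699018955230713,
    3.344698667526245,
    2.6063060760498047,
    2.4592154026031494,
    2.4969804286956787,
    1.8897128105163574,
    2.623957872390747,
    3.6324567794799805,
]


def percentile(values, q):
    """Percentile q (0-100) using linear interpolation between sorted samples."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only 10 samples, the upper percentiles (95th, 99th) are interpolated almost entirely from the two slowest requests, so they are better read as a rough indication than as a stable tail estimate.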
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.61s
Shutdown handler de-registered
function_nebeb_2025-12-16 status is now deployed due to DeploymentManager action
function_nebeb_2025-12-16 status is now inactive due to auto deactivation (removed underperforming models)
function_nebeb_2025-12-16 status is now torndown due to DeploymentManager action