developer_uid: chai_evaluation_service
submission_id: function_nobar_2025-12-14
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-17T22:31:14+00:00
num_battles: 14679
num_wins: 7236
celo_rating: 1256.41
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.49294911097486205
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 8.104395627975464s
Received healthy response to inference request in 2.3435702323913574s
Received healthy response to inference request in 3.3133232593536377s
Received healthy response to inference request in 3.592205762863159s
Received healthy response to inference request in 2.728102684020996s
Received healthy response to inference request in 5.363752603530884s
Received healthy response to inference request in 3.105117082595825s
Received healthy response to inference request in 2.6160058975219727s
Received healthy response to inference request in 3.3561668395996094s
Received healthy response to inference request in 2.5210933685302734s
10 requests
0 failed requests
5th percentile: 2.4234556436538695
10th percentile: 2.503341054916382
20th percentile: 2.5970233917236327
30th percentile: 2.694473648071289
40th percentile: 2.9543113231658937
50th percentile: 3.2092201709747314
60th percentile: 3.3304606914520263
70th percentile: 3.426978516578674
80th percentile: 3.9465151309967044
90th percentile: 5.637816905975341
95th percentile: 6.8711062669754
99th percentile: 7.857737755775452
mean time: 3.704373335838318
Pipeline stage StressChecker completed in 38.32s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.55s
Shutdown handler de-registered
function_nobar_2025-12-14 status is now deployed due to DeploymentManager action
function_nobar_2025-12-14 status is now inactive due to auto deactivation removed underperforming models
function_nobar_2025-12-14 status is now torndown due to DeploymentManager action