developer_uid: chai_evaluation_service
submission_id: function_getof_2025-12-19
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-22T01:31:04+00:00
num_battles: 7351
num_wins: 3655
celo_rating: 1291.42
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-21
win_ratio: 0.4972112637736362
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.346553325653076s
Received healthy response to inference request in 3.273268222808838s
Received healthy response to inference request in 2.9058520793914795s
Received healthy response to inference request in 1.5883879661560059s
Received healthy response to inference request in 1.831679344177246s
Received healthy response to inference request in 2.890261173248291s
Received healthy response to inference request in 2.943983554840088s
Received healthy response to inference request in 1.9403555393218994s
Received healthy response to inference request in 2.294016122817993s
Received healthy response to inference request in 2.2004122734069824s
10 requests
0 failed requests
5th percentile: 1.6978690862655639
10th percentile: 1.8073502063751221
20th percentile: 1.9186203002929687
30th percentile: 2.1223952531814576
40th percentile: 2.256574583053589
50th percentile: 2.3202847242355347
60th percentile: 2.5640364646911618
70th percentile: 2.8949384450912476
80th percentile: 2.913478374481201
90th percentile: 2.976912021636963
95th percentile: 3.1250901222229
99th percentile: 3.2436326026916507
mean time: 2.42147696018219
Pipeline stage StressChecker completed in 25.55s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_getof_2025-12-19 status is now deployed due to DeploymentManager action
function_getof_2025-12-19 status is now inactive due to auto deactivation removed underperforming models
function_getof_2025-12-19 status is now torndown due to DeploymentManager action