developer_uid: chai_evaluation_service
submission_id: function_hirof_2025-12-18
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-22T00:31:04+00:00
num_battles: 9592
num_wins: 4730
celo_rating: 1288.5
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-21
win_ratio: 0.49311926605504586
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.5395047664642334s
Received healthy response to inference request in 2.8418750762939453s
Received healthy response to inference request in 2.087663412094116s
Received healthy response to inference request in 3.567754030227661s
Received healthy response to inference request in 2.9097073078155518s
Received healthy response to inference request in 3.8157052993774414s
Received healthy response to inference request in 3.033759593963623s
Received healthy response to inference request in 3.0818593502044678s
Received healthy response to inference request in 2.7074618339538574s
Received healthy response to inference request in 3.9835329055786133s
10 requests
0 failed requests
5th percentile: 2.3665727019309997
10th percentile: 2.645481991767883
20th percentile: 2.8149924278259277
30th percentile: 2.88935763835907
40th percentile: 2.9841386795043947
50th percentile: 3.0578094720840454
60th percentile: 3.264917516708374
70th percentile: 3.547979545593262
80th percentile: 3.617344284057617
90th percentile: 3.8324880599975586
95th percentile: 3.908010482788086
99th percentile: 3.9684284210205076
mean time: 3.1568823575973513
Pipeline stage StressChecker completed in 32.83s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_hirof_2025-12-18 status is now deployed due to DeploymentManager action
function_hirof_2025-12-18 status is now inactive due to auto deactivation removed underperforming models
function_hirof_2025-12-18 status is now torndown due to DeploymentManager action