developer_uid: chai_evaluation_service
submission_id: function_helib_2025-12-18
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-21T12:01:28+00:00
num_battles: 9001
num_wins: 4526
celo_rating: 1295.23
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-21
win_ratio: 0.502833018553494
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2975971698760986s
Received healthy response to inference request in 1.7733769416809082s
Received healthy response to inference request in 1.7942063808441162s
Received healthy response to inference request in 2.8776135444641113s
Received healthy response to inference request in 2.3060178756713867s
Received healthy response to inference request in 2.3412740230560303s
Received healthy response to inference request in 1.9960477352142334s
Received healthy response to inference request in 1.9158952236175537s
Received healthy response to inference request in 2.010389566421509s
Received healthy response to inference request in 2.3653879165649414s
10 requests
0 failed requests
5th percentile: 1.7827501893043518
10th percentile: 1.7921234369277954
20th percentile: 1.8915574550628662
30th percentile: 1.9720019817352294
40th percentile: 2.0046528339385987
50th percentile: 2.1539933681488037
60th percentile: 2.300965452194214
70th percentile: 2.31659471988678
80th percentile: 2.3460968017578123
90th percentile: 2.4166104793548584
95th percentile: 2.6471120119094844
99th percentile: 2.831513237953186
mean time: 2.167780637741089
Pipeline stage StressChecker completed in 23.20s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
function_helib_2025-12-18 status is now deployed due to DeploymentManager action
function_helib_2025-12-18 status is now inactive due to auto deactivation removed underperforming models
function_helib_2025-12-18 status is now torndown due to DeploymentManager action