developer_uid: chai_evaluation_service
submission_id: function_gegib_2025-12-18
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-21T05:51:15+00:00
num_battles: 6759
num_wins: 3370
celo_rating: 1286.04
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-17
win_ratio: 0.4985944666370765
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
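
The `formatter` entry above lists five string templates. The log does not say how the service combines them, so the following is only a minimal sketch under an assumed order: memory, then prompt, then the chat turns, then the response header. The function `build_prompt` and its arguments are hypothetical names introduced for illustration. Truncation to `max_input_tokens` is not modelled.

```python
# Templates copied from the `formatter` line of the metadata.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Assemble a model input string from the templates.

    `turns` is a list of (speaker, message) pairs where speaker is
    "bot" or "user". The assembly order is an assumption, not taken
    from the log.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues
    # the bot's line; generation stops at the first newline ('\n' is the
    # only entry in stopping_words).
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)
```

For example, `build_prompt("M", "P", [("user", "hi")], "Bot", "User")` yields a string that ends in `### Response:\nBot:`.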
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.8676717281341553s
Received healthy response to inference request in 2.5653882026672363s
Received healthy response to inference request in 2.409654140472412s
Received healthy response to inference request in 1.7448368072509766s
Received healthy response to inference request in 1.954836368560791s
Received healthy response to inference request in 1.8512277603149414s
Received healthy response to inference request in 1.9354183673858643s
Received healthy response to inference request in 3.1099812984466553s
Received healthy response to inference request in 2.8047029972076416s
Received healthy response to inference request in 2.710491180419922s
10 requests
0 failed requests
5th percentile: 1.7927127361297608
10th percentile: 1.840588665008545
20th percentile: 1.8643829345703125
30th percentile: 1.9150943756103516
40th percentile: 1.9470691680908203
50th percentile: 2.1822452545166016
60th percentile: 2.471947765350342
70th percentile: 2.608919095993042
80th percentile: 2.7293335437774657
90th percentile: 2.835230827331543
95th percentile: 2.9726060628890987
99th percentile: 3.082506251335144
mean time: 2.2954208850860596
Pipeline stage StressChecker completed in 25.15s
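
The percentile and mean figures reported by StressChecker are consistent with linear interpolation between the sorted response times (the default behaviour of `numpy.percentile`). Whether the service actually uses NumPy is not stated in the log, so the pure-Python sketch below is an assumption-labelled reconstruction. The helper name `percentile` is hypothetical.

```python
import math

# The ten response times (seconds) from the StressChecker log lines.
LATENCIES = [
    1.8676717281341553, 2.5653882026672363, 2.409654140472412,
    1.7448368072509766, 1.954836368560791, 1.8512277603149414,
    1.9354183673858643, 3.1099812984466553, 2.8047029972076416,
    2.710491180419922,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation.

    Assumed method: rank = (n - 1) * q / 100, interpolating between
    the two nearest sorted samples.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lo = math.floor(rank)
    hi = math.ceil(rank)
    frac = rank - lo
    return ordered[lo] + (ordered[hi] - ordered[lo]) * frac


mean_time = sum(LATENCIES) / len(LATENCIES)

if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(LATENCIES, q)}")
    print(f"mean time: {mean_time}")
```

With only 10 samples, the tail percentiles (95th, 99th) are interpolated almost entirely from the two slowest requests, so they are rough indications rather than stable estimates.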
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.61s
Shutdown handler de-registered
function_gegib_2025-12-18 status is now deployed due to DeploymentManager action
function_gegib_2025-12-18 status is now inactive due to auto deactivation removed underperforming models
function_gegib_2025-12-18 status is now torndown due to DeploymentManager action