developer_uid: chai_evaluation_service
submission_id: function_kogal_2025-12-15
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-18T09:01:09+00:00
num_battles: 7454
num_wins: 3720
celo_rating: 1292.44
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-18
win_ratio: 0.4990609068956265
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.1387457847595215s
Received healthy response to inference request in 1.9196739196777344s
Received healthy response to inference request in 2.004460096359253s
Received healthy response to inference request in 3.0981605052948s
Received healthy response to inference request in 2.112931251525879s
Received healthy response to inference request in 2.4668750762939453s
Received healthy response to inference request in 1.9110593795776367s
Received healthy response to inference request in 1.8736460208892822s
Received healthy response to inference request in 2.249927043914795s
Received healthy response to inference request in 1.8204011917114258s
10 requests
0 failed requests
5th percentile: 1.8443613648414612
10th percentile: 1.8683215379714966
20th percentile: 1.903576707839966
30th percentile: 1.9170895576477052
40th percentile: 1.9705456256866456
50th percentile: 2.058695673942566
60th percentile: 2.167729568481445
70th percentile: 2.31501145362854
80th percentile: 2.593132162094116
90th percentile: 3.102219033241272
95th percentile: 3.1204824090003966
99th percentile: 3.1350931096076966
mean time: 2.259588027000427
Pipeline stage StressChecker completed in 24.34s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
function_kogal_2025-12-15 status is now deployed due to DeploymentManager action
function_kogal_2025-12-15 status is now inactive due to auto deactivation removed underperforming models
function_kogal_2025-12-15 status is now torndown due to DeploymentManager action