developer_uid: chai_evaluation_service
submission_id: function_puhik_2025-12-18
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-21T03:21:22+00:00
num_battles: 7343
num_wins: 3668
celo_rating: 1292.83
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.49952335557673977
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.4046247005462646s
Received healthy response to inference request in 1.994225025177002s
Received healthy response to inference request in 3.1454081535339355s
Received healthy response to inference request in 2.4036829471588135s
Received healthy response to inference request in 3.121220111846924s
Received healthy response to inference request in 2.323570966720581s
Received healthy response to inference request in 2.4508626461029053s
Received healthy response to inference request in 2.2231433391571045s
Received healthy response to inference request in 2.2256991863250732s
Received healthy response to inference request in 2.221273899078369s
10 requests
0 failed requests
5th percentile: 2.0963970184326173
10th percentile: 2.1985690116882326
20th percentile: 2.2227694511413576
30th percentile: 2.2249324321746826
40th percentile: 2.284422254562378
50th percentile: 2.3636269569396973
60th percentile: 2.4225548267364503
70th percentile: 2.651969885826111
80th percentile: 3.1260577201843263
90th percentile: 3.171329808235168
95th percentile: 3.287977254390716
99th percentile: 3.381295211315155
mean time: 2.5513710975646973
Pipeline stage StressChecker completed in 26.79s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.56s
Shutdown handler de-registered
function_puhik_2025-12-18 status is now deployed due to DeploymentManager action
function_puhik_2025-12-18 status is now inactive due to auto deactivation removed underperforming models
function_puhik_2025-12-18 status is now torndown due to DeploymentManager action