developer_uid: chai_evaluation_service
submission_id: function_hupen_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T18:21:20+00:00
num_battles: 11272
num_wins: 5603
celo_rating: 1291.1
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.4970723917672108
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.896514654159546s
Received healthy response to inference request in 1.7042384147644043s
Received healthy response to inference request in 1.9159858226776123s
Received healthy response to inference request in 3.113955497741699s
Received healthy response to inference request in 1.6632771492004395s
Received healthy response to inference request in 1.7426669597625732s
Received healthy response to inference request in 3.1282551288604736s
Received healthy response to inference request in 3.0911338329315186s
Received healthy response to inference request in 3.18035888671875s
Received healthy response to inference request in 3.0635764598846436s
10 requests
0 failed requests
5th percentile: 1.6817097187042236
10th percentile: 1.7001422882080077
20th percentile: 1.7349812507629394
30th percentile: 1.8639901638031005
40th percentile: 2.5043031215667724
50th percentile: 2.9800455570220947
60th percentile: 3.0745994091033935
70th percentile: 3.097980332374573
80th percentile: 3.116815423965454
90th percentile: 3.1334655046463014
95th percentile: 3.1569121956825255
99th percentile: 3.175669548511505
mean time: 2.549996280670166
Pipeline stage StressChecker completed in 27.09s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.63s
Shutdown handler de-registered
function_hupen_2025-12-16 status is now deployed due to DeploymentManager action
function_hupen_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_hupen_2025-12-16 status is now torndown due to DeploymentManager action