developer_uid: chai_evaluation_service
submission_id: function_fupuk_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-13T04:47:38+00:00
num_battles: 6107
num_wins: 3111
celo_rating: 1299.9
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-12
win_ratio: 0.5094154249222204
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2223634719848633s
Received healthy response to inference request in 2.0442349910736084s
Received healthy response to inference request in 3.7721316814422607s
Received healthy response to inference request in 2.048203945159912s
Received healthy response to inference request in 3.1406240463256836s
Received healthy response to inference request in 2.3039629459381104s
Received healthy response to inference request in 3.5327301025390625s
Received healthy response to inference request in 3.08469820022583s
Received healthy response to inference request in 1.727168083190918s
Received healthy response to inference request in 3.3546035289764404s
10 requests
0 failed requests
5th percentile: 1.8698481917381287
10th percentile: 2.0125283002853394
20th percentile: 2.0474101543426513
30th percentile: 2.170115613937378
40th percentile: 2.2713231563568117
50th percentile: 2.69433057308197
60th percentile: 3.1070685386657715
70th percentile: 3.2048178911209106
80th percentile: 3.390228843688965
90th percentile: 3.556670260429382
95th percentile: 3.664400970935821
99th percentile: 3.750585539340973
mean time: 2.7230720996856688
Pipeline stage StressChecker completed in 28.79s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.76s
Shutdown handler de-registered
function_fupuk_2025-12-13 status is now deployed due to DeploymentManager action
function_fupuk_2025-12-13 status is now inactive due to auto deactivation removed underperforming models