developer_uid: chai_evaluation_service
submission_id: function_nolif_2025-12-14
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-14T07:27:12+00:00
num_battles: 6326
num_wins: 3257
celo_rating: 1327.07
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.5148593107809042
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.1691408157348633s
Received healthy response to inference request in 2.8132710456848145s
Received healthy response to inference request in 2.5081262588500977s
Received healthy response to inference request in 3.4616012573242188s
Received healthy response to inference request in 2.181406259536743s
Received healthy response to inference request in 2.8434650897979736s
Received healthy response to inference request in 3.0244836807250977s
Received healthy response to inference request in 2.1620047092437744s
Received healthy response to inference request in 1.9991302490234375s
Received healthy response to inference request in 2.6537106037139893s
10 requests
0 failed requests
5th percentile: 2.072423756122589
10th percentile: 2.1457172632217407
20th percentile: 2.1677135944366457
30th percentile: 2.177726626396179
40th percentile: 2.377438259124756
50th percentile: 2.5809184312820435
60th percentile: 2.7175347805023193
70th percentile: 2.8223292589187623
80th percentile: 2.8796688079833985
90th percentile: 3.0681954383850094
95th percentile: 3.2648983478546136
99th percentile: 3.422260675430298
mean time: 2.581633996963501
Pipeline stage StressChecker completed in 27.54s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.63s
Shutdown handler de-registered
function_nolif_2025-12-14 status is now deployed due to DeploymentManager action
function_nolif_2025-12-14 status is now inactive due to auto deactivation removed underperforming models