developer_uid: chai_evaluation_service
submission_id: function_petom_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-13T22:16:54+00:00
num_battles: 7074
num_wins: 3557
celo_rating: 1256.34
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.5028272547356517
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.9597914218902588s
Received healthy response to inference request in 2.214144468307495s
Received healthy response to inference request in 2.1489529609680176s
Received healthy response to inference request in 1.812227487564087s
Received healthy response to inference request in 2.230381727218628s
Received healthy response to inference request in 2.101069688796997s
Received healthy response to inference request in 3.1770622730255127s
Received healthy response to inference request in 1.9443297386169434s
Received healthy response to inference request in 2.7513935565948486s
Received healthy response to inference request in 2.838732957839966s
10 requests
0 failed requests
5th percentile: 1.8716735005378724
10th percentile: 1.9311195135116577
20th percentile: 1.9566990852355957
30th percentile: 2.0586862087249758
40th percentile: 2.129799652099609
50th percentile: 2.1815487146377563
60th percentile: 2.220639371871948
70th percentile: 2.386685276031494
80th percentile: 2.768861436843872
90th percentile: 2.8725658893585204
95th percentile: 3.0248140811920163
99th percentile: 3.1466126346588137
mean time: 2.3178086280822754
Pipeline stage StressChecker completed in 24.61s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.56s
Shutdown handler de-registered
function_petom_2025-12-13 status is now deployed due to DeploymentManager action
function_petom_2025-12-13 status is now inactive due to auto deactivation removed underperforming models