developer_uid: chai_evaluation_service
submission_id: function_gepef_2025-12-15
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-18T02:21:32+00:00
num_battles: 9478
num_wins: 4683
celo_rating: 1289.11
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-17
win_ratio: 0.49409158050221563
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.995213747024536s
Received healthy response to inference request in 4.073127508163452s
Received healthy response to inference request in 2.7929160594940186s
Received healthy response to inference request in 2.6443729400634766s
Received healthy response to inference request in 2.036724090576172s
Received healthy response to inference request in 3.3214707374572754s
Received healthy response to inference request in 4.498187065124512s
Received healthy response to inference request in 2.494534969329834s
Received healthy response to inference request in 3.505495071411133s
Received healthy response to inference request in 4.4535071849823s
10 requests
0 failed requests
5th percentile: 2.2427389860153197
10th percentile: 2.4487538814544676
20th percentile: 2.6144053459167482
30th percentile: 2.748353123664856
40th percentile: 3.1100488662719727
50th percentile: 3.413482904434204
60th percentile: 3.701382541656494
70th percentile: 4.018587875366211
80th percentile: 4.149203443527222
90th percentile: 4.457975172996521
95th percentile: 4.4780811190605165
99th percentile: 4.494165875911713
mean time: 3.381554937362671
Pipeline stage StressChecker completed in 36.09s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.56s
Shutdown handler de-registered
function_gepef_2025-12-15 status is now deployed due to DeploymentManager action
function_gepef_2025-12-15 status is now inactive due to auto deactivation removed underperforming models
function_gepef_2025-12-15 status is now torndown due to DeploymentManager action