developer_uid: chai_evaluation_service
submission_id: function_pemor_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T09:21:16+00:00
num_battles: 7933
num_wins: 3978
celo_rating: 1294.12
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.5014496407412076
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.258878231048584s
Received healthy response to inference request in 2.6770222187042236s
Received healthy response to inference request in 1.8429462909698486s
Received healthy response to inference request in 2.006431818008423s
Received healthy response to inference request in 2.5024538040161133s
Received healthy response to inference request in 2.2574572563171387s
Received healthy response to inference request in 1.7972147464752197s
Received healthy response to inference request in 1.7033727169036865s
Received healthy response to inference request in 1.8894917964935303s
Received healthy response to inference request in 2.8767449855804443s
10 requests
0 failed requests
5th percentile: 1.7456016302108766
10th percentile: 1.7878305435180664
20th percentile: 1.8337999820709228
30th percentile: 1.8755281448364258
40th percentile: 1.9596558094024659
50th percentile: 2.1319445371627808
60th percentile: 2.258025646209717
70th percentile: 2.3319509029388428
80th percentile: 2.5373674869537353
90th percentile: 2.6969944953918454
95th percentile: 2.7868697404861447
99th percentile: 2.8587699365615844
mean time: 2.181201386451721
Pipeline stage StressChecker completed in 23.23s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_pemor_2025-12-17 status is now deployed due to DeploymentManager action
function_pemor_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_pemor_2025-12-17 status is now torndown due to DeploymentManager action