developer_uid: chai_evaluation_service
submission_id: function_gomun_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T21:51:15+00:00
num_battles: 10062
num_wins: 5040
celo_rating: 1293.79
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.5008944543828264
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.8666770458221436s
Received healthy response to inference request in 2.5313987731933594s
Received healthy response to inference request in 3.02915096282959s
Received healthy response to inference request in 2.0422000885009766s
Received healthy response to inference request in 2.245253086090088s
Received healthy response to inference request in 2.7921557426452637s
Received healthy response to inference request in 1.591496229171753s
Received healthy response to inference request in 3.2034547328948975s
Received healthy response to inference request in 3.0323734283447266s
Received healthy response to inference request in 2.4622066020965576s
10 requests
0 failed requests
5th percentile: 1.7943129658699035
10th percentile: 1.997129702568054
20th percentile: 2.204642486572266
30th percentile: 2.3971205472946164
40th percentile: 2.503721904754639
50th percentile: 2.6617772579193115
60th percentile: 2.886953830718994
70th percentile: 3.030117702484131
80th percentile: 3.066589689254761
90th percentile: 3.269776964187622
95th percentile: 3.568227005004882
99th percentile: 3.8069870376586916
mean time: 2.6796366691589357
Pipeline stage StressChecker completed in 28.47s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.76s
Shutdown handler de-registered
function_gomun_2025-12-16 status is now deployed due to DeploymentManager action
function_gomun_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_gomun_2025-12-16 status is now torndown due to DeploymentManager action