developer_uid: chai_evaluation_service
submission_id: function_gipen_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T08:51:14+00:00
num_battles: 7089
num_wins: 3526
celo_rating: 1291.22
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.4973903230356891
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
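The formatter templates above can be read as pieces of a single prompt string. The following is a minimal sketch of how they might be combined, assuming the pieces are concatenated in the order memory, prompt, chat turns, response header. That ordering, the `build_prompt` helper, and the example values are hypothetical and are not confirmed by this record; `truncate_by_message` and the `max_input_tokens` limit are not modeled.

```python
# Templates copied from the `formatter` field above.
formatter = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, bot_name, user_name, turns):
    """Assemble one model input (hypothetical helper).

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The concatenation order is an assumption.
    """
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Example values are made up for illustration only.
example = build_prompt(
    memory="A friendly assistant persona.",
    prompt="A casual conversation.",
    bot_name="richard",
    user_name="User",
    turns=[("user", "Hi there"), ("bot", "Hello!"), ("user", "How are you?")],
)
print(example)
```

Because `stopping_words` is `['\n']` in the generation parameters, a prompt ending in `richard:` would yield a single-line reply under this reading.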
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.410485744476318s
Received healthy response to inference request in 1.9534590244293213s
Received healthy response to inference request in 2.2806625366210938s
Received healthy response to inference request in 2.4065046310424805s
Received healthy response to inference request in 2.0393760204315186s
Received healthy response to inference request in 1.6666452884674072s
Received healthy response to inference request in 1.989320993423462s
Received healthy response to inference request in 2.613065719604492s
Received healthy response to inference request in 2.550565004348755s
Received healthy response to inference request in 1.7662360668182373s
10 requests
0 failed requests
5th percentile: 1.7114611387252807
10th percentile: 1.7562769889831542
20th percentile: 1.9160144329071045
30th percentile: 1.9785624027252198
40th percentile: 2.019354009628296
50th percentile: 2.160019278526306
60th percentile: 2.3309993743896484
70th percentile: 2.449722743034363
80th percentile: 2.5630651473999024
90th percentile: 2.7928077220916743
95th percentile: 3.6016467332839945
99th percentile: 4.248717942237854
mean time: 2.3676321029663088
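The percentile and mean figures above can be reproduced from the ten logged response times. The sketch below assumes linear interpolation between order statistics, which matches the logged values; the `percentile` helper is written here for illustration and is not taken from the StressChecker source.

```python
from statistics import fmean

# Response times in seconds, copied from the StressChecker log lines above.
latencies = [
    4.410485744476318, 1.9534590244293213, 2.2806625366210938,
    2.4065046310424805, 2.0393760204315186, 1.6666452884674072,
    1.989320993423462, 2.613065719604492, 2.550565004348755,
    1.7662360668182373,
]


def percentile(values, q):
    """Return the q-th percentile (0 to 100) using linear interpolation."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0   # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


for q in (5, 50, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {fmean(latencies)}")
```

With only ten samples, the 95th and 99th percentiles are interpolated between the two slowest requests, so the single 4.41 s response dominates the tail figures.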
Pipeline stage StressChecker completed in 25.23s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
function_gipen_2025-12-17 status is now deployed due to DeploymentManager action
function_gipen_2025-12-17 status is now inactive due to auto deactivation of underperforming models
function_gipen_2025-12-17 status is now torndown due to DeploymentManager action