developer_uid: chai_evaluation_service
submission_id: function_gogir_2025-12-15
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-18T19:51:15+00:00
num_battles: 10929
num_wins: 5373
celo_rating: 1287.44
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-18
win_ratio: 0.49162777930277246
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.6424081325531006s
Received healthy response to inference request in 4.363595008850098s
Received healthy response to inference request in 2.799091339111328s
Received healthy response to inference request in 3.250361204147339s
Received healthy response to inference request in 3.675428867340088s
Received healthy response to inference request in 2.9169676303863525s
Received healthy response to inference request in 4.025040149688721s
Received healthy response to inference request in 2.274869918823242s
Received healthy response to inference request in 2.9865968227386475s
Received healthy response to inference request in 3.0262277126312256s
10 requests
0 failed requests
5th percentile: 2.510769557952881
10th percentile: 2.7466691970825194
20th percentile: 2.8933923721313475
30th percentile: 2.965708065032959
40th percentile: 3.0103753566741944
50th percentile: 3.1382944583892822
60th percentile: 3.4071799755096435
70th percentile: 3.6523143529891966
80th percentile: 3.7453511238098147
90th percentile: 4.058895635604858
95th percentile: 4.2112453222274775
99th percentile: 4.333125071525574
mean time: 3.296058678627014
Pipeline stage StressChecker completed in 34.25s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.87s
Shutdown handler de-registered
function_gogir_2025-12-15 status is now deployed due to DeploymentManager action
function_gogir_2025-12-15 status is now inactive due to auto deactivation removed underperforming models
function_gogir_2025-12-15 status is now torndown due to DeploymentManager action