developer_uid: chai_evaluation_service
submission_id: function_dipam_2025-12-14
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-17T17:21:23+00:00
num_battles: 7410
num_wins: 3619
celo_rating: 1256.38
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.48839406207827263
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.7487905025482178s
Received healthy response to inference request in 3.246892213821411s
Received healthy response to inference request in 3.145974636077881s
Received healthy response to inference request in 3.7251081466674805s
Received healthy response to inference request in 2.7817745208740234s
Received healthy response to inference request in 3.351881265640259s
Received healthy response to inference request in 3.817758083343506s
Received healthy response to inference request in 2.6605241298675537s
Received healthy response to inference request in 2.061659812927246s
Received healthy response to inference request in 3.7769405841827393s
10 requests
0 failed requests
5th percentile: 2.3311487555503847
10th percentile: 2.600637698173523
20th percentile: 2.7575244426727297
30th percentile: 3.0367146015167235
40th percentile: 3.206525182723999
50th percentile: 3.299386739730835
60th percentile: 3.501172018051147
70th percentile: 3.732212853431702
80th percentile: 3.754420518875122
90th percentile: 3.781022334098816
95th percentile: 3.799390208721161
99th percentile: 3.814084508419037
mean time: 3.2317303895950316
Pipeline stage StressChecker completed in 33.88s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.56s
Shutdown handler de-registered
function_dipam_2025-12-14 status is now deployed due to DeploymentManager action
function_dipam_2025-12-14 status is now inactive due to auto deactivation removed underperforming models
function_dipam_2025-12-14 status is now torndown due to DeploymentManager action