developer_uid: chai_evaluation_service
submission_id: function_diror_2025-12-14
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-14T09:27:07+00:00
num_battles: 8318
num_wins: 4114
celo_rating: 1315.5
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.4945900456840587
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
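The `formatter` entry above defines five string templates. The log does not say how the service joins them, so the following is only a minimal sketch under the assumption that they are concatenated in the order memory, prompt, conversation turns, response header. The function name `build_prompt`, the `turns` structure, and all example values are hypothetical; truncation to `max_input_tokens` is not modeled.

```python
# Templates copied from the formatter entry in the metadata above.
formatter = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(fmt, memory, prompt, bot_name, user_name, turns):
    """Assemble a model input from the templates (assumed ordering).

    `turns` is a hypothetical list of (speaker, message) pairs where
    speaker is either "bot" or "user".
    """
    parts = [
        fmt["memory_template"].format(memory=memory),
        fmt["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(fmt["bot_template"].format(bot_name=bot_name, message=message))
        else:
            parts.append(fmt["user_template"].format(user_name=user_name, message=message))
    # The response template ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(fmt["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Hypothetical example values, for illustration only.
example = build_prompt(
    formatter,
    memory="Richard is a friendly guide.",
    prompt="A casual chat.",
    bot_name="richard",
    user_name="User",
    turns=[("user", "Hi"), ("bot", "Hello!"), ("user", "How are you?")],
)
print(example)
```

Because the generation parameters list `'\n'` as a stopping word, a prompt that ends in `richard:` would plausibly yield a single-line reply, which is consistent with the one-message-per-line turn templates.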
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.003328800201416s
Received healthy response to inference request in 4.424797058105469s
Received healthy response to inference request in 3.2244253158569336s
Received healthy response to inference request in 4.788560390472412s
Received healthy response to inference request in 3.43204927444458s
Received healthy response to inference request in 1.8025286197662354s
Received healthy response to inference request in 2.928037166595459s
Received healthy response to inference request in 2.771089792251587s
Received healthy response to inference request in 4.219870567321777s
Received healthy response to inference request in 3.4458415508270264s
10 requests
0 failed requests
5th percentile: 1.8928887009620667
10th percentile: 1.983248782157898
20th percentile: 2.617537593841553
30th percentile: 2.8809529542922974
40th percentile: 3.1058700561523436
50th percentile: 3.328237295150757
60th percentile: 3.4375661849975585
70th percentile: 3.6780502557754513
80th percentile: 4.260855865478516
90th percentile: 4.461173391342163
95th percentile: 4.624866890907287
99th percentile: 4.7558216905593875
mean time: 3.3040528535842895
Pipeline stage StressChecker completed in 34.24s
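The StressChecker summary above can be reproduced from the ten logged response times. The sketch below assumes the percentiles use linear interpolation between sorted samples (the default behaviour of common numerical libraries); the log does not state which method the pipeline uses, and the helper name `percentile` is hypothetical.

```python
from statistics import fmean

# Response times in seconds, copied from the ten "healthy response" log lines.
latencies = [
    2.003328800201416, 4.424797058105469, 3.2244253158569336,
    4.788560390472412, 3.43204927444458, 1.8025286197662354,
    2.928037166595459, 2.771089792251587, 4.219870567321777,
    3.4458415508270264,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile() requires at least one value")
    # Fractional position of the percentile within the sorted samples.
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {fmean(latencies)}")
```

With only ten samples, the 95th and 99th percentiles are interpolated between the two slowest requests, so they say little about tail latency under real load.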
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.56s
Shutdown handler de-registered
function_diror_2025-12-14 status is now deployed due to DeploymentManager action
function_diror_2025-12-14 status is now inactive due to auto deactivation removed underperforming models