developer_uid: chai_evaluation_service
submission_id: function_garot_2025-12-14
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-14T06:26:57+00:00
num_battles: 7640
num_wins: 3771
celo_rating: 1256.35
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.49358638743455496
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
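The formatter templates above only define the pieces of a prompt; the record does not state how they are joined. The following is a minimal sketch, assuming the pieces are concatenated in the order memory, prompt, chat turns, response cue. The function name `build_prompt`, the `turns` structure, and the sample names and messages are hypothetical and not taken from the service.

```python
# Templates copied verbatim from the `formatter` field of this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Assemble a prompt string (hypothetical helper).

    Assumed order: memory, prompt, chat turns, response cue.
    `turns` is a list of (speaker, message) pairs, speaker in {"bot", "user"}.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response cue ends with "{bot_name}:" so the model continues as the bot.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Illustrative values only; none of these strings come from the record.
example = build_prompt(
    memory="Richard is a friendly guide.",
    prompt="A casual chat.",
    turns=[("user", "Hi there"), ("bot", "Hello!")],
    bot_name="Richard",
    user_name="Sam",
)
print(example)
```

Because `stopping_words` is `['\n']` and `bot_template` puts each message on one line, generation would plausibly end after a single line of bot text, though the record does not confirm this.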
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2303237915039062s
Received healthy response to inference request in 4.300536632537842s
Received healthy response to inference request in 3.0412895679473877s
Received healthy response to inference request in 2.2289481163024902s
Received healthy response to inference request in 2.7494468688964844s
Received healthy response to inference request in 1.9412310123443604s
Received healthy response to inference request in 2.6470377445220947s
Received healthy response to inference request in 2.3995721340179443s
Received healthy response to inference request in 3.45680570602417s
Received healthy response to inference request in 7.09744119644165s
10 requests
0 failed requests
5th percentile: 2.0707037091255187
10th percentile: 2.200176405906677
20th percentile: 2.230048656463623
30th percentile: 2.348797631263733
40th percentile: 2.5480515003204345
50th percentile: 2.6982423067092896
60th percentile: 2.8661839485168454
70th percentile: 3.165944409370422
80th percentile: 3.6255518913269045
90th percentile: 4.580227088928222
95th percentile: 5.8388341426849335
99th percentile: 6.845719785690308
mean time: 3.209263277053833
Pipeline stage StressChecker completed in 33.47s
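The percentile and mean figures reported by StressChecker can be reproduced from the ten logged response times. The sketch below assumes linear interpolation between the closest ranks, which is NumPy's default percentile method. The log does not say which method the service uses, but this one matches the reported values. The names `LATENCIES`, `percentile`, `summary`, and `mean_time` are illustrative.

```python
import math

# Response times in seconds, copied from the StressChecker log lines above.
LATENCIES = [
    2.2303237915039062,
    4.300536632537842,
    3.0412895679473877,
    2.2289481163024902,
    2.7494468688964844,
    1.9412310123443604,
    2.6470377445220947,
    2.3995721340179443,
    3.45680570602417,
    7.09744119644165,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation.

    Assumption: the fractional rank is (n - 1) * q / 100 and the result is
    interpolated between the two neighbouring sorted values.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


# Same percentile levels as reported in the log.
summary = {
    q: percentile(LATENCIES, q)
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99)
}
mean_time = sum(LATENCIES) / len(LATENCIES)

for q, value in summary.items():
    print(f"{q}th percentile: {value}")
print(f"mean time: {mean_time}")
```

With only ten samples, the upper percentiles are dominated by the single 7.10 s response, so the 95th and 99th percentile values are interpolations toward that one outlier rather than robust tail estimates.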
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.57s
Shutdown handler de-registered
function_garot_2025-12-14 status is now deployed due to DeploymentManager action
function_garot_2025-12-14 status is now inactive due to auto-deactivation (removal of underperforming models)