developer_uid: chai_evaluation_service
submission_id: function_rager_2025-12-14
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-14T09:56:43+00:00
num_battles: 8341
num_wins: 4169
celo_rating: 1219.17
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.499820165447788
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.9279894828796387s
Received healthy response to inference request in 2.2957706451416016s
Received healthy response to inference request in 2.2278025150299072s
Received healthy response to inference request in 2.501131296157837s
Received healthy response to inference request in 1.6553895473480225s
Received healthy response to inference request in 2.825144052505493s
Received healthy response to inference request in 2.2938363552093506s
Received healthy response to inference request in 2.9476330280303955s
Received healthy response to inference request in 3.136924982070923s
Received healthy response to inference request in 2.7936646938323975s
10 requests
0 failed requests
5th percentile: 1.7780595183372498
10th percentile: 1.900729489326477
20th percentile: 2.1678399085998534
30th percentile: 2.2740262031555174
40th percentile: 2.2949969291687013
50th percentile: 2.3984509706497192
60th percentile: 2.618144655227661
70th percentile: 2.803108501434326
80th percentile: 2.8496418476104735
90th percentile: 2.966562223434448
95th percentile: 3.051743602752685
99th percentile: 3.1198887062072753
mean time: 2.4605286598205565
Pipeline stage StressChecker completed in 26.18s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.57s
Shutdown handler de-registered
function_rager_2025-12-14 status is now deployed due to DeploymentManager action
function_rager_2025-12-14 status is now inactive due to auto deactivation removed underperforming models