developer_uid: chai_evaluation_service
submission_id: function_segef_2025-12-14
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-14T10:27:01+00:00
num_battles: 8235
num_wins: 4131
celo_rating: 1256.37
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.5016393442622951
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.348522901535034s
Received healthy response to inference request in 3.094736099243164s
Received healthy response to inference request in 2.908583641052246s
Received healthy response to inference request in 3.1392977237701416s
Received healthy response to inference request in 2.3731400966644287s
Received healthy response to inference request in 3.783127546310425s
Received healthy response to inference request in 2.7421982288360596s
Received healthy response to inference request in 4.484795093536377s
Received healthy response to inference request in 3.863693952560425s
Received healthy response to inference request in 3.0826706886291504s
10 requests
0 failed requests
5th percentile: 2.5392162561416627
10th percentile: 2.7052924156188967
20th percentile: 2.875306558609009
30th percentile: 3.0304445743560793
40th percentile: 3.0899099349975585
50th percentile: 3.117016911506653
60th percentile: 3.2229877948760985
70th percentile: 3.478904294967651
80th percentile: 3.799240827560425
90th percentile: 3.92580406665802
95th percentile: 4.205299580097198
99th percentile: 4.428895990848542
mean time: 3.282076597213745
Pipeline stage StressChecker completed in 33.98s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.84s
Shutdown handler de-registered
function_segef_2025-12-14 status is now deployed due to DeploymentManager action
function_segef_2025-12-14 status is now inactive due to auto deactivation removed underperforming models