developer_uid: chai_evaluation_service
submission_id: function_mabas_2025-12-15
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-15T05:56:39+00:00
num_battles: 7302
num_wins: 3744
celo_rating: 1256.45
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.5127362366474938
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.492244005203247s
Received healthy response to inference request in 2.3409764766693115s
Received healthy response to inference request in 2.6102240085601807s
Received healthy response to inference request in 2.63206148147583s
Received healthy response to inference request in 3.234604597091675s
Received healthy response to inference request in 2.31425404548645s
Received healthy response to inference request in 2.5983924865722656s
Received healthy response to inference request in 2.109769821166992s
Received healthy response to inference request in 2.3057684898376465s
Received healthy response to inference request in 1.876922369003296s
10 requests
0 failed requests
5th percentile: 1.9817037224769591
10th percentile: 2.0864850759506224
20th percentile: 2.2665687561035157
30th percentile: 2.311708378791809
40th percentile: 2.330287504196167
50th percentile: 2.4696844816207886
60th percentile: 2.6031250953674316
70th percentile: 2.6167752504348756
80th percentile: 2.752570104598999
90th percentile: 3.3603685379028314
95th percentile: 3.926306271553038
99th percentile: 4.3790564584732055
mean time: 2.6515217781066895
Pipeline stage StressChecker completed in 28.56s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_mabas_2025-12-15 status is now deployed due to DeploymentManager action
function_mabas_2025-12-15 status is now inactive due to auto deactivation removed underperforming models