developer_uid: chai_evaluation_service
submission_id: function_mufel_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T05:31:10+00:00
num_battles: 8314
num_wins: 4212
celo_rating: 1297.82
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.506615347606447
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
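The formatter entry above only lists the templates. As a minimal sketch of how they might be combined into one model input, the hypothetical helper `build_prompt` below joins them in the order memory, prompt, chat turns, response. That ordering, the helper name, and the sample values are assumptions rather than anything stated in this record. Truncation (`truncate_by_message`, `max_input_tokens`) is not modelled.

```python
# Templates copied verbatim from the formatter entry in this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper. Assumed order: memory, prompt, chat turns, response."""
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues the bot's turn.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Sample values are invented for illustration only.
example = build_prompt(
    memory="richard is a friendly companion.",
    prompt="A casual chat.",
    turns=[("user", "Hi there"), ("bot", "Hello!")],
    bot_name="richard",
    user_name="Alex",
)
print(example)
```

Because `stopping_words` is `['\n']` in the generation parameters, a prompt shaped like this would yield a single-line reply for the bot's turn.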
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.6893906593322754s
Received healthy response to inference request in 2.079099178314209s
Received healthy response to inference request in 2.3277485370635986s
Received healthy response to inference request in 2.127668857574463s
Received healthy response to inference request in 2.475435256958008s
Received healthy response to inference request in 1.817636489868164s
Received healthy response to inference request in 1.7795894145965576s
Received healthy response to inference request in 2.29974627494812s
Received healthy response to inference request in 3.032245397567749s
Received healthy response to inference request in 1.8670213222503662s
10 requests
0 failed requests
5th percentile: 1.7967105984687806
10th percentile: 1.8138317823410035
20th percentile: 1.8571443557739258
30th percentile: 2.015475821495056
40th percentile: 2.1082409858703612
50th percentile: 2.2137075662612915
60th percentile: 2.3109471797943115
70th percentile: 2.3720545530319215
80th percentile: 2.5182263374328615
90th percentile: 2.7236761331558226
95th percentile: 2.8779607653617854
99th percentile: 3.0013884711265564
mean time: 2.249558138847351
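The percentile and mean figures above can be reproduced from the ten logged response times. The sketch below assumes linear interpolation between the two nearest ranks, which matches the reported values. The `percentile` helper is illustrative and is not taken from the StressChecker code.

```python
import math

# Response times in seconds, copied from the ten "healthy response" lines above.
latencies = [
    2.6893906593322754, 2.079099178314209, 2.3277485370635986,
    2.127668857574463, 2.475435256958008, 1.817636489868164,
    1.7795894145965576, 2.29974627494812, 3.032245397567749,
    1.8670213222503662,
]


def percentile(values, q):
    """Return the q-th percentile using linear interpolation between ranks."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])


for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")

mean_time = sum(latencies) / len(latencies)
print(f"mean time: {mean_time}")
```

With only 10 samples, the upper percentiles are driven almost entirely by the single slowest request (3.03 s), so they are best read as a rough indication rather than a stable tail estimate.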
Pipeline stage StressChecker completed in 23.74s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_mufel_2025-12-17 status is now deployed due to DeploymentManager action
function_mufel_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_mufel_2025-12-17 status is now torndown due to DeploymentManager action