developer_uid: chai_evaluation_service
submission_id: function_mofef_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T16:01:08+00:00
num_battles: 7930
num_wins: 3934
celo_rating: 1290.5
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.4960907944514502
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.9246461391448975s
Received healthy response to inference request in 2.155428171157837s
Received healthy response to inference request in 1.7629597187042236s
Received healthy response to inference request in 1.821556806564331s
Received healthy response to inference request in 2.334177255630493s
Received healthy response to inference request in 2.434288263320923s
Received healthy response to inference request in 3.294240951538086s
Received healthy response to inference request in 2.8312458992004395s
Received healthy response to inference request in 2.2155165672302246s
Received healthy response to inference request in 1.728694200515747s
10 requests
0 failed requests
5th percentile: 1.7441136837005615
10th percentile: 1.759533166885376
20th percentile: 1.8098373889923096
30th percentile: 2.055266761779785
40th percentile: 2.1914812088012696
50th percentile: 2.274846911430359
60th percentile: 2.374221658706665
70th percentile: 2.5533755540847776
80th percentile: 2.8499259471893312
90th percentile: 2.961605620384216
95th percentile: 3.1279232859611508
99th percentile: 3.260977418422699
mean time: 2.35027539730072
Pipeline stage StressChecker completed in 25.03s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.62s
Shutdown handler de-registered
function_mofef_2025-12-17 status is now deployed due to DeploymentManager action
function_mofef_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_mofef_2025-12-17 status is now torndown due to DeploymentManager action