developer_uid: chai_evaluation_service
submission_id: function_nifit_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T22:46:28+00:00
num_battles: 6131
num_wins: 2990
celo_rating: 1284.71
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.4876855325395531
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
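The `formatter` templates above can be read as pieces of one prompt string. Below is a minimal sketch of how they might be combined. The assembly order (memory, prompt, chat turns, response cue), the `build_prompt` helper, and the `turns` structure are assumptions for illustration, not the service's confirmed implementation. Truncation to `max_input_tokens` and the `truncate_by_message` flag are not modelled.

```python
# Templates copied from the `formatter` field above.
formatter = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". This shape is an assumption for this sketch.
    """
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response cue ends with "{bot_name}:" so the model continues as the bot.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    memory="A friendly guide.",
    prompt="Greet the user.",
    turns=[("user", "Hi"), ("bot", "Hello!")],
    bot_name="richard",
    user_name="Sam",
)
print(example)
```

Because `stopping_words` contains `'\n'`, generation would stop at the first newline, which fits a format where each chat turn occupies a single line.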
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.1920077800750732s
Received healthy response to inference request in 2.7564451694488525s
Received healthy response to inference request in 2.158904790878296s
Received healthy response to inference request in 2.3081588745117188s
Received healthy response to inference request in 2.488760232925415s
Received healthy response to inference request in 2.1477372646331787s
Received healthy response to inference request in 2.1906652450561523s
Received healthy response to inference request in 2.9636058807373047s
Received healthy response to inference request in 3.6988279819488525s
Received healthy response to inference request in 2.9537882804870605s
10 requests
0 failed requests
5th percentile: 2.1527626514434814
10th percentile: 2.157788038253784
20th percentile: 2.184313154220581
30th percentile: 2.272910785675049
40th percentile: 2.4165196895599363
50th percentile: 2.622602701187134
60th percentile: 2.8353824138641355
70th percentile: 2.956733560562134
80th percentile: 3.0092862606048585
90th percentile: 3.242689800262451
95th percentile: 3.4707588911056515
99th percentile: 3.6532141637802127
mean time: 2.6858901500701906
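The percentile and mean figures above are consistent with linear interpolation between the sorted latencies of the ten requests. The sketch below recomputes them from the logged response times. The `percentile` helper is written for illustration, and the interpolation method is inferred from the numbers rather than confirmed by the StressChecker source.

```python
import math

# Response times (seconds) copied from the ten "healthy response" log lines.
latencies = [
    3.1920077800750732,
    2.7564451694488525,
    2.158904790878296,
    2.3081588745117188,
    2.488760232925415,
    2.1477372646331787,
    2.1906652450561523,
    2.9636058807373047,
    3.6988279819488525,
    2.9537882804870605,
]


def percentile(values, q):
    """Return the q-th percentile using linear interpolation.

    The fractional rank (n - 1) * q / 100 is split into its two
    neighbouring sorted samples, which are then blended linearly.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for q in (5, 50, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only ten samples, the high percentiles are interpolated almost entirely from the two slowest requests, so the 99th percentile says little beyond the maximum observed latency.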
Pipeline stage StressChecker completed in 28.16s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_nifit_2025-12-17 status is now deployed due to DeploymentManager action
function_nifit_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_nifit_2025-12-17 status is now torndown due to DeploymentManager action