developer_uid: chai_evaluation_service
submission_id: function_nahef_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T04:51:21+00:00
num_battles: 7447
num_wins: 3706
celo_rating: 1291.83
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.4976500604270176
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
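The formatter entry above is a set of string templates. As a rough illustration of how they might combine into one model input, the sketch below fills each template with Python's `str.format`. The assembly order (memory, prompt, conversation turns, response header), the `build_prompt` helper, and the sample names are assumptions made for illustration; the record does not show how the service actually assembles or truncates the text.

```python
# Templates copied from the formatter entry in the record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs where speaker is
    "bot" or "user". Token truncation (max_input_tokens) is not modeled.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    # Sample values are invented for the demonstration.
    print(build_prompt("You are Richard.", "A casual chat.",
                       [("user", "Hi"), ("bot", "Hello")], "Richard", "Sam"))
```

Because the response template leaves the bot's line open and `stopping_words` is `['\n']`, generation would plausibly end at the first newline, yielding a single chat line of at most 64 output tokens.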
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.536957025527954s
Received healthy response to inference request in 1.8323421478271484s
Received healthy response to inference request in 2.258545160293579s
Received healthy response to inference request in 3.526794195175171s
Received healthy response to inference request in 4.320467710494995s
Received healthy response to inference request in 2.1641125679016113s
Received healthy response to inference request in 7.249154329299927s
Received healthy response to inference request in 3.0709328651428223s
Received healthy response to inference request in 1.9665358066558838s
Received healthy response to inference request in 1.756026029586792s
10 requests
0 failed requests
5th percentile: 1.7903682827949523
10th percentile: 1.8247105360031128
20th percentile: 1.9396970748901368
30th percentile: 2.104839539527893
40th percentile: 2.220772123336792
50th percentile: 2.3977510929107666
60th percentile: 2.750547361373901
70th percentile: 3.2076912641525266
80th percentile: 3.685528898239136
90th percentile: 4.613336372375487
95th percentile: 5.931245350837704
99th percentile: 6.985572533607483
mean time: 3.0681867837905883
Pipeline stage StressChecker completed in 32.06s
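The StressChecker percentiles and mean above can be reproduced from the ten logged latencies with linear interpolation between the sorted samples. The sketch below shows that calculation using only the standard library. The `percentile` helper is hypothetical; that the service uses this exact interpolation method is an assumption, although the results agree with the logged figures.

```python
# Latencies (seconds) copied from the ten "Received healthy response" lines.
LATENCIES = [
    2.536957025527954, 1.8323421478271484, 2.258545160293579,
    3.526794195175171, 4.320467710494995, 2.1641125679016113,
    7.249154329299927, 3.0709328651428223, 1.9665358066558838,
    1.756026029586792,
]


def percentile(sorted_values, q):
    """Return the q-th percentile (0-100) by linear interpolation.

    Hypothetical helper: places q at fractional rank (n - 1) * q / 100
    in the sorted list and interpolates between the two neighbours.
    """
    rank = (len(sorted_values) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = rank - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * fraction


if __name__ == "__main__":
    ordered = sorted(LATENCIES)
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(ordered, q)}")
    print(f"mean time: {sum(ordered) / len(ordered)}")
```

With only ten samples, the upper percentiles are dominated by the single 7.25 s response, so the 95th and 99th values say more about that one request than about steady-state latency.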
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_nahef_2025-12-17 status is now deployed due to DeploymentManager action
function_nahef_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_nahef_2025-12-17 status is now torndown due to DeploymentManager action