developer_uid: chai_evaluation_service
submission_id: function_nohom_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T10:51:24+00:00
num_battles: 8416
num_wins: 4210
celo_rating: 1293.28
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.5002376425855514
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.528146982192993s
Received healthy response to inference request in 3.4813666343688965s
Received healthy response to inference request in 3.4748170375823975s
Received healthy response to inference request in 3.7551257610321045s
Received healthy response to inference request in 3.2504706382751465s
Received healthy response to inference request in 2.3987038135528564s
Received healthy response to inference request in 2.9921116828918457s
Received healthy response to inference request in 4.125519037246704s
Received healthy response to inference request in 3.3449652194976807s
Received healthy response to inference request in 3.5440545082092285s
10 requests
0 failed requests
5th percentile: 2.6657373547554015
10th percentile: 2.9327708959579466
20th percentile: 3.1987988471984865
30th percentile: 3.3166168451309206
40th percentile: 3.422876310348511
50th percentile: 3.478091835975647
60th percentile: 3.5000787734985352
70th percentile: 3.532919239997864
80th percentile: 3.586268758773804
90th percentile: 3.7921650886535643
95th percentile: 3.9588420629501337
99th percentile: 4.09218364238739
mean time: 3.3895281314849854
Pipeline stage StressChecker completed in 35.15s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.72s
Shutdown handler de-registered
function_nohom_2025-12-16 status is now deployed due to DeploymentManager action
function_nohom_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_nohom_2025-12-16 status is now torndown due to DeploymentManager action