developer_uid: chai_evaluation_service
submission_id: function_pagit_2025-12-18
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-21T09:51:17+00:00
num_battles: 7804
num_wins: 3919
celo_rating: 1294.97
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-21
win_ratio: 0.5021783700666325
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
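The formatter entry above describes how a conversation is rendered into a single prompt string. The sketch below shows one plausible way the templates could be combined; the assembly order, the `build_prompt` helper, the `turns` structure, and the sample persona text are assumptions for illustration and are not taken from the service itself. Truncation to `max_input_tokens` is not modelled.

```python
# Templates copied from the formatter entry in the metadata above.
formatter = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the rendered templates in order.

    `turns` is an assumed structure: a list of (speaker, message) pairs,
    where speaker is either "bot" or "user".
    """
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues
    # the bot's next line; generation stops at the first "\n" stopping word.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Illustrative inputs only; none of these strings come from the log.
example = build_prompt(
    memory="richard is a friendly assistant.",
    prompt="A casual chat.",
    turns=[("user", "Hi there"), ("bot", "Hello!"), ("user", "How are you?")],
    bot_name="richard",
    user_name="User",
)
print(example)
```

Because the response template leaves the prompt ending on the bot's name and a colon, the `'\n'` stopping word in generation_params limits each completion to a single chat line.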
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.0498733520507812s
Received healthy response to inference request in 1.8900516033172607s
Received healthy response to inference request in 2.4876859188079834s
Received healthy response to inference request in 2.3309061527252197s
Received healthy response to inference request in 2.0714218616485596s
Received healthy response to inference request in 1.8787462711334229s
Received healthy response to inference request in 2.9114232063293457s
Received healthy response to inference request in 2.5624897480010986s
Received healthy response to inference request in 2.112622022628784s
Received healthy response to inference request in 1.827164649963379s
10 requests
0 failed requests
5th percentile: 1.8503763794898986
10th percentile: 1.8735881090164184
20th percentile: 1.8877905368804933
30th percentile: 2.001926827430725
40th percentile: 2.062802457809448
50th percentile: 2.092021942138672
60th percentile: 2.1999356746673584
70th percentile: 2.3779400825500487
80th percentile: 2.5026466846466064
90th percentile: 2.597383093833923
95th percentile: 2.754403150081634
99th percentile: 2.8800191950798033
mean time: 2.2122384786605833
Pipeline stage StressChecker completed in 24.42s
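The StressChecker summary above can be reproduced from the ten logged response times. The sketch below assumes the percentiles use linear interpolation between order statistics (the default behaviour of `numpy.percentile`); that assumption is consistent with the logged values, but the checker's actual implementation is not shown in the log. The `percentile` helper is hypothetical.

```python
import math

# Response times in seconds, copied from the ten log lines above.
latencies = [
    2.0498733520507812,
    1.8900516033172607,
    2.4876859188079834,
    2.3309061527252197,
    2.0714218616485596,
    1.8787462711334229,
    2.9114232063293457,
    2.5624897480010986,
    2.112622022628784,
    1.827164649963379,
]


def percentile(values, q):
    """Return the q-th percentile using linear interpolation.

    The rank is (n - 1) * q / 100; the result is interpolated between
    the two nearest sorted values.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only ten samples, the high percentiles are dominated by the single slowest request (about 2.91 s), so the 95th and 99th percentile figures are interpolations rather than independent measurements.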
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.63s
Shutdown handler de-registered
function_pagit_2025-12-18 status is now deployed due to DeploymentManager action
function_pagit_2025-12-18 status is now inactive due to auto deactivation removed underperforming models
function_pagit_2025-12-18 status is now torndown due to DeploymentManager action