developer_uid: chai_evaluation_service
submission_id: function_posit_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-13T14:17:46+00:00
num_battles: 5245
num_wins: 2657
celo_rating: 1300.44
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.5065776930409914
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.8677845001220703s
Received healthy response to inference request in 1.9796550273895264s
Received healthy response to inference request in 2.6037349700927734s
Received healthy response to inference request in 2.965477466583252s
Received healthy response to inference request in 3.527282238006592s
Received healthy response to inference request in 2.190922498703003s
Received healthy response to inference request in 1.7861058712005615s
Received healthy response to inference request in 2.6109251976013184s
Received healthy response to inference request in 2.465228319168091s
Received healthy response to inference request in 1.9992125034332275s
10 requests
0 failed requests
5th percentile: 1.8228612542152405
10th percentile: 1.8596166372299194
20th percentile: 1.9572809219360352
30th percentile: 1.9933452606201172
40th percentile: 2.1142385005950928
50th percentile: 2.328075408935547
60th percentile: 2.520630979537964
70th percentile: 2.605892038345337
80th percentile: 2.681835651397705
90th percentile: 3.021657943725586
95th percentile: 3.2744700908660884
99th percentile: 3.4767198085784914
mean time: 2.3996328592300413
Pipeline stage StressChecker completed in 25.66s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.73s
Shutdown handler de-registered
function_posit_2025-12-13 status is now deployed due to DeploymentManager action
function_posit_2025-12-13 status is now inactive due to auto deactivation removed underperforming models