developer_uid: chai_evaluation_service
submission_id: function_sonis_2025-12-14
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-14T03:57:23+00:00
num_battles: 5758
num_wins: 2934
celo_rating: 1256.33
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.5095519277526919
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.9021525382995605s
Received healthy response to inference request in 2.3088412284851074s
Received healthy response to inference request in 2.913180112838745s
Received healthy response to inference request in 2.07350492477417s
Received healthy response to inference request in 2.651869297027588s
Received healthy response to inference request in 2.152080535888672s
Received healthy response to inference request in 2.3891654014587402s
Received healthy response to inference request in 3.038074254989624s
Received healthy response to inference request in 2.0675082206726074s
Received healthy response to inference request in 1.6278786659240723s
10 requests
0 failed requests
5th percentile: 1.751301908493042
10th percentile: 1.8747251510620118
20th percentile: 2.034437084197998
30th percentile: 2.0717059135437013
40th percentile: 2.120650291442871
50th percentile: 2.2304608821868896
60th percentile: 2.3409708976745605
70th percentile: 2.4679765701293945
80th percentile: 2.7041314601898194
90th percentile: 2.925669527053833
95th percentile: 2.9818718910217283
99th percentile: 3.026833782196045
mean time: 2.3124255180358886
Pipeline stage StressChecker completed in 24.31s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_sonis_2025-12-14 status is now deployed due to DeploymentManager action
function_sonis_2025-12-14 status is now inactive due to auto deactivation removed underperforming models