developer_uid: chai_evaluation_service
submission_id: function_sadom_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-13T15:47:20+00:00
num_battles: 5659
num_wins: 2829
celo_rating: 1256.3
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.49991164516699066
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
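
The `formatter` templates above can be read as building blocks for a single input string. The following is a minimal sketch of one plausible way they might be combined. The assembly order (memory, prompt, chat turns, response cue), the `build_prompt` helper, and the sample names are assumptions for illustration. The log does not show how the service actually joins or truncates them, and the `max_input_tokens` limit is not modeled here.

```python
# Templates copied from the `formatter` field above.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates into one input string.

    Assumed order: memory, prompt, chat turns, then the response cue.
    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response cue ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    # Sample values are made up for illustration only.
    print(
        build_prompt(
            memory="Richard is a friendly assistant.",
            prompt="A casual chat.",
            turns=[("user", "hi"), ("bot", "hello")],
            bot_name="richard",
            user_name="User",
        )
    )
```

Because `stopping_words` in `generation_params` is `['\n']`, a completion that continues this string would presumably end at the first newline, which fits the one-message-per-line layout of the turn templates.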
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.1636407375335693s
Received healthy response to inference request in 2.9101486206054688s
Received healthy response to inference request in 1.7458100318908691s
Received healthy response to inference request in 5.14901328086853s
Received healthy response to inference request in 3.131242275238037s
Received healthy response to inference request in 1.9537131786346436s
Received healthy response to inference request in 2.346961498260498s
Received healthy response to inference request in 3.6087961196899414s
Received healthy response to inference request in 3.4912402629852295s
Received healthy response to inference request in 2.6659457683563232s
10 requests
0 failed requests
5th percentile: 1.8393664479255676
10th percentile: 1.932922863960266
20th percentile: 2.121655225753784
30th percentile: 2.2919652700424193
40th percentile: 2.538352060317993
50th percentile: 2.788047194480896
60th percentile: 2.998586082458496
70th percentile: 3.2392416715621946
80th percentile: 3.514751434326172
90th percentile: 3.7628178358078
95th percentile: 4.455915558338163
99th percentile: 5.010393736362458
mean time: 2.916651177406311
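
The percentile and mean figures above can be reproduced from the ten logged latencies. The sketch below assumes the StressChecker uses linear interpolation between sorted samples (the same rule as NumPy's default percentile method). The log does not state the method, but this assumption matches the reported values. The `percentile` and `mean` helpers are illustrative names, not part of the pipeline.

```python
import math

# Latencies in seconds, copied from the ten
# "Received healthy response" lines above.
latencies = [
    2.1636407375335693,
    2.9101486206054688,
    1.7458100318908691,
    5.14901328086853,
    3.131242275238037,
    1.9537131786346436,
    2.346961498260498,
    3.6087961196899414,
    3.4912402629852295,
    2.6659457683563232,
]


def percentile(values, q):
    """Return the q-th percentile (0 <= q <= 100) by linear interpolation."""
    ordered = sorted(values)
    # Fractional position of the percentile within the sorted samples.
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def mean(values):
    """Arithmetic mean of the samples."""
    return sum(values) / len(values)


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {mean(latencies)}")
```

With only ten samples, the upper percentiles are dominated by the single 5.15 s request, so the 95th and 99th percentile figures say little about tail latency under real load.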
Pipeline stage StressChecker completed in 30.64s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.89s
Shutdown handler de-registered
function_sadom_2025-12-13 status is now deployed due to DeploymentManager action
function_sadom_2025-12-13 status is now inactive due to auto deactivation removed underperforming models