developer_uid: chai_evaluation_service
submission_id: function_dobek_2025-12-14
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-14T05:27:37+00:00
num_battles: 7417
num_wins: 3752
celo_rating: 1256.34
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.5058649049480922
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2963614463806152s
Received healthy response to inference request in 4.194751024246216s
Received healthy response to inference request in 1.8653504848480225s
Received healthy response to inference request in 2.4188592433929443s
Received healthy response to inference request in 2.963801145553589s
Received healthy response to inference request in 3.5816333293914795s
Received healthy response to inference request in 2.9063589572906494s
Received healthy response to inference request in 5.069583892822266s
Received healthy response to inference request in 2.5260636806488037s
Received healthy response to inference request in 4.331966876983643s
10 requests
0 failed requests
5th percentile: 2.059305417537689
10th percentile: 2.253260350227356
20th percentile: 2.3943596839904786
30th percentile: 2.493902349472046
40th percentile: 2.754240846633911
50th percentile: 2.935080051422119
60th percentile: 3.2109340190887448
70th percentile: 3.7655686378479003
80th percentile: 4.222194194793701
90th percentile: 4.405728578567505
95th percentile: 4.737656235694884
99th percentile: 5.00319836139679
mean time: 3.215473008155823
Pipeline stage StressChecker completed in 33.49s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.98s
Shutdown handler de-registered
function_dobek_2025-12-14 status is now deployed due to DeploymentManager action
function_dobek_2025-12-14 status is now inactive due to auto deactivation removed underperforming models