developer_uid: chai_evaluation_service
submission_id: function_jukat_2025-12-18
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-21T08:21:21+00:00
num_battles: 7810
num_wins: 3856
celo_rating: 1288.75
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-21
win_ratio: 0.4937259923175416
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
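
The `formatter` templates above can be read as the pieces of a single model input. The following is a minimal sketch of how they might be combined, not the service's actual code: the `build_prompt` helper, the assembly order (memory, prompt, chat turns, response header), and all sample values are assumptions made for illustration. Truncation to `max_input_tokens` is not modelled.

```python
# Template strings copied from the `formatter` field above.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order."""
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Illustrative values only; none of these come from the record.
example = build_prompt(
    memory="richard is a friendly assistant.",
    prompt="A casual chat.",
    turns=[("user", "Hi!"), ("bot", "Hello there.")],
    bot_name="richard",
    user_name="User",
)
print(example)
```

Because `stopping_words` is `['\n']` and the response template ends at `{bot_name}:`, generation under this layout would presumably stop at the end of a single bot line.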
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.678051233291626s
Received healthy response to inference request in 2.755802631378174s
Received healthy response to inference request in 3.1320114135742188s
Received healthy response to inference request in 1.9201257228851318s
Received healthy response to inference request in 2.0363399982452393s
Received healthy response to inference request in 3.005628824234009s
Received healthy response to inference request in 3.190783739089966s
Received healthy response to inference request in 2.1860527992248535s
Received healthy response to inference request in 3.8895959854125977s
Received healthy response to inference request in 2.9770331382751465s
10 requests
0 failed requests
5th percentile: 1.97242214679718
10th percentile: 2.0247185707092283
20th percentile: 2.156110239028931
30th percentile: 2.530451703071594
40th percentile: 2.724702072143555
50th percentile: 2.86641788482666
60th percentile: 2.9884714126586913
70th percentile: 3.0435436010360717
80th percentile: 3.1437658786773683
90th percentile: 3.260664963722229
95th percentile: 3.5751304745674126
99th percentile: 3.826702883243561
mean time: 2.777142548561096
Pipeline stage StressChecker completed in 29.15s
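
The StressChecker summary above can be reproduced from the ten logged response times. The sketch below assumes percentiles are computed by linear interpolation between sorted samples, which matches the logged figures; the `percentile` helper is written here for illustration and is not taken from the service's code.

```python
import math

# Response times in seconds, copied from the ten "healthy response" log lines.
latencies = [
    2.678051233291626,
    2.755802631378174,
    3.1320114135742188,
    1.9201257228851318,
    2.0363399982452393,
    3.005628824234009,
    3.190783739089966,
    2.1860527992248535,
    3.8895959854125977,
    2.9770331382751465,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only 10 samples, the upper percentiles are dominated by the single slowest request (about 3.89 s), so the 95th and 99th values are interpolations rather than observed latencies.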
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.61s
Shutdown handler de-registered
function_jukat_2025-12-18 status is now deployed due to DeploymentManager action
function_jukat_2025-12-18 status is now inactive due to auto deactivation removed underperforming models
function_jukat_2025-12-18 status is now torndown due to DeploymentManager action