developer_uid: chai_evaluation_service
submission_id: function_liduf_2025-12-14
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-17T23:21:17+00:00
num_battles: 16546
num_wins: 8274
celo_rating: 1297.39
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.5000604375679922
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.6643497943878174s
Received healthy response to inference request in 2.5223889350891113s
Received healthy response to inference request in 2.911731004714966s
Received healthy response to inference request in 4.690131664276123s
Received healthy response to inference request in 2.933948516845703s
Received healthy response to inference request in 3.499824047088623s
Received healthy response to inference request in 2.9945006370544434s
Received healthy response to inference request in 3.8931097984313965s
Received healthy response to inference request in 1.702530860900879s
Received healthy response to inference request in 2.5932188034057617s
10 requests
0 failed requests
5th percentile: 2.0714669942855837
10th percentile: 2.440403127670288
20th percentile: 2.5790528297424316
30th percentile: 2.6430104970932007
40th percentile: 2.8127785205841063
50th percentile: 2.9228397607803345
60th percentile: 2.958169364929199
70th percentile: 3.1460976600646973
80th percentile: 3.5784811973571777
90th percentile: 3.972811985015869
95th percentile: 4.331471824645996
99th percentile: 4.618399696350098
mean time: 3.0405734062194822
Pipeline stage StressChecker completed in 31.95s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.02s
Shutdown handler de-registered
function_liduf_2025-12-14 status is now deployed due to DeploymentManager action
function_liduf_2025-12-14 status is now inactive due to auto deactivation removed underperforming models
function_liduf_2025-12-14 status is now torndown due to DeploymentManager action