developer_uid: chai_evaluation_service
submission_id: function_telef_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T08:21:28+00:00
num_battles: 7523
num_wins: 3875
celo_rating: 1303.59
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.5150870663299216
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.7527527809143066s
Received healthy response to inference request in 2.1017708778381348s
Received healthy response to inference request in 2.8735268115997314s
Received healthy response to inference request in 2.326996326446533s
Received healthy response to inference request in 2.0254275798797607s
Received healthy response to inference request in 2.0568251609802246s
Received healthy response to inference request in 2.6126840114593506s
Received healthy response to inference request in 2.009996175765991s
Received healthy response to inference request in 2.862891435623169s
Received healthy response to inference request in 2.0305488109588623s
10 requests
0 failed requests
5th percentile: 1.8685123085975648
10th percentile: 1.9842718362808227
20th percentile: 2.0223412990570067
30th percentile: 2.029012441635132
40th percentile: 2.0463146209716796
50th percentile: 2.0792980194091797
60th percentile: 2.191861057281494
70th percentile: 2.4127026319503786
80th percentile: 2.662725496292114
90th percentile: 2.8639549732208254
95th percentile: 2.8687408924102784
99th percentile: 2.8725696277618407
mean time: 2.2653419971466064
Pipeline stage StressChecker completed in 24.06s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.63s
Shutdown handler de-registered
function_telef_2025-12-17 status is now deployed due to DeploymentManager action
function_telef_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_telef_2025-12-17 status is now torndown due to DeploymentManager action