developer_uid: chai_evaluation_service
submission_id: function_sifut_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T09:26:30+00:00
num_battles: 7620
num_wins: 3800
celo_rating: 1292.39
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.49868766404199477
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.1282546520233154s
Received healthy response to inference request in 2.3377139568328857s
Received healthy response to inference request in 1.7849698066711426s
Received healthy response to inference request in 1.6959388256072998s
Received healthy response to inference request in 2.6421735286712646s
Received healthy response to inference request in 1.9603028297424316s
Received healthy response to inference request in 2.240110397338867s
Received healthy response to inference request in 1.9050438404083252s
Received healthy response to inference request in 2.711181640625s
Received healthy response to inference request in 2.005596876144409s
10 requests
0 failed requests
5th percentile: 1.736002767086029
10th percentile: 1.7760667085647583
20th percentile: 1.8810290336608886
30th percentile: 1.9437251329421996
40th percentile: 1.9874792575836182
50th percentile: 2.0669257640838623
60th percentile: 2.172996950149536
70th percentile: 2.2693914651870726
80th percentile: 2.398605871200562
90th percentile: 2.649074339866638
95th percentile: 2.680127990245819
99th percentile: 2.704970910549164
mean time: 2.1411286354064942
Pipeline stage StressChecker completed in 23.81s
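The percentile and mean figures reported by StressChecker appear consistent with linear interpolation over the ten sorted latencies. The sketch below recomputes them from the logged response times. It assumes that interpolation method, which the log does not state, and the `percentile` helper is a hypothetical name for illustration, not part of the pipeline.

```python
import math

# Latencies in seconds, copied from the ten "Received healthy response" lines above.
latencies = [
    2.1282546520233154,
    2.3377139568328857,
    1.7849698066711426,
    1.6959388256072998,
    2.6421735286712646,
    1.9603028297424316,
    2.240110397338867,
    1.9050438404083252,
    2.711181640625,
    2.005596876144409,
]


def percentile(values, q):
    """Return the q-th percentile (0..100) using linear interpolation.

    Assumption: the pipeline interpolates between the two nearest ranks;
    this helper is illustrative only.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0   # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

Under that assumption the printed values should agree with the logged ones up to floating-point rounding.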
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.40s
Shutdown handler de-registered
function_sifut_2025-12-16 status is now deployed due to DeploymentManager action
function_sifut_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_sifut_2025-12-16 status is now torndown due to DeploymentManager action