developer_uid: chai_evaluation_service
submission_id: function_halal_2025-12-18
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-21T02:51:14+00:00
num_battles: 7254
num_wins: 3659
celo_rating: 1296.49
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.504411359250069
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
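
The formatter above can be read as a set of string templates that are filled in and concatenated into one prompt. The sketch below is only an illustration of that reading: the function name `build_prompt`, the argument names, the sample conversation, and the assembly order (memory, prompt, chat turns, response header) are assumptions and are not confirmed by this record. It also ignores `truncate_by_message` and `max_input_tokens`.

```python
# Template strings copied from the formatter field above.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: fill each template and join the pieces.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user". The ordering used here is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues the line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    memory="Aria is a friendly guide.",
    prompt="A short chat in a library.",
    turns=[("bot", "Hello!"), ("user", "Hi, can you help me?")],
    bot_name="Aria",
    user_name="Sam",
)
print(example)
```

Because the response template leaves the bot's line open, the `'\n'` stopping word in `generation_params` would end generation after a single line of reply.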
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.412083864212036s
Received healthy response to inference request in 2.0559885501861572s
Received healthy response to inference request in 2.7266769409179688s
Received healthy response to inference request in 1.8992924690246582s
Received healthy response to inference request in 3.012763738632202s
Received healthy response to inference request in 1.8552124500274658s
Received healthy response to inference request in 1.7435767650604248s
Received healthy response to inference request in 2.7523727416992188s
Received healthy response to inference request in 2.4885623455047607s
Received healthy response to inference request in 3.6792802810668945s
10 requests
0 failed requests
5th percentile: 1.7938128232955932
10th percentile: 1.8440488815307616
20th percentile: 1.8904764652252197
30th percentile: 2.0089797258377073
40th percentile: 2.3155328273773192
50th percentile: 2.6076196432113647
60th percentile: 2.736955261230469
70th percentile: 2.8304900407791136
80th percentile: 3.092627763748169
90th percentile: 3.4388035058975217
95th percentile: 3.559041893482208
99th percentile: 3.6552326035499574
mean time: 2.562581014633179
Pipeline stage StressChecker completed in 27.34s
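
The percentile and mean figures reported by StressChecker are consistent with linear interpolation between closest ranks over the ten response times logged above. The sketch below reproduces them using only the standard library; the helper name `percentile` is hypothetical, and the actual implementation used by the pipeline is not shown in this log.

```python
import math

# Response times in seconds, copied from the ten "healthy response" lines above.
latencies = [
    3.412083864212036,
    2.0559885501861572,
    2.7266769409179688,
    1.8992924690246582,
    3.012763738632202,
    1.8552124500274658,
    1.7435767650604248,
    2.7523727416992188,
    2.4885623455047607,
    3.6792802810668945,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) by linear interpolation."""
    ordered = sorted(values)
    # Fractional position of the percentile within the sorted list.
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for q in (5, 50, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```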
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.62s
Shutdown handler de-registered
function_halal_2025-12-18 status is now deployed due to DeploymentManager action
function_halal_2025-12-18 status is now inactive due to auto deactivation removed underperforming models
function_halal_2025-12-18 status is now torndown due to DeploymentManager action