developer_uid: chai_evaluation_service
submission_id: function_fusen_2025-12-15
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-18T01:21:26+00:00
num_battles: 9289
num_wins: 4627
celo_rating: 1291.91
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-17
win_ratio: 0.4981160512434062
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
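The formatter entry above can be read as a set of string templates that are joined into one prompt. The following is a minimal sketch of that assembly, not the service's actual code. The function name `build_prompt`, the `turns` structure, and the ordering (memory, then prompt, then chat turns, then the response header) are assumptions made for illustration. Truncation to `max_input_tokens` is not modeled.

```python
# Templates copied from the formatter entry in this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, bot_name, user_name, turns):
    """Join the templates into one prompt string (hypothetical helper).

    turns is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The ordering used here is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    print(build_prompt("persona text", "scenario text", "richard", "Ann",
                       [("user", "hi"), ("bot", "hello")]))
```

Because `stopping_words` is `['\n']`, generation would stop at the first newline, which is consistent with each template line holding exactly one message.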
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.3785860538482666s
Received healthy response to inference request in 3.343716859817505s
Received healthy response to inference request in 2.533068895339966s
Received healthy response to inference request in 2.7262260913848877s
Received healthy response to inference request in 3.318842887878418s
Received healthy response to inference request in 3.8296396732330322s
Received healthy response to inference request in 1.9290530681610107s
Received healthy response to inference request in 3.04646372795105s
Received healthy response to inference request in 3.002997875213623s
Received healthy response to inference request in 1.9696831703186035s
10 requests
0 failed requests
5th percentile: 1.9473366141319275
10th percentile: 1.9656201601028442
20th percentile: 2.296805477142334
30th percentile: 2.486724042892456
40th percentile: 2.648963212966919
50th percentile: 2.8646119832992554
60th percentile: 3.0203842163085937
70th percentile: 3.1281774759292604
80th percentile: 3.3238176822662355
90th percentile: 3.3923091411590574
95th percentile: 3.6109744071960446
99th percentile: 3.785906620025635
mean time: 2.8078278303146362
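The percentile and mean figures above are consistent with linear interpolation between the sorted response times. The sketch below reproduces them from the ten logged latencies. The helper name `percentile` is hypothetical, and the interpolation method is inferred from the numbers rather than confirmed by the log.

```python
import math

# Response times (seconds) copied from the ten StressChecker log lines.
latencies = [
    2.3785860538482666, 3.343716859817505, 2.533068895339966,
    2.7262260913848877, 3.318842887878418, 3.8296396732330322,
    1.9290530681610107, 3.04646372795105, 3.002997875213623,
    1.9696831703186035,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only ten samples, the 95th and 99th percentiles are interpolated between the two slowest requests, so they say little about tail latency.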
Pipeline stage StressChecker completed in 29.47s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.84s
Shutdown handler de-registered
function_fusen_2025-12-15 status is now deployed due to DeploymentManager action
function_fusen_2025-12-15 status is now inactive due to auto deactivation removed underperforming models
function_fusen_2025-12-15 status is now torndown due to DeploymentManager action