developer_uid: chai_evaluation_service
submission_id: function_fijam_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T13:31:09+00:00
num_battles: 9722
num_wins: 4952
celo_rating: 1299.88
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.5093602139477473
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.885427236557007s
Received healthy response to inference request in 2.907735586166382s
Received healthy response to inference request in 3.1611359119415283s
Received healthy response to inference request in 2.6112780570983887s
Received healthy response to inference request in 3.644218683242798s
Received healthy response to inference request in 3.13968825340271s
Received healthy response to inference request in 2.38250470161438s
Received healthy response to inference request in 4.490642786026001s
Received healthy response to inference request in 2.0154006481170654s
Received healthy response to inference request in 3.1014697551727295s
10 requests
0 failed requests
5th percentile: 2.1805974721908568
10th percentile: 2.3457942962646485
20th percentile: 2.565523386001587
30th percentile: 2.8031824827194214
40th percentile: 2.898812246322632
50th percentile: 3.0046026706695557
60th percentile: 3.1167571544647217
70th percentile: 3.1461225509643556
80th percentile: 3.2577524662017825
90th percentile: 3.7288610935211177
95th percentile: 4.109751939773559
99th percentile: 4.414464616775513
mean time: 3.033950161933899
Pipeline stage StressChecker completed in 31.64s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_fijam_2025-12-17 status is now deployed due to DeploymentManager action
function_fijam_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_fijam_2025-12-17 status is now torndown due to DeploymentManager action