developer_uid: chai_evaluation_service
submission_id: function_depak_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T09:51:17+00:00
num_battles: 8614
num_wins: 4436
celo_rating: 1303.42
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.5149756210819596
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
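The formatter entry above is a set of string templates. As a rough illustration of how they might combine into a single model input, here is a minimal sketch. The function `build_prompt`, its arguments, and the order in which the pieces are joined are assumptions for illustration, not the service's actual code. Truncation (`max_input_tokens`, `truncate_by_message`) is not modelled.

```python
# Templates copied from the formatter entry in the metadata above.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(formatter, bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates into one input string.

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The join order is an assumption.
    """
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response template leaves the bot's line open for the model to complete.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    FORMATTER,
    bot_name="richard",
    user_name="user",
    memory="M",
    prompt="P",
    turns=[("user", "hi"), ("bot", "hello")],
)
```

Because every message template ends in a newline and `stopping_words` is `['\n']`, a completion under this layout would plausibly end at the close of the bot's single line.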
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.6287708282470703s
Received healthy response to inference request in 2.290553331375122s
Received healthy response to inference request in 1.675715684890747s
Received healthy response to inference request in 2.582744598388672s
Received healthy response to inference request in 2.001768112182617s
Received healthy response to inference request in 2.3785626888275146s
Received healthy response to inference request in 3.1655468940734863s
Received healthy response to inference request in 2.3459131717681885s
Received healthy response to inference request in 2.711805820465088s
Received healthy response to inference request in 1.677994966506958s
10 requests
0 failed requests
5th percentile: 1.676741361618042
10th percentile: 1.677767038345337
20th percentile: 1.9370134830474854
30th percentile: 2.2039177656173705
40th percentile: 2.323769235610962
50th percentile: 2.3622379302978516
60th percentile: 2.4602354526519776
70th percentile: 2.5965524673461915
80th percentile: 2.645377826690674
90th percentile: 2.7571799278259275
95th percentile: 2.9613634109497067
99th percentile: 3.1247101974487306
mean time: 2.3459376096725464
Pipeline stage StressChecker completed in 24.73s
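The StressChecker percentiles and mean above can be reproduced from the ten logged response times with linear interpolation between sorted samples. The sketch below is one way to check the figures, not the checker's own implementation. The helper name `percentile` is hypothetical.

```python
import math

# Response times in seconds, copied from the ten "healthy response" log lines.
latencies = [
    2.6287708282470703, 2.290553331375122, 1.675715684890747,
    2.582744598388672, 2.001768112182617, 2.3785626888275146,
    3.1655468940734863, 2.3459131717681885, 2.711805820465088,
    1.677994966506958,
]


def percentile(values, q):
    """Percentile q (0 to 100) using linear interpolation between order statistics."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lo = math.floor(pos)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


# Same percentile levels as reported in the log.
summary = {q: percentile(latencies, q) for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99)}
mean_time = sum(latencies) / len(latencies)
```

With only ten samples, the upper percentiles are dominated by the single slowest request (about 3.17 s), so they are best read as a rough indication rather than a stable tail estimate.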
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.56s
Shutdown handler de-registered
function_depak_2025-12-16 status is now deployed due to DeploymentManager action
function_depak_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_depak_2025-12-16 status is now torndown due to DeploymentManager action