developer_uid: chai_evaluation_service
submission_id: function_palas_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T16:21:24+00:00
num_battles: 11011
num_wins: 5426
celo_rating: 1288.09
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.49277994732540187
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
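The formatter entry above only lists templates; it does not say how they are combined. The following is a minimal sketch of one plausible assembly order (memory, then prompt, then chat turns, then the response header). The function name `build_prompt`, the `turns` structure, and the ordering are assumptions for illustration and are not taken from the service's code. Truncation (`max_input_tokens`, `truncate_by_message`) is not modelled.

```python
# Templates copied from the formatter entry in this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs where speaker is
    either "bot" or "user" (an illustrative convention only).
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues the line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    # Placeholder conversation, not real data from this submission.
    print(build_prompt("M", "P", [("user", "hi"), ("bot", "hello")], "Bot", "User"))
```

Because the response template leaves the bot's line open and `stopping_words` is `['\n']`, generation under these settings would stop at the end of a single line of reply.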
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.377990484237671s
Received healthy response to inference request in 2.122178792953491s
Received healthy response to inference request in 3.1670117378234863s
Received healthy response to inference request in 3.091824531555176s
Received healthy response to inference request in 2.244188070297241s
Received healthy response to inference request in 2.227062225341797s
Received healthy response to inference request in 2.4257020950317383s
Received healthy response to inference request in 2.213425874710083s
Received healthy response to inference request in 2.115072727203369s
Received healthy response to inference request in 2.444941282272339s
10 requests
0 failed requests
5th percentile: 2.118270456790924
10th percentile: 2.121468186378479
20th percentile: 2.1951764583587647
30th percentile: 2.2229713201522827
40th percentile: 2.2373377323150634
50th percentile: 2.3349450826644897
60th percentile: 2.4333977699279785
70th percentile: 2.63900625705719
80th percentile: 3.106861972808838
90th percentile: 3.188109612464905
95th percentile: 3.2830500483512877
99th percentile: 3.359002397060394
mean time: 2.542939782142639
Pipeline stage StressChecker completed in 26.83s
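The percentile and mean figures logged by StressChecker are consistent with linear interpolation between sorted samples. The sketch below recomputes them from the ten latencies in the log. The `percentile` helper is written here for illustration; the method StressChecker actually uses is not shown in the log, so this is an assumption that happens to agree with the logged values.

```python
import math

# Latencies in seconds, copied from the ten "healthy response" log lines.
latencies = [
    3.377990484237671,
    2.122178792953491,
    3.1670117378234863,
    3.091824531555176,
    2.244188070297241,
    2.227062225341797,
    2.4257020950317383,
    2.213425874710083,
    2.115072727203369,
    2.444941282272339,
]


def percentile(values, q):
    """Percentile with linear interpolation between closest ranks (q in 0..100)."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lo = math.floor(pos)
    hi = math.ceil(pos)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


mean_time = sum(latencies) / len(latencies)

if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {mean_time}")
```

The printed values should agree with the logged ones up to floating-point rounding in the last digits.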
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.56s
Shutdown handler de-registered
function_palas_2025-12-16 status is now deployed due to DeploymentManager action
function_palas_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_palas_2025-12-16 status is now torndown due to DeploymentManager action