developer_uid: chai_evaluation_service
submission_id: function_fojel_2025-12-18
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-21T04:51:26+00:00
num_battles: 7234
num_wins: 3608
celo_rating: 1296.8
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-17
win_ratio: 0.49875587503455904
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.8904736042022705s
Received healthy response to inference request in 2.250647783279419s
Received healthy response to inference request in 1.7381680011749268s
Received healthy response to inference request in 4.104508399963379s
Received healthy response to inference request in 2.6102564334869385s
Received healthy response to inference request in 1.9648690223693848s
Received healthy response to inference request in 1.7286295890808105s
Received healthy response to inference request in 2.445688009262085s
Received healthy response to inference request in 3.716935157775879s
Received healthy response to inference request in 1.689192771911621s
10 requests
0 failed requests
5th percentile: 1.7069393396377563
10th percentile: 1.7246859073638916
20th percentile: 1.7362603187561034
30th percentile: 1.8968587160110473
40th percentile: 2.1363362789154055
50th percentile: 2.348167896270752
60th percentile: 2.5115153789520264
70th percentile: 2.6943215847015383
80th percentile: 3.0557659149169925
90th percentile: 3.755692481994629
95th percentile: 3.9301004409790035
99th percentile: 4.069626808166504
mean time: 2.5139368772506714
Pipeline stage StressChecker completed in 26.48s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_fojel_2025-12-18 status is now deployed due to DeploymentManager action
function_fojel_2025-12-18 status is now inactive due to auto deactivation removed underperforming models
function_fojel_2025-12-18 status is now torndown due to DeploymentManager action