developer_uid: chai_evaluation_service
submission_id: function_gosaf_2025-12-14
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-17T19:06:30+00:00
num_battles: 10723
num_wins: 5290
celo_rating: 1256.39
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.4933320899002145
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.906987428665161s
Received healthy response to inference request in 2.260571002960205s
Received healthy response to inference request in 3.2560315132141113s
Received healthy response to inference request in 3.4352004528045654s
Received healthy response to inference request in 4.175747871398926s
Received healthy response to inference request in 3.4049415588378906s
Received healthy response to inference request in 2.5226449966430664s
Received healthy response to inference request in 2.46868634223938s
Received healthy response to inference request in 3.5339179039001465s
Received healthy response to inference request in 5.045316457748413s
10 requests
0 failed requests
5th percentile: 2.354222905635834
10th percentile: 2.4478748083114623
20th percentile: 2.511853265762329
30th percentile: 2.7916846990585324
40th percentile: 3.116413879394531
50th percentile: 3.330486536026001
60th percentile: 3.4170451164245605
70th percentile: 3.46481568813324
80th percentile: 3.6622838973999023
90th percentile: 4.2627047300338745
95th percentile: 4.654010593891143
99th percentile: 4.96705528497696
mean time: 3.3010045528411864
Pipeline stage StressChecker completed in 34.42s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.56s
Shutdown handler de-registered
function_gosaf_2025-12-14 status is now deployed due to DeploymentManager action
function_gosaf_2025-12-14 status is now inactive due to auto deactivation removed underperforming models
function_gosaf_2025-12-14 status is now torndown due to DeploymentManager action