developer_uid: chai_evaluation_service
submission_id: function_nasot_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T12:51:16+00:00
num_battles: 9815
num_wins: 4955
celo_rating: 1296.47
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.5048395313295976
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.517906904220581s
Received healthy response to inference request in 6.868978500366211s
Received healthy response to inference request in 4.276037931442261s
Received healthy response to inference request in 3.158649206161499s
Received healthy response to inference request in 5.224266052246094s
Received healthy response to inference request in 4.83803129196167s
Received healthy response to inference request in 4.754601001739502s
Received healthy response to inference request in 3.897547483444214s
Received healthy response to inference request in 3.9272890090942383s
Received healthy response to inference request in 3.8914101123809814s
10 requests
0 failed requests
5th percentile: 3.3203151702880858
10th percentile: 3.481981134414673
20th percentile: 3.8167094707489015
30th percentile: 3.8957062721252442
40th percentile: 3.9153923988342285
50th percentile: 4.1016634702682495
60th percentile: 4.467463159561157
70th percentile: 4.779630088806153
80th percentile: 4.9152782440185545
90th percentile: 5.388737297058105
95th percentile: 6.128857898712156
99th percentile: 6.7209543800354
mean time: 4.435471749305725
Pipeline stage StressChecker completed in 45.77s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_nasot_2025-12-16 status is now deployed due to DeploymentManager action
function_nasot_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_nasot_2025-12-16 status is now torndown due to DeploymentManager action