developer_uid: chai_evaluation_service
submission_id: function_tijot_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T18:51:22+00:00
num_battles: 10949
num_wins: 5541
celo_rating: 1297.45
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.5060736140286785
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
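A minimal sketch of how the `formatter` templates above might be combined into a single model input. The assembly order (memory, then prompt, then conversation turns, then the response header) is an assumption, as are the helper name `build_prompt` and the `(speaker, message)` turn representation; token truncation (`max_input_tokens`, `truncate_by_message`) is not modelled here.

```python
# Template strings copied from the formatter entry in this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the rendered templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    print(build_prompt("M", "P", [("user", "hi"), ("bot", "yo")], "Bot", "User"))
```

With `stopping_words` set to `['\n']`, generation would end at the first newline, which fits a template whose final line is an open `{bot_name}:` turn.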
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.298262119293213s
Received healthy response to inference request in 2.308115243911743s
Received healthy response to inference request in 3.0196125507354736s
Received healthy response to inference request in 3.1550421714782715s
Received healthy response to inference request in 2.7943994998931885s
Received healthy response to inference request in 2.8145699501037598s
Received healthy response to inference request in 4.595568656921387s
Received healthy response to inference request in 1.789994716644287s
Received healthy response to inference request in 3.634788751602173s
Received healthy response to inference request in 2.930408477783203s
10 requests
0 failed requests
5th percentile: 2.0187150478363036
10th percentile: 2.24743537902832
20th percentile: 2.306144618988037
30th percentile: 2.6485142230987546
40th percentile: 2.806501770019531
50th percentile: 2.8724892139434814
60th percentile: 2.9660901069641112
70th percentile: 3.060241436958313
80th percentile: 3.2509914875030517
90th percentile: 3.730866742134094
95th percentile: 4.163217699527739
99th percentile: 4.509098465442658
mean time: 2.93407621383667
Pipeline stage StressChecker completed in 30.79s
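The StressChecker percentiles and mean above are consistent with linear interpolation between order statistics over the ten logged response times. The sketch below reproduces them under that assumption; the function name `percentile` is illustrative and is not taken from the pipeline's own code.

```python
# Response times in seconds, copied from the ten "healthy response" log lines.
LATENCIES = [
    2.298262119293213,
    2.308115243911743,
    3.0196125507354736,
    3.1550421714782715,
    2.7943994998931885,
    2.8145699501037598,
    4.595568656921387,
    1.789994716644287,
    3.634788751602173,
    2.930408477783203,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation
    between the two nearest order statistics."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def mean(values):
    """Arithmetic mean of the samples."""
    return sum(values) / len(values)


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(LATENCIES, q)}")
    print(f"mean time: {mean(LATENCIES)}")
```

With only ten samples, the upper percentiles are dominated by the single slowest request (about 4.60 s), so they are a coarse indicator rather than a stable tail estimate.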
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
function_tijot_2025-12-16 status is now deployed due to DeploymentManager action
function_tijot_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_tijot_2025-12-16 status is now torndown due to DeploymentManager action