developer_uid: chai_evaluation_service
submission_id: function_jafor_2025-12-15
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-18T12:01:09+00:00
num_battles: 6967
num_wins: 3464
celo_rating: 1291.17
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-18
win_ratio: 0.4972010908568968
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.983515739440918s
Received healthy response to inference request in 2.488792657852173s
Received healthy response to inference request in 3.0045690536499023s
Received healthy response to inference request in 3.0629401206970215s
Received healthy response to inference request in 2.6139683723449707s
Received healthy response to inference request in 4.601788520812988s
Received healthy response to inference request in 3.35453200340271s
Received healthy response to inference request in 3.5180554389953613s
Received healthy response to inference request in 2.980541944503784s
Received healthy response to inference request in 1.9477436542510986s
10 requests
0 failed requests
5th percentile: 1.9638410925865173
10th percentile: 1.979938530921936
20th percentile: 2.387737274169922
30th percentile: 2.5764156579971313
40th percentile: 2.8339125156402587
50th percentile: 2.9925554990768433
60th percentile: 3.02791748046875
70th percentile: 3.150417685508728
80th percentile: 3.3872366905212403
90th percentile: 3.6264287471771235
95th percentile: 4.114108633995055
99th percentile: 4.504252543449402
mean time: 2.9556447505950927
Pipeline stage StressChecker completed in 31.07s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.99s
Shutdown handler de-registered
function_jafor_2025-12-15 status is now deployed due to DeploymentManager action
function_jafor_2025-12-15 status is now inactive due to auto deactivation removed underperforming models
function_jafor_2025-12-15 status is now torndown due to DeploymentManager action