developer_uid: chai_evaluation_service
submission_id: function_kufom_2025-12-15
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-18T03:21:20+00:00
num_battles: 10958
num_wins: 5601
celo_rating: 1301.0
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-17
win_ratio: 0.5111334185070269
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
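The formatter entry above lists template strings but not how they are combined. The following is a minimal sketch of one plausible assembly order (memory, then prompt, then chat turns, then the response header). The function name `build_prompt`, the turn representation, the sample names, and the ordering itself are assumptions for illustration, not the evaluation service's actual code.

```python
# Hypothetical sketch: combine the formatter templates from the record above
# into a single prompt string. The assembly order is an assumption.
formatter = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(formatter, bot_name, user_name, memory, prompt, turns):
    """Render memory, prompt, chat turns, and the response header in order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user" (a representation assumed for this sketch).
    """
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues the line.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    formatter,
    bot_name="Richard",
    user_name="Sam",
    memory="Richard is a calm guide.",
    prompt="A chat at a trailhead.",
    turns=[("user", "Hi!"), ("bot", "Hello.")],
)
print(example)
```

Because the response template ends without a newline and `stopping_words` is `['\n']`, generation under this layout would stop at the end of a single bot line.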
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 3.5139081478118896s
Received healthy response to inference request in 5.4474639892578125s
Received healthy response to inference request in 3.5048794746398926s
Received healthy response to inference request in 1.8706738948822021s
Received healthy response to inference request in 4.4545440673828125s
Received healthy response to inference request in 1.9796350002288818s
Received healthy response to inference request in 1.9300289154052734s
Received healthy response to inference request in 2.8898110389709473s
Received healthy response to inference request in 2.2548108100891113s
10 requests
1 failed requests
5th percentile: 1.8973836541175841
10th percentile: 1.9240934133529664
20th percentile: 1.9697137832641602
30th percentile: 2.1722580671310423
40th percentile: 2.635810947418213
50th percentile: 3.19734525680542
60th percentile: 3.5084909439086913
70th percentile: 3.796098923683166
80th percentile: 4.653128051757813
90th percentile: 6.914322853088374
95th percentile: 13.515187740325912
99th percentile: 18.795879650115968
mean time: 4.79618079662323
%s, retrying in %s seconds...
Received healthy response to inference request in 2.7743122577667236s
Received healthy response to inference request in 2.968860149383545s
Received healthy response to inference request in 3.2289302349090576s
Received healthy response to inference request in 2.6625964641571045s
Received healthy response to inference request in 3.623042106628418s
Received healthy response to inference request in 2.9369168281555176s
Received healthy response to inference request in 2.689267158508301s
Received healthy response to inference request in 3.6053645610809326s
Received healthy response to inference request in 4.40428352355957s
Received healthy response to inference request in 2.5956568717956543s
10 requests
0 failed requests
5th percentile: 2.625779688358307
10th percentile: 2.6559025049209595
20th percentile: 2.6839330196380615
30th percentile: 2.7487987279891968
40th percentile: 2.871875
50th percentile: 2.9528884887695312
60th percentile: 3.07288818359375
70th percentile: 3.34186053276062
80th percentile: 3.6089000701904297
90th percentile: 3.701166248321533
95th percentile: 4.052724885940551
99th percentile: 4.333971796035767
mean time: 3.1489230155944825
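The percentile and mean figures for this second batch are consistent with linear interpolation between order statistics over the ten logged latencies. The sketch below recomputes them in plain Python. The helper `percentile` is written for illustration and is an assumption about the method, not the StressChecker's own implementation.

```python
import math

# Latencies (seconds) copied from the ten healthy responses logged above.
latencies = [
    2.7743122577667236,
    2.968860149383545,
    3.2289302349090576,
    2.6625964641571045,
    3.623042106628418,
    2.9369168281555176,
    2.689267158508301,
    3.6053645610809326,
    4.40428352355957,
    2.5956568717956543,
]


def percentile(values, q):
    """Return the q-th percentile using linear interpolation between ranks."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


summary = {
    q: percentile(latencies, q)
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99)
}
mean_time = sum(latencies) / len(latencies)

for q, value in summary.items():
    print(f"{q}th percentile: {value}")
print(f"mean time: {mean_time}")
```

With only ten samples, the 95th and 99th percentiles are interpolated between the two slowest requests, so a single slow or timed-out request dominates the tail, as the first batch shows.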
Pipeline stage StressChecker completed in 82.58s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
function_kufom_2025-12-15 status is now deployed due to DeploymentManager action
function_kufom_2025-12-15 status is now inactive due to auto deactivation removed underperforming models
function_kufom_2025-12-15 status is now torndown due to DeploymentManager action