developer_uid: chai_evaluation_service
submission_id: function_lijaf_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T07:51:16+00:00
num_battles: 8254
num_wins: 4165
celo_rating: 1296.54
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-18
win_ratio: 0.5046038284468136
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.775773763656616s
Received healthy response to inference request in 1.780691146850586s
Received healthy response to inference request in 3.04490065574646s
Received healthy response to inference request in 1.7022888660430908s
Received healthy response to inference request in 3.5043866634368896s
Received healthy response to inference request in 2.352365255355835s
Received healthy response to inference request in 2.63566517829895s
Received healthy response to inference request in 2.93969464302063s
Received healthy response to inference request in 3.0441501140594482s
10 requests
1 failed requests
5th percentile: 1.7375698924064635
10th percentile: 1.7728509187698365
20th percentile: 2.238030433654785
30th percentile: 2.5506752014160154
40th percentile: 2.71973032951355
50th percentile: 2.857734203338623
60th percentile: 2.981476831436157
70th percentile: 3.044375276565552
80th percentile: 3.136797857284546
90th percentile: 5.202704215049738
95th percentile: 12.845133197307568
99th percentile: 18.959076383113864
mean time: 4.426747846603393
%s, retrying in %s seconds...
Received healthy response to inference request in 2.1083366870880127s
Received healthy response to inference request in 2.4047446250915527s
Received healthy response to inference request in 2.8997316360473633s
Received healthy response to inference request in 2.3558003902435303s
Received healthy response to inference request in 1.895815372467041s
Received healthy response to inference request in 2.2398860454559326s
Received healthy response to inference request in 2.7541236877441406s
Received healthy response to inference request in 2.5656397342681885s
Received healthy response to inference request in 2.5149221420288086s
Received healthy response to inference request in 2.0245585441589355s
10 requests
0 failed requests
5th percentile: 1.9537497997283935
10th percentile: 2.011684226989746
20th percentile: 2.0915810585021974
30th percentile: 2.2004212379455566
40th percentile: 2.309434652328491
50th percentile: 2.3802725076675415
60th percentile: 2.448815631866455
70th percentile: 2.5301374197006226
80th percentile: 2.603336524963379
90th percentile: 2.768684482574463
95th percentile: 2.834208059310913
99th percentile: 2.886626920700073
mean time: 2.3763558864593506
Pipeline stage StressChecker completed in 70.86s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.57s
Shutdown handler de-registered
function_lijaf_2025-12-16 status is now deployed due to DeploymentManager action
function_lijaf_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_lijaf_2025-12-16 status is now torndown due to DeploymentManager action