developer_uid: chai_evaluation_service
submission_id: function_dadem_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T00:51:15+00:00
num_battles: 7229
num_wins: 3722
celo_rating: 1303.61
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-18
win_ratio: 0.5148706598423018
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.0518651008605957s
Received healthy response to inference request in 2.455883026123047s
Received healthy response to inference request in 1.682121992111206s
Received healthy response to inference request in 2.3367040157318115s
Received healthy response to inference request in 3.773919105529785s
Received healthy response to inference request in 2.227501630783081s
Received healthy response to inference request in 2.2802531719207764s
Received healthy response to inference request in 1.9347560405731201s
Received healthy response to inference request in 3.6784186363220215s
10 requests
1 failed requests
5th percentile: 1.7958073139190673
10th percentile: 1.9094926357269286
20th percentile: 2.0284432888031008
30th percentile: 2.1748106718063354
40th percentile: 2.259152555465698
50th percentile: 2.308478593826294
60th percentile: 2.3843756198883055
70th percentile: 2.822643709182739
80th percentile: 3.697518730163574
90th percentile: 5.411413073539729
95th percentile: 12.780135929584485
99th percentile: 18.675114214420322
mean time: 4.2570281505584715
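Note: this run reports 10 requests, but only nine healthy latencies are logged. A minimal sketch, assuming the reported mean covers all 10 requests including the failed one (the log does not state this), recovers the implied duration of the failed request from the mean. The variable names are illustrative only.

```python
# Healthy latencies (seconds) copied from the first StressChecker run above.
healthy = [
    2.0518651008605957, 2.455883026123047, 1.682121992111206,
    2.3367040157318115, 3.773919105529785, 2.227501630783081,
    2.2802531719207764, 1.9347560405731201, 3.6784186363220215,
]
reported_mean = 4.2570281505584715  # "mean time" from the log
total_requests = 10                 # "10 requests" from the log

# Assumption: the failed request's elapsed time is included in the mean,
# so total time minus the healthy time is the failed request's duration.
implied_failed = reported_mean * total_requests - sum(healthy)
print(f"implied duration of failed request: {implied_failed:.3f}s")
```

Under that assumption the result lands just above the 20 s read timeout shown in the HTTPConnectionPool error, which would also explain why the 95th and 99th percentiles of this run are far above every healthy latency.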
%s, retrying in %s seconds...
Received healthy response to inference request in 1.804532766342163s
Received healthy response to inference request in 2.899989128112793s
Received healthy response to inference request in 1.7910583019256592s
Received healthy response to inference request in 1.8943374156951904s
Received healthy response to inference request in 2.6939680576324463s
Received healthy response to inference request in 2.330988645553589s
Received healthy response to inference request in 1.6236107349395752s
Received healthy response to inference request in 2.197322130203247s
Received healthy response to inference request in 2.6124613285064697s
Received healthy response to inference request in 2.80588698387146s
10 requests
0 failed requests
5th percentile: 1.698962140083313
10th percentile: 1.7743135452270509
20th percentile: 1.8018378734588623
30th percentile: 1.8673960208892821
40th percentile: 2.0761282444000244
50th percentile: 2.264155387878418
60th percentile: 2.443577718734741
70th percentile: 2.6369133472442625
80th percentile: 2.716351842880249
90th percentile: 2.8152971982955934
95th percentile: 2.857643163204193
99th percentile: 2.891519935131073
mean time: 2.2654155492782593
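Note: the percentile lines above are consistent with linear interpolation between sorted samples. A minimal sketch, assuming that method (the log does not name the one StressChecker actually uses), with a hypothetical `percentile` helper:

```python
import math

def percentile(values, q):
    """Linearly interpolated percentile for q in [0, 100] (hypothetical helper)."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into sorted data
    lo = math.floor(rank)
    hi = math.ceil(rank)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (rank - lo)

# Latencies (seconds) copied from the second StressChecker run above.
latencies = [
    1.804532766342163, 2.899989128112793, 1.7910583019256592,
    1.8943374156951904, 2.6939680576324463, 2.330988645553589,
    1.6236107349395752, 2.197322130203247, 2.6124613285064697,
    2.80588698387146,
]

for q in (5, 50, 95):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only 10 samples, the high percentiles are interpolated almost entirely from the two slowest requests, so a single slow or timed-out request dominates them.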
Pipeline stage StressChecker completed in 68.75s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_dadem_2025-12-16 status is now deployed due to DeploymentManager action
function_dadem_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_dadem_2025-12-16 status is now torndown due to DeploymentManager action