developer_uid: chai_evaluation_service
submission_id: function_gihob_2025-12-15
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-18T12:01:09+00:00
num_battles: 6888
num_wins: 3442
celo_rating: 1293.09
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-18
win_ratio: 0.4997096399535424
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.6761245727539062s
Received healthy response to inference request in 2.3661813735961914s
Received healthy response to inference request in 2.034337043762207s
Received healthy response to inference request in 2.4178688526153564s
Received healthy response to inference request in 1.852778673171997s
Received healthy response to inference request in 2.1167056560516357s
Received healthy response to inference request in 2.4742186069488525s
Received healthy response to inference request in 1.7114598751068115s
Received healthy response to inference request in 2.5025253295898438s
10 requests
1 failed requests
5th percentile: 1.775053334236145
10th percentile: 1.8386467933654784
20th percentile: 1.998025369644165
30th percentile: 2.0919950723648073
40th percentile: 2.266391086578369
50th percentile: 2.392025113105774
60th percentile: 2.4404087543487547
70th percentile: 2.48271062374115
80th percentile: 2.537245178222656
90th percentile: 4.419255042076105
95th percentile: 12.263342154026013
99th percentile: 18.538611843585972
mean time: 4.025962924957275
%s, retrying in %s seconds...
Received healthy response to inference request in 2.262153148651123s
Received healthy response to inference request in 2.6797618865966797s
Received healthy response to inference request in 1.7311053276062012s
Received healthy response to inference request in 2.522803544998169s
Received healthy response to inference request in 2.4882640838623047s
Received healthy response to inference request in 2.743165969848633s
Received healthy response to inference request in 3.1531224250793457s
Received healthy response to inference request in 3.280830144882202s
Received healthy response to inference request in 2.382164239883423s
Received healthy response to inference request in 2.6518495082855225s
10 requests
0 failed requests
5th percentile: 1.970076847076416
10th percentile: 2.209048366546631
20th percentile: 2.358162021636963
30th percentile: 2.45643413066864
40th percentile: 2.5089877605438233
50th percentile: 2.5873265266418457
60th percentile: 2.663014459609985
70th percentile: 2.698783111572266
80th percentile: 2.8251572608947755
90th percentile: 3.1658931970596313
95th percentile: 3.2233616709709167
99th percentile: 3.269336450099945
mean time: 2.5895220279693603
Pipeline stage StressChecker completed in 69.99s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_gihob_2025-12-15 status is now deployed due to DeploymentManager action
function_gihob_2025-12-15 status is now inactive due to auto deactivation removed underperforming models
function_gihob_2025-12-15 status is now torndown due to DeploymentManager action