developer_uid: chai_evaluation_service
submission_id: function_juget_2025-12-14
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-17T20:41:22+00:00
num_battles: 8790
num_wins: 4364
celo_rating: 1256.4
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.4964732650739477
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
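The formatter entry above is a set of string templates. As a rough illustration of how they could combine into one model input, here is a minimal sketch. The assembly order (memory, prompt, chat turns, response header) and the names `build_prompt`, `turns`, and the example persona are assumptions for illustration, not taken from the submission. Truncation to `max_input_tokens` is not modeled.

```python
# Templates copied from the formatter entry in the metadata.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order."""
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    # Each turn is a (speaker, message) pair; speaker is "bot" or "user".
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(FORMATTER["bot_template"].format(
                bot_name=bot_name, message=message))
        else:
            parts.append(FORMATTER["user_template"].format(
                user_name=user_name, message=message))
    # The response header ends with "{bot_name}:" so the model continues
    # the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    memory="You are Ava.",
    prompt="A chat.",
    turns=[("user", "Hi"), ("bot", "Hello")],
    bot_name="Ava",
    user_name="Sam",
)
print(example)
```

Because `stopping_words` is `['\n']` in the generation parameters, a completion following this header would presumably end at the first newline, giving a single-line reply.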
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 4.393328428268433s
Received healthy response to inference request in 5.015485763549805s
Received healthy response to inference request in 5.200793027877808s
Received healthy response to inference request in 4.146046161651611s
Received healthy response to inference request in 3.927067518234253s
Received healthy response to inference request in 4.495083808898926s
Received healthy response to inference request in 2.5894315242767334s
Received healthy response to inference request in 4.294299840927124s
Received healthy response to inference request in 2.084768295288086s
10 requests
1 failed requests
5th percentile: 2.311866748332977
10th percentile: 2.5389652013778687
20th percentile: 3.659540319442749
30th percentile: 4.080352568626404
40th percentile: 4.234998369216919
50th percentile: 4.343814134597778
60th percentile: 4.43403058052063
70th percentile: 4.651204395294189
80th percentile: 5.052547216415405
90th percentile: 6.695362019538874
95th percentile: 13.420922482013687
99th percentile: 18.801370851993564
mean time: 5.6292787313461305
%s, retrying in %s seconds...
Received healthy response to inference request in 2.7895681858062744s
Received healthy response to inference request in 4.718525409698486s
Received healthy response to inference request in 3.184804677963257s
Received healthy response to inference request in 7.166722536087036s
Received healthy response to inference request in 1.939516305923462s
Received healthy response to inference request in 5.216093063354492s
Received healthy response to inference request in 6.107512712478638s
Received healthy response to inference request in 3.5404019355773926s
Received healthy response to inference request in 4.174333095550537s
Received healthy response to inference request in 7.2617528438568115s
10 requests
0 failed requests
5th percentile: 2.3220396518707274
10th percentile: 2.7045629978179933
20th percentile: 3.1057573795318603
30th percentile: 3.4337227582931518
40th percentile: 3.9207606315612793
50th percentile: 4.446429252624512
60th percentile: 4.917552471160889
70th percentile: 5.483518958091736
80th percentile: 6.319354677200318
90th percentile: 7.176225566864014
95th percentile: 7.218989205360413
99th percentile: 7.253200116157532
mean time: 4.609923076629639
Pipeline stage StressChecker completed in 104.91s
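The percentile and mean figures of the second StressChecker run can be reproduced from its ten logged latencies. The sketch below assumes linear interpolation between sorted samples (the default behaviour of common numerical libraries). The log does not state which method the checker uses, so that choice is an inference, and `percentile` and `SECOND_RUN` are names introduced here for illustration.

```python
import math

# Latencies (seconds) of the ten healthy responses in the second run.
SECOND_RUN = [
    2.7895681858062744, 4.718525409698486, 3.184804677963257,
    7.166722536087036, 1.939516305923462, 5.216093063354492,
    6.107512712478638, 3.5404019355773926, 4.174333095550537,
    7.2617528438568115,
]


def percentile(values, q):
    """q-th percentile (0-100) using linear interpolation (assumed method)."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0   # fractional index into ordered
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


for q in (5, 50, 95, 99):
    print(f"{q}th percentile: {percentile(SECOND_RUN, q)}")
print(f"mean time: {sum(SECOND_RUN) / len(SECOND_RUN)}")
```

Under this assumption the results agree with the logged values to within floating-point rounding. With only ten samples, the upper percentiles are driven almost entirely by the one or two slowest requests.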
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.57s
Shutdown handler de-registered
function_juget_2025-12-14 status is now deployed due to DeploymentManager action
function_juget_2025-12-14 status is now inactive due to auto deactivation removed underperforming models
function_juget_2025-12-14 status is now torndown due to DeploymentManager action