developer_uid: NischayDnk
submission_id: function_rupel_2025-07-08
model_name: function_rupel_2025-07-08
model_group:
status: torndown
timestamp: 2025-07-08T22:56:22+00:00
num_battles: 7579
num_wins: 3719
celo_rating: 1277.45
family_friendly_score: 0.5327999999999999
family_friendly_standard_error: 0.007055836732804976
submission_type: function
display_name: function_rupel_2025-07-08
is_internal_developer: False
ranking_group: single
us_pacific_date: 2025-07-08
win_ratio: 0.490697981264019
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 3.094106435775757s
Received healthy response to inference request in 2.998730421066284s
Received healthy response to inference request in 3.5707476139068604s
Received healthy response to inference request in 3.5476229190826416s
5 requests
1 failed requests
5th percentile: 3.0178056240081785
10th percentile: 3.0368808269500733
20th percentile: 3.0750312328338625
30th percentile: 3.1848097324371336
40th percentile: 3.3662163257598876
50th percentile: 3.5476229190826416
60th percentile: 3.5568727970123293
70th percentile: 3.5661226749420165
80th percentile: 6.881605815887454
90th percentile: 13.503322219848634
95th percentile: 16.81418042182922
99th percentile: 19.462866983413697
mean time: 6.6672492027282715
%s, retrying in %s seconds...
Received healthy response to inference request in 3.7906265258789062s
Received healthy response to inference request in 4.3328537940979s
Received healthy response to inference request in 4.0612804889678955s
Received healthy response to inference request in 2.927250623703003s
Received healthy response to inference request in 4.855731964111328s
5 requests
0 failed requests
5th percentile: 3.0999258041381834
10th percentile: 3.2726009845733643
20th percentile: 3.6179513454437258
30th percentile: 3.844757318496704
40th percentile: 3.9530189037323
50th percentile: 4.0612804889678955
60th percentile: 4.169909811019897
70th percentile: 4.2785391330719
80th percentile: 4.437429428100586
90th percentile: 4.646580696105957
95th percentile: 4.751156330108643
99th percentile: 4.834816837310791
mean time: 3.9935486793518065
%s, retrying in %s seconds...
Received healthy response to inference request in 3.488067865371704s
Received healthy response to inference request in 3.0767619609832764s
Received healthy response to inference request in 2.849229574203491s
Received healthy response to inference request in 3.2811625003814697s
Received healthy response to inference request in 3.8405866622924805s
5 requests
0 failed requests
5th percentile: 2.8947360515594482
10th percentile: 2.9402425289154053
20th percentile: 3.0312554836273193
30th percentile: 3.117642068862915
40th percentile: 3.1994022846221926
50th percentile: 3.2811625003814697
60th percentile: 3.3639246463775634
70th percentile: 3.446686792373657
80th percentile: 3.5585716247558596
90th percentile: 3.69957914352417
95th percentile: 3.770082902908325
99th percentile: 3.8264859104156494
mean time: 3.3071617126464843
Pipeline stage StressChecker completed in 73.98s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.81s
Shutdown handler de-registered
function_rupel_2025-07-08 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3836.20s
Shutdown handler de-registered
function_rupel_2025-07-08 status is now inactive due to auto deactivation removed underperforming models
function_rupel_2025-07-08 status is now torndown due to DeploymentManager action