submission_id: function_domet_2024-10-18
developer_uid: chai_backend_admin
celo_rating: 1273.65
display_name: reward_blend_default_full_bon
family_friendly_score: 0.5934671389216843
family_friendly_standard_error: 0.004848467184780921
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 50, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>', '<|user|>', '###'], 'max_input_tokens': 512, 'best_of': 4, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: reward_blend_default_full_bon
num_battles: 17009
num_wins: 8871
ranking_group: single
status: torndown
submission_type: function
timestamp: 2024-10-18T19:21:26+00:00
us_pacific_date: 2024-10-18
win_ratio: 0.5215474160738433
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.790727138519287s
Received healthy response to inference request in 3.9548532962799072s
Failed to get response for submission chaiml-nemo-20241016-bre_9520_v5: ('http://chaiml-nemo-20241016-bre-9520-v5-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'read tcp 127.0.0.1:55014->127.0.0.1:8080: read: connection reset by peer\n')
Received healthy response to inference request in 2.684215784072876s
Received healthy response to inference request in 3.699573278427124s
Received healthy response to inference request in 2.4927523136138916s
5 requests
0 failed requests
5th percentile: 2.5310450077056883
10th percentile: 2.5693377017974854
20th percentile: 2.6459230899810793
30th percentile: 2.705518054962158
40th percentile: 2.7481225967407226
50th percentile: 2.790727138519287
60th percentile: 3.1542655944824216
70th percentile: 3.5178040504455566
80th percentile: 3.7506292819976808
90th percentile: 3.8527412891387938
95th percentile: 3.9037972927093505
99th percentile: 3.9446420955657957
mean time: 3.124424362182617
Pipeline stage StressChecker completed in 16.90s
Shutdown handler de-registered
function_domet_2024-10-18 status is now deployed due to DeploymentManager action
function_domet_2024-10-18 status is now inactive due to auto deactivation removed underperforming models
function_domet_2024-10-18 status is now torndown due to DeploymentManager action