submission_id: function_bapof_2024-08-15
developer_uid: chai_backend_admin
celo_rating: 1114.02
display_name: gpt-4o-mini-reward
family_friendly_score: 0.0
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 512, 'best_of': 4, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: gpt-4o-mini-reward
num_battles: 6834
num_wins: 2383
ranking_group: single
status: torndown
submission_type: function
timestamp: 2024-08-15T16:39:44+00:00
us_pacific_date: 2024-08-15
win_ratio: 0.34869768803043605
Download Preferencedata
Resubmit model
Running pipeline stage StressChecker
Received healthy response to inference request in 2.27058482170105s
Received healthy response to inference request in 1.3387644290924072s
Received healthy response to inference request in 0.8043897151947021s
Received healthy response to inference request in 1.1039955615997314s
Received healthy response to inference request in 1.2977893352508545s
5 requests
0 failed requests
5th percentile: 0.864310884475708
10th percentile: 0.9242320537567139
20th percentile: 1.0440743923187257
30th percentile: 1.142754316329956
40th percentile: 1.2202718257904053
50th percentile: 1.2977893352508545
60th percentile: 1.3141793727874755
70th percentile: 1.3305694103240966
80th percentile: 1.5251285076141359
90th percentile: 1.897856664657593
95th percentile: 2.084220743179321
99th percentile: 2.233312005996704
mean time: 1.363104772567749
Pipeline stage StressChecker completed in 7.45s
function_bapof_2024-08-15 status is now deployed due to DeploymentManager action
function_bapof_2024-08-15 status is now inactive due to admin request
function_bapof_2024-08-15 status is now torndown due to DeploymentManager action