submission_id: function_bulef_2024-08-09
developer_uid: chai_backend_admin
alignment_samples: 0
display_name: gpt-4o
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 512, 'best_of': 1, 'max_output_tokens': 64, 'reward_max_token_input': 256}
is_internal_developer: True
model_group:
model_name: gpt-4o
num_battles: 57
num_wins: 20
propriety_score: 0.75
propriety_total_count: 4.0
ranking_group: single
reward_repo: ChaiML/gpt2_xl_pairwise_89m_step_347634
status: torndown
submission_type: function
timestamp: 2024-08-09T15:06:43+00:00
us_pacific_date: 2024-08-09
win_ratio: 0.3508771929824561
Download Preference Data
Resubmit model
Running pipeline stage StressChecker
Received healthy response to inference request in 3.532430410385132s
Received healthy response to inference request in 4.711452960968018s
Received healthy response to inference request in 3.4244585037231445s
Received healthy response to inference request in 6.446208953857422s
Received healthy response to inference request in 3.776556968688965s
5 requests
0 failed requests
5th percentile: 3.446052885055542
10th percentile: 3.4676472663879396
20th percentile: 3.5108360290527343
30th percentile: 3.5812557220458983
40th percentile: 3.6789063453674316
50th percentile: 3.776556968688965
60th percentile: 4.150515365600586
70th percentile: 4.524473762512207
80th percentile: 5.058404159545899
90th percentile: 5.752306556701661
95th percentile: 6.099257755279541
99th percentile: 6.376818714141845
mean time: 4.378221559524536
Pipeline stage StressChecker completed in 22.55s
function_bulef_2024-08-09 status is now deployed due to DeploymentManager action
function_bulef_2024-08-09 status is now inactive due to admin request
function_bulef_2024-08-09 status is now torndown due to DeploymentManager action
function_bulef_2024-08-09 status is now torndown due to DeploymentManager action