submission_id: function_kelur_2024-08-09
developer_uid: chai_backend_admin
alignment_samples: 10
celo_rating: 1003.21
display_name: gpt-4o
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 512, 'best_of': 1, 'max_output_tokens': 64, 'reward_max_token_input': 256}
is_internal_developer: True
model_group:
model_name: gpt-4o
num_battles: 1224
num_wins: 299
propriety_score: 0.5583333333333333
propriety_total_count: 120.0
ranking_group: single
reward_repo: ChaiML/gpt2_xl_pairwise_89m_step_347634
status: torndown
submission_type: function
timestamp: 2024-08-09T16:03:47+00:00
us_pacific_date: 2024-08-09
win_ratio: 0.24428104575163398
Download Preference Data
Resubmit model
Running pipeline stage StressChecker
Received healthy response to inference request in 5.139282941818237s
Received healthy response to inference request in 3.0143890380859375s
Received healthy response to inference request in 2.347299098968506s
Received healthy response to inference request in 2.101793050765991s
Received healthy response to inference request in 3.753021478652954s
5 requests
0 failed requests
5th percentile: 2.1508942604064942
10th percentile: 2.1999954700469972
20th percentile: 2.298197889328003
30th percentile: 2.480717086791992
40th percentile: 2.747553062438965
50th percentile: 3.0143890380859375
60th percentile: 3.309842014312744
70th percentile: 3.6052949905395506
80th percentile: 4.030273771286011
90th percentile: 4.584778356552124
95th percentile: 4.86203064918518
99th percentile: 5.0838324832916255
mean time: 3.2711571216583253
Pipeline stage StressChecker completed in 16.96s
function_kelur_2024-08-09 status is now deployed due to DeploymentManager action
function_kelur_2024-08-09 status is now inactive due to admin request
function_kelur_2024-08-09 status is now deployed due to admin request
function_kelur_2024-08-09 status is now torndown due to DeploymentManager action

Usage Metrics

Latency Metrics