submission_id: function_kuhit_2024-08-20
developer_uid: chai_backend_admin
alignment_samples: 591
alignment_score: 2.5101690284534826
display_name: gpt4-tl
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.1, 'top_k': 100, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', 'You:'], 'max_input_tokens': 512, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: gpt4-tl
num_battles: 591
num_wins: 281
propriety_score: 0.7659574468085106
propriety_total_count: 47.0
ranking_group: single
status: torndown
submission_type: function
timestamp: 2024-08-20T17:51:25+00:00
us_pacific_date: 2024-08-20
win_ratio: 0.4754653130287648
Download Preference Data
Resubmit model
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2184321880340576s
Failed to get response for submission function_desof_2024-08-20: ('http://chaiml-llama-8b-pairwis-8189-v17-predictor.tenant-chaiml-guanaco.k.chaiverse.com/v1/models/GPT-J-6B-lit-v2:predict', 'read tcp 127.0.0.1:44196->127.0.0.1:8080: read: connection reset by peer\n')
Received healthy response to inference request in 2.564561605453491s
Received healthy response to inference request in 2.3853135108947754s
Received healthy response to inference request in 5.4463560581207275s
Received healthy response to inference request in 1.6036489009857178s
5 requests
0 failed requests
5th percentile: 1.7266055583953857
10th percentile: 1.8495622158050538
20th percentile: 2.0954755306243897
30th percentile: 2.251808452606201
40th percentile: 2.318560981750488
50th percentile: 2.3853135108947754
60th percentile: 2.457012748718262
70th percentile: 2.5287119865417482
80th percentile: 3.140920495986939
90th percentile: 4.293638277053834
95th percentile: 4.86999716758728
99th percentile: 5.331084280014038
mean time: 2.843662452697754
Pipeline stage StressChecker completed in 14.77s
function_kuhit_2024-08-20 status is now deployed due to DeploymentManager action
function_kuhit_2024-08-20 status is now inactive due to admin request
function_kuhit_2024-08-20 status is now torndown due to DeploymentManager action

Usage Metrics

Latency Metrics