developer_uid: chai_backend_admin
submission_id: function_tefon_2024-11-16
model_name: retune_with_base
model_group:
status: inactive
timestamp: 2024-11-16T19:33:27+00:00
num_battles: 19944
num_wins: 10036
celo_rating: 1253.29
family_friendly_score: 0.575
family_friendly_standard_error: 0.006991065727054781
submission_type: function
display_name: retune_with_base
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-11-16
win_ratio: 0.5032089851584436
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.3181161880493164s
Received healthy response to inference request in 3.8250865936279297s
Received healthy response to inference request in 4.519575595855713s
Received healthy response to inference request in 3.288239002227783s
Received healthy response to inference request in 3.1969683170318604s
5 requests
0 failed requests
5th percentile: 3.215222454071045
10th percentile: 3.2334765911102297
20th percentile: 3.2699848651885985
30th percentile: 3.29421443939209
40th percentile: 3.306165313720703
50th percentile: 3.3181161880493164
60th percentile: 3.5209043502807615
70th percentile: 3.723692512512207
80th percentile: 3.9639843940734867
90th percentile: 4.2417799949646
95th percentile: 4.380677795410156
99th percentile: 4.491796035766601
mean time: 3.6295971393585207
%s, retrying in %s seconds...
Received healthy response to inference request in 5.22058367729187s
Received healthy response to inference request in 2.820507526397705s
Received healthy response to inference request in 2.4885456562042236s
Received healthy response to inference request in 3.0644662380218506s
Received healthy response to inference request in 2.6007513999938965s
5 requests
0 failed requests
5th percentile: 2.510986804962158
10th percentile: 2.5334279537200928
20th percentile: 2.578310251235962
30th percentile: 2.6447026252746584
40th percentile: 2.7326050758361817
50th percentile: 2.820507526397705
60th percentile: 2.9180910110473635
70th percentile: 3.0156744956970214
80th percentile: 3.495689725875855
90th percentile: 4.358136701583862
95th percentile: 4.789360189437866
99th percentile: 5.134338979721069
mean time: 3.238970899581909
Pipeline stage StressChecker completed in 37.17s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 5.83s
Shutdown handler de-registered
function_tefon_2024-11-16 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3069.26s
Shutdown handler de-registered
function_tefon_2024-11-16 status is now inactive due to auto deactivation removed underperforming models