function_lopum_2024-11-15

developer_uid: chai_backend_admin

submission_id: function_lopum_2024-11-15

model_name: retune_with_base

model_group:

status: inactive

timestamp: 2024-11-15T19:02:08+00:00

num_battles: 14272

num_wins: 7314

celo_rating: 1255.96

family_friendly_score: 0.589

family_friendly_standard_error: 0.006958146304871722

submission_type: function

display_name: retune_with_base

is_internal_developer: True

ranking_group: single

us_pacific_date: 2024-11-15

win_ratio: 0.5124719730941704

generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}

formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}

Resubmit model

Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.7235589027404785s
Received healthy response to inference request in 2.903090238571167s
Received healthy response to inference request in 2.9733645915985107s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 2.7343218326568604s
Received healthy response to inference request in 3.051971673965454s
5 requests
0 failed requests
5th percentile: 2.725711488723755
10th percentile: 2.727864074707031
20th percentile: 2.732169246673584
30th percentile: 2.7680755138397215
40th percentile: 2.8355828762054442
50th percentile: 2.903090238571167
60th percentile: 2.9311999797821047
70th percentile: 2.959309720993042
80th percentile: 2.9890860080718995
90th percentile: 3.0205288410186766
95th percentile: 3.0362502574920653
99th percentile: 3.0488273906707763
mean time: 2.877261447906494
Pipeline stage StressChecker completed in 15.71s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 5.83s
Shutdown handler de-registered
function_lopum_2024-11-15 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2981.79s
Shutdown handler de-registered
function_lopum_2024-11-15 status is now inactive due to auto deactivation removed underperforming models