function_mitan_2024-11-14

submission_id: function_mitan_2024-11-14

developer_uid: chai_backend_admin

celo_rating: 1249.83

display_name: retune_with_base

family_friendly_score: 0.5738

family_friendly_standard_error: 0.006993619377689924

formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}

generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}

is_internal_developer: True

model_group:

model_name: retune_with_base

num_battles: 13075

num_wins: 6546

ranking_group: single

status: inactive

submission_type: function

timestamp: 2024-11-14T18:57:11+00:00

us_pacific_date: 2024-11-14

win_ratio: 0.5006500956022945

Resubmit model

Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.7608892917633057s
Received healthy response to inference request in 3.428382396697998s
Received healthy response to inference request in 4.500351190567017s
Received healthy response to inference request in 2.9115641117095947s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 3.2186460494995117s
5 requests
0 failed requests
5th percentile: 2.972980499267578
10th percentile: 3.0343968868255615
20th percentile: 3.1572296619415283
30th percentile: 3.260593318939209
40th percentile: 3.3444878578186037
50th percentile: 3.428382396697998
60th percentile: 3.561385154724121
70th percentile: 3.694387912750244
80th percentile: 3.908781671524048
90th percentile: 4.204566431045532
95th percentile: 4.352458810806274
99th percentile: 4.470772714614868
mean time: 3.5639666080474854
Pipeline stage StressChecker completed in 19.06s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.31s
Shutdown handler de-registered
function_mitan_2024-11-14 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3297.33s
Shutdown handler de-registered
function_mitan_2024-11-14 status is now inactive due to auto deactivation removed underperforming models