submission_id: function_mehor_2024-11-14
developer_uid: chai_backend_admin
celo_rating: 1277.25
display_name: retune_with_base
family_friendly_score: 0.5808
family_friendly_standard_error: 0.006978128115762852
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: retune_with_base
num_battles: 6068
num_wins: 3224
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-11-14T21:37:03+00:00
us_pacific_date: 2024-11-14
win_ratio: 0.5313117996044825
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.313755512237549s
Received healthy response to inference request in 3.0643630027770996s
Received healthy response to inference request in 2.844698667526245s
Received healthy response to inference request in 5.448190450668335s
Received healthy response to inference request in 3.283423900604248s
5 requests
0 failed requests
5th percentile: 2.888631534576416
10th percentile: 2.932564401626587
20th percentile: 3.0204301357269285
30th percentile: 3.108175182342529
40th percentile: 3.195799541473389
50th percentile: 3.283423900604248
60th percentile: 3.2955565452575684
70th percentile: 3.3076891899108887
80th percentile: 3.7406424999237062
90th percentile: 4.594416475296021
95th percentile: 5.021303462982178
99th percentile: 5.362813053131103
mean time: 3.5908863067626955
Pipeline stage StressChecker completed in 19.74s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 5.34s
Shutdown handler de-registered
function_mehor_2024-11-14 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3092.07s
Shutdown handler de-registered
function_mehor_2024-11-14 status is now inactive due to auto deactivation removed underperforming models