developer_uid: chai_backend_admin
submission_id: function_tukab_2024-11-27
model_name: retune_with_base
model_group:
status: inactive
timestamp: 2024-11-27T23:56:08+00:00
num_battles: 14959
num_wins: 7447
celo_rating: 1263.86
family_friendly_score: 0.5598000000000001
family_friendly_standard_error: 0.007020312813543283
submission_type: function
display_name: retune_with_base
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-11-27
win_ratio: 0.49782739487933686
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
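The formatter templates above can be illustrated with a minimal sketch of how one prompt string might be assembled. The template strings are copied from the `formatter` field. The helper `build_prompt`, the ordering of the parts (memory, prompt, conversation turns, response prefix), and the sample names and messages are assumptions for illustration, not the platform's confirmed implementation. Truncation (`truncate_by_message`, `max_input_tokens`) is not modeled.

```python
# Template strings copied from the formatter field of this record.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates into one model input string.

    `turns` is a list of (speaker, message) pairs where speaker is
    "bot" or "user". The part ordering is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template leaves the bot's next line open for generation.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Illustrative data only; none of these values come from the record.
example = build_prompt(
    bot_name="Ava",
    user_name="Sam",
    memory="A cheerful guide.",
    prompt="Ava greets travelers.",
    turns=[("user", "Hi"), ("bot", "Hello!"), ("user", "How are you?")],
)
print(example)
```

Because `stopping_words` includes `'\n'`, generation under these settings would presumably stop at the end of the single line that follows the trailing `Ava:` prefix.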
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.029597043991089s
Received healthy response to inference request in 2.5397019386291504s
Received healthy response to inference request in 2.1171329021453857s
Received healthy response to inference request in 1.9092097282409668s
Received healthy response to inference request in 2.19049334526062s
5 requests
0 failed requests
5th percentile: 1.9332871913909913
10th percentile: 1.9573646545410157
20th percentile: 2.0055195808410646
30th percentile: 2.0471042156219483
40th percentile: 2.082118558883667
50th percentile: 2.1171329021453857
60th percentile: 2.1464770793914796
70th percentile: 2.1758212566375734
80th percentile: 2.2603350639343263
90th percentile: 2.400018501281738
95th percentile: 2.4698602199554442
99th percentile: 2.525733594894409
mean time: 2.1572269916534426
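The percentile and mean figures above are consistent with linear interpolation between the sorted order statistics of the five logged response times. The sketch below reproduces them under that assumption. The `percentile` helper is hypothetical and is not taken from the StressChecker code, which may compute these values differently (for example with a library routine).

```python
# Response times copied from the five "Received healthy response" log lines.
latencies = [
    2.029597043991089,
    2.5397019386291504,
    2.1171329021453857,
    1.9092097282409668,
    2.19049334526062,
]


def percentile(values, q):
    """Hypothetical helper: q-th percentile by linear interpolation.

    The fractional rank is (n - 1) * q / 100 over the sorted values,
    and the result is interpolated between the two neighbouring values.
    """
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    frac = pos - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")

mean_time = sum(latencies) / len(latencies)
print(f"mean time: {mean_time}")
```

With only five samples, the upper percentiles are driven almost entirely by the single slowest request (about 2.54 s), so they are a coarse health check and not a stable latency estimate.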
Pipeline stage StressChecker completed in 12.10s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.17s
Shutdown handler de-registered
function_tukab_2024-11-27 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3819.47s
Shutdown handler de-registered
function_tukab_2024-11-27 status is now inactive due to auto deactivation removed underperforming models