developer_uid: rirv938
submission_id: function_lehul_2025-03-08
model_name: retune_with_base
model_group:
status: torndown
timestamp: 2025-03-08T18:34:49+00:00
num_battles: 8796
num_wins: 4595
celo_rating: 1301.06
family_friendly_score: 0.5464
family_friendly_standard_error: 0.007040554523615309
submission_type: function
display_name: retune_with_base
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-03-08
win_ratio: 0.5223965438835835
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.7935009002685547s
Received healthy response to inference request in 1.9224493503570557s
Received healthy response to inference request in 2.9097697734832764s
Received healthy response to inference request in 1.52138352394104s
Received healthy response to inference request in 3.7433321475982666s
5 requests
0 failed requests
5th percentile: 1.5758069992065429
10th percentile: 1.630230474472046
20th percentile: 1.7390774250030518
30th percentile: 1.8192905902862548
40th percentile: 1.8708699703216554
50th percentile: 1.9224493503570557
60th percentile: 2.3173775196075437
70th percentile: 2.712305688858032
80th percentile: 3.0764822483062746
90th percentile: 3.4099071979522706
95th percentile: 3.5766196727752684
99th percentile: 3.709989652633667
mean time: 2.3780871391296388
Pipeline stage StressChecker completed in 12.98s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.71s
Shutdown handler de-registered
function_lehul_2025-03-08 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3008.37s
Shutdown handler de-registered
function_lehul_2025-03-08 status is now inactive due to auto deactivation removed underperforming models
function_lehul_2025-03-08 status is now torndown due to DeploymentManager action
ChatRequest
Generation Params
Prompt Formatter
Chat History
ChatMessage 1