developer_uid: rirv938
submission_id: function_ruses_2025-03-05
model_name: retune_with_base
model_group:
status: torndown
timestamp: 2025-03-05T18:53:26+00:00
num_battles: 8367
num_wins: 4250
celo_rating: 1283.87
family_friendly_score: 0.6088
family_friendly_standard_error: 0.006901631111556166
submission_type: function
display_name: retune_with_base
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-03-05
win_ratio: 0.50794789052229
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.0742275714874268s
Received healthy response to inference request in 3.4574739933013916s
Received healthy response to inference request in 3.278796434402466s
Received healthy response to inference request in 2.9078476428985596s
Received healthy response to inference request in 6.542574644088745s
5 requests
0 failed requests
5th percentile: 2.941123628616333
10th percentile: 2.9743996143341063
20th percentile: 3.0409515857696534
30th percentile: 3.1151413440704347
40th percentile: 3.1969688892364503
50th percentile: 3.278796434402466
60th percentile: 3.350267457962036
70th percentile: 3.4217384815216065
80th percentile: 4.074494123458863
90th percentile: 5.308534383773804
95th percentile: 5.9255545139312735
99th percentile: 6.419170618057251
mean time: 3.8521840572357178
Pipeline stage StressChecker completed in 20.47s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.71s
Shutdown handler de-registered
function_ruses_2025-03-05 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3266.05s
Shutdown handler de-registered
function_ruses_2025-03-05 status is now inactive due to auto deactivation removed underperforming models
function_ruses_2025-03-05 status is now torndown due to DeploymentManager action
ChatRequest
Generation Params
Prompt Formatter
Chat History
ChatMessage 1