developer_uid: rirv938
submission_id: function_kurot_2025-04-20
model_name: dpo_data_collection
model_group:
status: torndown
timestamp: 2025-04-20T02:10:35+00:00
num_battles: 6546
num_wins: 3333
celo_rating: 1295.31
family_friendly_score: 0.5706
family_friendly_standard_error: 0.007000223425005805
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-04-19
win_ratio: 0.5091659028414299
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
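The formatter entry above only lists templates. The log does not show how they are combined, so the sketch below assumes one plausible order: persona memory, scenario prompt, conversation turns, then the response prefix. The function `build_prompt`, the `turns` structure, and the sample names are hypothetical illustrations, not part of the submission. Truncation to `max_input_tokens` is not modelled.

```python
# Templates copied from the `formatter` entry above.
formatter = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Assemble a model input from the templates.

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The assembly order is an assumption.
    """
    parts = [
        formatter["memory_template"].format(bot_name=bot_name, memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response prefix has no trailing newline, so the model continues the line.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Hypothetical sample values, for illustration only.
example = build_prompt(
    bot_name="Ava",
    user_name="Sam",
    memory="A friendly guide.",
    prompt="Ava greets Sam.",
    turns=[("user", "Hi"), ("bot", "Hello!")],
)
print(example)
```

Because `stopping_words` includes `'\n'`, generation under this layout would stop at the end of the bot's single line, which is consistent with the short `max_output_tokens` of 64.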
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.313410758972168s
Received healthy response to inference request in 3.4222445487976074s
Received healthy response to inference request in 3.0175046920776367s
Received healthy response to inference request in 2.714444160461426s
Received healthy response to inference request in 2.0157132148742676s
5 requests
0 failed requests
5th percentile: 2.155459403991699
10th percentile: 2.295205593109131
20th percentile: 2.574697971343994
30th percentile: 2.775056266784668
40th percentile: 2.8962804794311525
50th percentile: 3.0175046920776367
60th percentile: 3.135867118835449
70th percentile: 3.2542295455932617
80th percentile: 3.335177516937256
90th percentile: 3.3787110328674315
95th percentile: 3.4004777908325194
99th percentile: 3.41789119720459
mean time: 2.896663475036621
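The percentile and mean figures above can be reproduced from the five logged response times using linear interpolation between sorted samples. The following is a minimal sketch of that calculation. The helper name `percentile` is an assumption for illustration. The log does not say which library StressChecker uses.

```python
import math

# Response times in seconds, copied from the five
# "Received healthy response" lines above.
latencies = [
    3.313410758972168,
    3.4222445487976074,
    3.0175046920776367,
    2.714444160461426,
    2.0157132148742676,
]


def percentile(samples, q):
    """Return the q-th percentile (0 <= q <= 100) with linear interpolation."""
    ordered = sorted(samples)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


for q in (5, 10, 50, 90, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples, the high percentiles are interpolated between the two slowest requests, so they indicate little about tail latency under real load.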
Pipeline stage StressChecker completed in 15.59s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.02s
Shutdown handler de-registered
function_kurot_2025-04-20 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4112.23s
Shutdown handler de-registered
function_kurot_2025-04-20 status is now inactive due to auto deactivation removed underperforming models
function_kurot_2025-04-20 status is now torndown due to DeploymentManager action