developer_uid: rirv938
submission_id: function_tumif_2025-04-09
model_name: dpo_data_collection
model_group:
status: torndown
timestamp: 2025-04-09T22:06:16+00:00
num_battles: 5562
num_wins: 3274
celo_rating: 1336.95
family_friendly_score: 0.5529999999999999
family_friendly_standard_error: 0.0070312303333058285
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-04-09
win_ratio: 0.5886371808701906
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['</s>', '\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.9207894802093506s
Received healthy response to inference request in 2.8916513919830322s
Received healthy response to inference request in 4.709888696670532s
Received healthy response to inference request in 2.3598010540008545s
Received healthy response to inference request in 2.8675732612609863s
5 requests
0 failed requests
5th percentile: 2.4613554954528807
10th percentile: 2.5629099369049073
20th percentile: 2.76601881980896
30th percentile: 2.8723888874053953
40th percentile: 2.882020139694214
50th percentile: 2.8916513919830322
60th percentile: 2.9033066272735595
70th percentile: 2.9149618625640867
80th percentile: 3.278609323501587
90th percentile: 3.99424901008606
95th percentile: 4.352068853378295
99th percentile: 4.638324728012085
mean time: 3.149940776824951
Pipeline stage StressChecker completed in 16.80s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.79s
Shutdown handler de-registered
function_tumif_2025-04-09 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3236.11s
Shutdown handler de-registered
function_tumif_2025-04-09 status is now inactive due to auto deactivation removed underperforming models
function_tumif_2025-04-09 status is now torndown due to DeploymentManager action