developer_uid: rirv938
submission_id: function_boren_2025-06-08
model_name: dpo_data_collection
model_group:
status: torndown
timestamp: 2025-06-08T18:03:31+00:00
num_battles: 12020
num_wins: 6681
celo_rating: 1314.22
family_friendly_score: 0.52
family_friendly_standard_error: 0.007065408693062277
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-06-08
win_ratio: 0.5558236272878536
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['</s>', '\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.401621341705322s
Received healthy response to inference request in 3.0632901191711426s
Received healthy response to inference request in 5.0390496253967285s
Received healthy response to inference request in 4.063369512557983s
Received healthy response to inference request in 4.742433786392212s
5 requests
0 failed requests
5th percentile: 3.263305997848511
10th percentile: 3.463321876525879
20th percentile: 3.863353633880615
30th percentile: 4.131019878387451
40th percentile: 4.266320610046387
50th percentile: 4.401621341705322
60th percentile: 4.537946319580078
70th percentile: 4.674271297454834
80th percentile: 4.801756954193115
90th percentile: 4.920403289794922
95th percentile: 4.9797264575958256
99th percentile: 5.027184991836548
mean time: 4.261952877044678
%s, retrying in %s seconds...
Received healthy response to inference request in 5.32720947265625s
Received healthy response to inference request in 4.485281944274902s
Received healthy response to inference request in 3.477620840072632s
Received healthy response to inference request in 5.690863609313965s
Received healthy response to inference request in 3.26292085647583s
5 requests
0 failed requests
5th percentile: 3.3058608531951905
10th percentile: 3.348800849914551
20th percentile: 3.4346808433532714
30th percentile: 3.679153060913086
40th percentile: 4.082217502593994
50th percentile: 4.485281944274902
60th percentile: 4.822052955627441
70th percentile: 5.1588239669799805
80th percentile: 5.399940299987793
90th percentile: 5.545401954650879
95th percentile: 5.618132781982422
99th percentile: 5.6763174438476565
mean time: 4.448779344558716
%s, retrying in %s seconds...
Received healthy response to inference request in 4.580328702926636s
Received healthy response to inference request in 3.0476090908050537s
Received healthy response to inference request in 3.6888983249664307s
Received healthy response to inference request in 3.5904712677001953s
Received healthy response to inference request in 3.674739360809326s
5 requests
0 failed requests
5th percentile: 3.156181526184082
10th percentile: 3.2647539615631103
20th percentile: 3.481898832321167
30th percentile: 3.6073248863220213
40th percentile: 3.6410321235656737
50th percentile: 3.674739360809326
60th percentile: 3.680402946472168
70th percentile: 3.6860665321350097
80th percentile: 3.8671844005584717
90th percentile: 4.223756551742554
95th percentile: 4.402042627334595
99th percentile: 4.544671487808228
mean time: 3.7164093494415282
Pipeline stage StressChecker completed in 65.54s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.64s
Shutdown handler de-registered
function_boren_2025-06-08 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4365.97s
Shutdown handler de-registered
function_boren_2025-06-08 status is now inactive due to auto deactivation removed underperforming models
function_boren_2025-06-08 status is now torndown due to DeploymentManager action