developer_uid: rirv938
submission_id: function_damif_2025-06-12
model_name: dpo_data_collection
model_group:
status: torndown
timestamp: 2025-06-12T06:35:50+00:00
num_battles: 8118
num_wins: 4311
celo_rating: 1299.7
family_friendly_score: 0.5398000000000001
family_friendly_standard_error: 0.0070486305052825686
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-06-11
win_ratio: 0.5310421286031042
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['</s>', '\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 8.5444815158844s
Received healthy response to inference request in 3.755376100540161s
Received healthy response to inference request in 3.8488047122955322s
Received healthy response to inference request in 3.0632622241973877s
Received healthy response to inference request in 3.4521069526672363s
5 requests
0 failed requests
5th percentile: 3.1410311698913573
10th percentile: 3.218800115585327
20th percentile: 3.3743380069732667
30th percentile: 3.512760782241821
40th percentile: 3.6340684413909914
50th percentile: 3.755376100540161
60th percentile: 3.7927475452423094
70th percentile: 3.830118989944458
80th percentile: 4.787940073013306
90th percentile: 6.666210794448853
95th percentile: 7.605346155166625
99th percentile: 8.356654443740844
mean time: 4.532806301116944
%s, retrying in %s seconds...
Received healthy response to inference request in 4.952945947647095s
Received healthy response to inference request in 2.907003879547119s
Received healthy response to inference request in 4.254527568817139s
Received healthy response to inference request in 3.199371814727783s
Received healthy response to inference request in 2.8661322593688965s
5 requests
0 failed requests
5th percentile: 2.874306583404541
10th percentile: 2.8824809074401854
20th percentile: 2.8988295555114747
30th percentile: 2.965477466583252
40th percentile: 3.0824246406555176
50th percentile: 3.199371814727783
60th percentile: 3.6214341163635253
70th percentile: 4.043496417999267
80th percentile: 4.39421124458313
90th percentile: 4.673578596115112
95th percentile: 4.813262271881103
99th percentile: 4.9250092124938964
mean time: 3.6359962940216066
%s, retrying in %s seconds...
Received healthy response to inference request in 3.27957820892334s
Received healthy response to inference request in 3.256108045578003s
Received healthy response to inference request in 3.6509156227111816s
Received healthy response to inference request in 1.9083645343780518s
Received healthy response to inference request in 4.84870719909668s
5 requests
0 failed requests
5th percentile: 2.177913236618042
10th percentile: 2.4474619388580323
20th percentile: 2.986559343338013
30th percentile: 3.26080207824707
40th percentile: 3.270190143585205
50th percentile: 3.27957820892334
60th percentile: 3.4281131744384767
70th percentile: 3.576648139953613
80th percentile: 3.8904739379882813
90th percentile: 4.36959056854248
95th percentile: 4.6091488838195795
99th percentile: 4.800795536041259
mean time: 3.388734722137451
Pipeline stage StressChecker completed in 61.27s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.68s
Shutdown handler de-registered
function_damif_2025-06-12 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3526.16s
Shutdown handler de-registered
function_damif_2025-06-12 status is now inactive due to auto deactivation removed underperforming models
function_damif_2025-06-12 status is now protected due to ABTestQueueItem
function_damif_2025-06-12 status is now inactive due to ABTestQueueItem
function_damif_2025-06-12 status is now torndown due to DeploymentManager action