developer_uid: rirv938
submission_id: function_japut_2025-05-22
model_name: dpo_data_collection
model_group:
status: torndown
timestamp: 2025-05-22T03:15:26+00:00
num_battles: 7894
num_wins: 4119
celo_rating: 1295.85
family_friendly_score: 0.5449999999999999
family_friendly_standard_error: 0.0070423717595707765
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-05-21
win_ratio: 0.5217887002786927
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['</s>', '\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.470306634902954s
Received healthy response to inference request in 3.7806365489959717s
Received healthy response to inference request in 3.248671293258667s
Received healthy response to inference request in 2.870051383972168s
Received healthy response to inference request in 5.5002217292785645s
5 requests
0 failed requests
5th percentile: 2.945775365829468
10th percentile: 3.0214993476867678
20th percentile: 3.172947311401367
30th percentile: 3.2929983615875242
40th percentile: 3.381652498245239
50th percentile: 3.470306634902954
60th percentile: 3.594438600540161
70th percentile: 3.7185705661773683
80th percentile: 4.124553585052491
90th percentile: 4.8123876571655275
95th percentile: 5.1563046932220455
99th percentile: 5.431438322067261
mean time: 3.773977518081665
%s, retrying in %s seconds...
Received healthy response to inference request in 3.262483835220337s
Received healthy response to inference request in 4.386300563812256s
Received healthy response to inference request in 4.148212909698486s
Received healthy response to inference request in 5.498366832733154s
Received healthy response to inference request in 4.636662721633911s
5 requests
0 failed requests
5th percentile: 3.439629650115967
10th percentile: 3.616775465011597
20th percentile: 3.9710670948028564
30th percentile: 4.19583044052124
40th percentile: 4.291065502166748
50th percentile: 4.386300563812256
60th percentile: 4.486445426940918
70th percentile: 4.58659029006958
80th percentile: 4.80900354385376
90th percentile: 5.153685188293457
95th percentile: 5.3260260105133055
99th percentile: 5.463898668289184
mean time: 4.386405372619629
%s, retrying in %s seconds...
Received healthy response to inference request in 2.8617308139801025s
Received healthy response to inference request in 2.8280673027038574s
Received healthy response to inference request in 3.9708759784698486s
Received healthy response to inference request in 1.9890213012695312s
Received healthy response to inference request in 2.7518422603607178s
5 requests
0 failed requests
5th percentile: 2.1415854930877685
10th percentile: 2.2941496849060057
20th percentile: 2.5992780685424806
30th percentile: 2.7670872688293455
40th percentile: 2.7975772857666015
50th percentile: 2.8280673027038574
60th percentile: 2.8415327072143555
70th percentile: 2.8549981117248535
80th percentile: 3.083559846878052
90th percentile: 3.5272179126739505
95th percentile: 3.7490469455718993
99th percentile: 3.926510171890259
mean time: 2.8803075313568116
Pipeline stage StressChecker completed in 58.27s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.67s
Shutdown handler de-registered
function_japut_2025-05-22 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3517.25s
Shutdown handler de-registered
function_japut_2025-05-22 status is now inactive due to auto deactivation removed underperforming models
function_japut_2025-05-22 status is now torndown due to DeploymentManager action