developer_uid: rirv938
submission_id: function_tujom_2025-06-25
model_name: dpo_data_collection
model_group:
status: torndown
timestamp: 2025-06-25T03:18:42+00:00
num_battles: 6539
num_wins: 3333
celo_rating: 1284.93
family_friendly_score: 0.54
family_friendly_standard_error: 0.0070484040746824385
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-06-24
win_ratio: 0.5097109649793546
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.6354665756225586s
Received healthy response to inference request in 2.1914517879486084s
Received healthy response to inference request in 4.0489795207977295s
Received healthy response to inference request in 4.221809387207031s
Received healthy response to inference request in 1.933530330657959s
5 requests
0 failed requests
5th percentile: 1.985114622116089
10th percentile: 2.036698913574219
20th percentile: 2.1398674964904787
30th percentile: 2.4802547454833985
40th percentile: 3.057860660552979
50th percentile: 3.6354665756225586
60th percentile: 3.800871753692627
70th percentile: 3.966276931762695
80th percentile: 4.08354549407959
90th percentile: 4.152677440643311
95th percentile: 4.187243413925171
99th percentile: 4.214896192550659
mean time: 3.2062475204467775
%s, retrying in %s seconds...
Received healthy response to inference request in 3.4170265197753906s
Received healthy response to inference request in 3.960486888885498s
Received healthy response to inference request in 3.5445752143859863s
Received healthy response to inference request in 3.3355166912078857s
Received healthy response to inference request in 3.6058802604675293s
5 requests
0 failed requests
5th percentile: 3.351818656921387
10th percentile: 3.368120622634888
20th percentile: 3.4007245540618896
30th percentile: 3.4425362586975097
40th percentile: 3.4935557365417482
50th percentile: 3.5445752143859863
60th percentile: 3.5690972328186037
70th percentile: 3.5936192512512206
80th percentile: 3.676801586151123
90th percentile: 3.8186442375183107
95th percentile: 3.8895655632019044
99th percentile: 3.9463026237487795
mean time: 3.572697114944458
Pipeline stage StressChecker completed in 36.42s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.69s
Shutdown handler de-registered
function_tujom_2025-06-25 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3743.80s
Shutdown handler de-registered
function_tujom_2025-06-25 status is now inactive due to auto deactivation removed underperforming models
function_tujom_2025-06-25 status is now protected due to ABTestQueueItem
function_tujom_2025-06-25 status is now inactive
function_tujom_2025-06-25 status is now torndown due to DeploymentManager action