function_tujom_2025-06-25

developer_uid: rirv938

submission_id: function_tujom_2025-06-25

model_name: dpo_data_collection

model_group:

status: torndown

timestamp: 2025-06-25T03:18:42+00:00

num_battles: 6539

num_wins: 3333

celo_rating: 1284.93

family_friendly_score: 0.54

family_friendly_standard_error: 0.0070484040746824385
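The reported standard error is numerically consistent with a binomial standard error, sqrt(p * (1 - p) / n), evaluated at the score above. The sketch below checks this under the assumption that the score is a proportion over n = 5000 scored samples. That sample count is inferred from the two reported numbers and is not stated anywhere in this record, and the function name is illustrative.

```python
import math


def binomial_standard_error(p: float, n: int) -> float:
    """Standard error of a proportion p estimated from n independent samples."""
    return math.sqrt(p * (1.0 - p) / n)


score = 0.54                            # family_friendly_score from the record
reported_se = 0.0070484040746824385     # family_friendly_standard_error from the record
n_assumed = 5000                        # ASSUMPTION: inferred, not given in the record

se = binomial_standard_error(score, n_assumed)
print(se, abs(se - reported_se))
```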

submission_type: function

display_name: dpo_data_collection

is_internal_developer: True

ranking_group: single

us_pacific_date: 2025-06-24

win_ratio: 0.5097109649793546
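The win ratio appears to be num_wins divided by num_battles. A minimal check using only the values reported above:

```python
num_battles = 6539   # from the record
num_wins = 3333      # from the record

# Fraction of battles won; compare with the reported win_ratio field.
win_ratio = num_wins / num_battles
print(win_ratio)
```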

generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}

formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
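To illustrate how the formatter templates could combine into a single model input, here is a minimal sketch. The assembly order (memory, prompt, conversation turns, then the response prefix) is an assumption, as are the `render_prompt` helper and all example values. Token-level truncation (`max_input_tokens`, `truncate_by_message`) is not modelled.

```python
formatter = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def render_prompt(formatter, bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, speaker being "bot" or "user".
    """
    parts = [
        formatter["memory_template"].format(bot_name=bot_name, memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(formatter["bot_template"].format(bot_name=bot_name, message=message))
        else:
            parts.append(formatter["user_template"].format(user_name=user_name, message=message))
    # The response prefix has no trailing newline, so the model continues the bot's line.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Example values below are invented for illustration only.
text = render_prompt(
    formatter,
    bot_name="Ava",
    user_name="Sam",
    memory="A friendly librarian.",
    prompt="Sam walks into the library.",
    turns=[("user", "Hi!"), ("bot", "Welcome in."), ("user", "Any book tips?")],
)
print(text)
```

Because '\n' is among the stopping words in generation_params, generation under this layout would end at the close of the bot's single reply line.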

Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.6354665756225586s
Received healthy response to inference request in 2.1914517879486084s
Received healthy response to inference request in 4.0489795207977295s
Received healthy response to inference request in 4.221809387207031s
Received healthy response to inference request in 1.933530330657959s
5 requests
0 failed requests
5th percentile: 1.985114622116089
10th percentile: 2.036698913574219
20th percentile: 2.1398674964904787
30th percentile: 2.4802547454833985
40th percentile: 3.057860660552979
50th percentile: 3.6354665756225586
60th percentile: 3.800871753692627
70th percentile: 3.966276931762695
80th percentile: 4.08354549407959
90th percentile: 4.152677440643311
95th percentile: 4.187243413925171
99th percentile: 4.214896192550659
mean time: 3.2062475204467775
%s, retrying in %s seconds...
Received healthy response to inference request in 3.4170265197753906s
Received healthy response to inference request in 3.960486888885498s
Received healthy response to inference request in 3.5445752143859863s
Received healthy response to inference request in 3.3355166912078857s
Received healthy response to inference request in 3.6058802604675293s
5 requests
0 failed requests
5th percentile: 3.351818656921387
10th percentile: 3.368120622634888
20th percentile: 3.4007245540618896
30th percentile: 3.4425362586975097
40th percentile: 3.4935557365417482
50th percentile: 3.5445752143859863
60th percentile: 3.5690972328186037
70th percentile: 3.5936192512512206
80th percentile: 3.676801586151123
90th percentile: 3.8186442375183107
95th percentile: 3.8895655632019044
99th percentile: 3.9463026237487795
mean time: 3.572697114944458
Pipeline stage StressChecker completed in 36.42s
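The percentile and mean figures logged for the first batch of five requests are consistent with linear interpolation between sorted samples. The sketch below reproduces them from the logged latencies. Whether the pipeline computes them this way is an assumption, and the `percentile` helper is illustrative.

```python
def percentile(values, q):
    """q-th percentile (0-100) using linear interpolation between sorted samples."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0   # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


# Latencies in seconds, copied from the first batch of healthy responses in the log.
latencies = [
    3.6354665756225586,
    2.1914517879486084,
    4.0489795207977295,
    4.221809387207031,
    1.933530330657959,
]

p5 = percentile(latencies, 5)
p50 = percentile(latencies, 50)
p90 = percentile(latencies, 90)
mean_time = sum(latencies) / len(latencies)
print(p5, p50, p90, mean_time)
```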
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.69s
Shutdown handler de-registered
function_tujom_2025-06-25 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3743.80s
Shutdown handler de-registered
function_tujom_2025-06-25 status is now inactive due to auto deactivation removed underperforming models
function_tujom_2025-06-25 status is now protected due to ABTestQueueItem
function_tujom_2025-06-25 status is now inactive
function_tujom_2025-06-25 status is now torndown due to DeploymentManager action