developer_uid: rirv938
submission_id: function_noron_2026-01-05
model_name: dpo_data_collection
model_group:
status: protected
timestamp: 2026-01-05T22:14:01+00:00
num_battles: 8968
num_wins: 4560
celo_rating: 1302.11
family_friendly_score: 0.5085999999999999
family_friendly_standard_error: 0.0070700217821446625
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-01-05
win_ratio: 0.5084745762711864
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 512, 'best_of': 8, 'max_output_tokens': 64}
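The derived fields above can be cross-checked from the raw counts. The sketch below recomputes win_ratio from num_wins and num_battles, and shows that family_friendly_standard_error is consistent with a binomial standard error. The sample size of 5000 is an assumption inferred from the reported numbers; it is not stated anywhere in this log.

```python
import math

# Values copied from the metadata above.
num_battles = 8968
num_wins = 4560
family_friendly_score = 0.5085999999999999

# win_ratio is simply wins divided by battles.
win_ratio = num_wins / num_battles

# Assumption: the score is a proportion over n_samples scored items.
# n_samples = 5000 is inferred, not stated in the log.
n_samples = 5000
family_friendly_se = math.sqrt(
    family_friendly_score * (1.0 - family_friendly_score) / n_samples
)

print(f"win_ratio={win_ratio:.6f} family_friendly_se={family_friendly_se:.6f}")
```

If the true sample size differs, the standard error formula still applies with that n substituted.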
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.620891809463501s
Received healthy response to inference request in 6.452365875244141s
Received healthy response to inference request in 5.81593918800354s
Received healthy response to inference request in 3.283076763153076s
Received healthy response to inference request in 3.7447688579559326s
Received healthy response to inference request in 3.618241786956787s
Received healthy response to inference request in 4.697661638259888s
Received healthy response to inference request in 3.970348358154297s
Received healthy response to inference request in 2.992058038711548s
Received healthy response to inference request in 6.400217056274414s
10 requests
0 failed requests
5th percentile: 3.1230164647102354
10th percentile: 3.2539748907089234
20th percentile: 3.5512087821960447
30th percentile: 3.706810736656189
40th percentile: 3.8801165580749513
50th percentile: 4.295620083808899
60th percentile: 4.651599740982055
70th percentile: 5.033144903182984
80th percentile: 5.932794761657715
90th percentile: 6.405431938171387
95th percentile: 6.428898906707763
99th percentile: 6.447672481536865
mean time: 4.559556937217712
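The percentile and mean figures above can be reproduced from the ten logged response times using linear interpolation between order statistics. This is a minimal sketch; the helper name `percentile` is hypothetical, and the log does not say which library StressChecker actually uses.

```python
# Response times (seconds) copied from the StressChecker lines above.
latencies = [
    4.620891809463501, 6.452365875244141, 5.81593918800354,
    3.283076763153076, 3.7447688579559326, 3.618241786956787,
    4.697661638259888, 3.970348358154297, 2.992058038711548,
    6.400217056274414,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    # Fractional position of the percentile within the sorted sample.
    pos = (len(ordered) - 1) * q / 100.0
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    frac = pos - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


mean_time = sum(latencies) / len(latencies)

for q in (5, 50, 90, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only ten samples, the high percentiles (95th, 99th) are interpolated between the two slowest requests, so they say little about tail latency.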
Pipeline stage StressChecker completed in 47.11s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.57s
Shutdown handler de-registered
function_noron_2026-01-05 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2730.51s
Shutdown handler de-registered
function_noron_2026-01-05 status is now protected due to ABTestQueueItem