developer_uid: rirv938
submission_id: function_pinun_2026-01-05
model_name: dpo_data_collection
model_group:
status: protected
timestamp: 2026-01-05T22:13:29+00:00
num_battles: 6498
num_wins: 3403
celo_rating: 1313.06
family_friendly_score: 0.4998
family_friendly_standard_error: 0.007071067246180028
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-01-05
win_ratio: 0.5236995998768852
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['</s>', '\n'], 'max_input_tokens': 512, 'best_of': 8, 'max_output_tokens': 64}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.3860371112823486s
Received healthy response to inference request in 2.941070318222046s
Received healthy response to inference request in 2.811871290206909s
Received healthy response to inference request in 2.937228202819824s
Received healthy response to inference request in 4.1775243282318115s
Received healthy response to inference request in 2.557612657546997s
Received healthy response to inference request in 3.1655194759368896s
Received healthy response to inference request in 11.276356220245361s
Received healthy response to inference request in 4.444294691085815s
Received healthy response to inference request in 2.9808995723724365s
10 requests
0 failed requests
5th percentile: 2.6720290422439574
10th percentile: 2.786445426940918
20th percentile: 2.912156820297241
30th percentile: 2.9399176836013794
40th percentile: 2.96496787071228
50th percentile: 3.073209524154663
60th percentile: 3.253726530075073
70th percentile: 3.623483276367187
80th percentile: 4.230878400802612
90th percentile: 5.1275008440017675
95th percentile: 8.201928532123558
99th percentile: 10.661470682621003
mean time: 4.067841386795044
Pipeline stage StressChecker completed in 43.20s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_pinun_2026-01-05 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2811.14s
Shutdown handler de-registered
function_pinun_2026-01-05 status is now protected due to ABTestQueueItem