developer_uid: rirv938
submission_id: function_gudem_2026-01-01
model_name: dpo_data_collection
model_group:
status: torndown
timestamp: 2026-01-04T01:11:43+00:00
num_battles: 7452
num_wins: 3825
celo_rating: 1296.3
family_friendly_score: 0.5329999999999999
family_friendly_standard_error: 0.007055650218087628
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-31
win_ratio: 0.5132850241545893
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 512, 'best_of': 8, 'max_output_tokens': 64}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.150041341781616s
Received healthy response to inference request in 2.2028369903564453s
Received healthy response to inference request in 2.4904730319976807s
Received healthy response to inference request in 3.973867654800415s
Received healthy response to inference request in 4.234087705612183s
Received healthy response to inference request in 2.786884307861328s
Received healthy response to inference request in 2.8118951320648193s
Received healthy response to inference request in 1.6706135272979736s
Received healthy response to inference request in 1.7131094932556152s
Received healthy response to inference request in 5.997286796569824s
10 requests
0 failed requests
5th percentile: 1.6897367119789124
10th percentile: 1.7088598966598512
20th percentile: 2.1048914909362795
30th percentile: 2.40418221950531
40th percentile: 2.6683197975158692
50th percentile: 2.7993897199630737
60th percentile: 2.947153615951538
70th percentile: 3.3971892356872555
80th percentile: 4.025911664962768
90th percentile: 4.410407614707946
95th percentile: 5.203847205638883
99th percentile: 5.838598878383637
mean time: 3.10310959815979
Pipeline stage StressChecker completed in 32.64s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.68s
Shutdown handler de-registered
function_gudem_2026-01-01 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2245.28s
Shutdown handler de-registered
function_gudem_2026-01-01 status is now torndown due to DeploymentManager action