developer_uid: rirv938
submission_id: function_nokuk_2025-12-29
model_name: dpo_data_collection
model_group:
status: torndown
timestamp: 2026-01-01T22:26:44+00:00
num_battles: 6521
num_wins: 3387
celo_rating: 1314.26
family_friendly_score: 0.5338
family_friendly_standard_error: 0.0070548927702694395
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-29
win_ratio: 0.5193988652047232
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['</s>', '\n'], 'max_input_tokens': 512, 'best_of': 8, 'max_output_tokens': 64}
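The derived fields in the metadata above can be cross-checked with a minimal sketch. It recomputes win_ratio from num_wins and num_battles, and tests whether family_friendly_standard_error is consistent with a binomial standard error. The sample count of 5000 is an assumption inferred from the reported values and does not appear in the log; the variable names are illustrative only.

```python
import math

# Values copied from the metadata above.
num_battles = 6521
num_wins = 3387
family_friendly_score = 0.5338

# win_ratio is the fraction of battles won.
win_ratio = num_wins / num_battles

# ASSUMPTION: 5000 scored samples. This count is not stated in the log;
# it is chosen because it reproduces the reported standard error.
assumed_samples = 5000
standard_error = math.sqrt(
    family_friendly_score * (1 - family_friendly_score) / assumed_samples
)

print(f"win_ratio: {win_ratio}")
print(f"binomial standard error (n={assumed_samples}): {standard_error}")
```

If the printed standard error differs from the logged one, the binomial assumption or the assumed sample count does not hold for that submission.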
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.616321563720703s
Received healthy response to inference request in 4.393440008163452s
Received healthy response to inference request in 3.1482036113739014s
Received healthy response to inference request in 2.4359958171844482s
Received healthy response to inference request in 2.5393142700195312s
Received healthy response to inference request in 4.536928176879883s
Received healthy response to inference request in 3.44439435005188s
Received healthy response to inference request in 2.015437126159668s
Received healthy response to inference request in 3.4129018783569336s
Received healthy response to inference request in 3.8348071575164795s
10 requests
0 failed requests
5th percentile: 2.204688537120819
10th percentile: 2.3939399480819703
20th percentile: 2.5186505794525145
30th percentile: 2.96553680896759
40th percentile: 3.3070225715637207
50th percentile: 3.4286481142044067
60th percentile: 3.6005594730377197
70th percentile: 4.0023970127105715
80th percentile: 4.422137641906739
90th percentile: 4.544867515563965
95th percentile: 4.580594539642334
99th percentile: 4.609176158905029
mean time: 3.437774395942688
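The StressChecker summary above appears to match percentiles computed with linear interpolation between order statistics. The following sketch recomputes them from the ten logged response times; the percentile helper is hypothetical and is not taken from the pipeline's own code.

```python
import math

# Response times in seconds, copied from the ten log lines above.
latencies = [
    4.616321563720703, 4.393440008163452, 3.1482036113739014,
    2.4359958171844482, 2.5393142700195312, 4.536928176879883,
    3.44439435005188, 2.015437126159668, 3.4129018783569336,
    3.8348071575164795,
]


def percentile(values, q):
    """Return the q-th percentile using linear interpolation.

    ASSUMPTION: the pipeline interpolates linearly between the two
    nearest sorted values; the log does not state the method.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100  # fractional index into ordered
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")

print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only 10 requests, the upper percentiles are interpolated between the two slowest responses, so they say little about tail latency under real load.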
Pipeline stage StressChecker completed in 35.99s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_nokuk_2025-12-29 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2871.21s
Shutdown handler de-registered
function_nokuk_2025-12-29 status is now torndown due to DeploymentManager action