developer_uid: rirv938
submission_id: function_finaf_2026-01-05
model_name: dpo_data_collection
model_group:
status: protected
timestamp: 2026-01-05T22:12:45+00:00
num_battles: 19249
num_wins: 10000
celo_rating: 1310.75
family_friendly_score: 0.5038
family_friendly_standard_error: 0.007070863596478156
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-01-05
win_ratio: 0.5195075068834745
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['</s>', '\n'], 'max_input_tokens': 512, 'best_of': 8, 'max_output_tokens': 64}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.4575488567352295s
Received healthy response to inference request in 6.268377065658569s
Received healthy response to inference request in 5.7140793800354s
Received healthy response to inference request in 4.956835746765137s
Received healthy response to inference request in 2.304838180541992s
Received healthy response to inference request in 3.6693830490112305s
Received healthy response to inference request in 3.180750846862793s
Received healthy response to inference request in 3.038022756576538s
Received healthy response to inference request in 4.973935842514038s
Received healthy response to inference request in 2.0728983879089355s
10 requests
0 failed requests
5th percentile: 2.177271294593811
10th percentile: 2.2816442012786866
20th percentile: 2.8913858413696287
30th percentile: 3.1379324197769165
40th percentile: 3.4739301681518553
50th percentile: 4.06346595287323
60th percentile: 4.657263612747192
70th percentile: 4.961965775489807
80th percentile: 5.121964550018311
90th percentile: 5.7695091485977175
95th percentile: 6.018943107128143
99th percentile: 6.218490273952484
mean time: 4.0636670112609865
Pipeline stage StressChecker completed in 41.99s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.61s
Shutdown handler de-registered
function_finaf_2026-01-05 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2998.84s
Shutdown handler de-registered
function_finaf_2026-01-05 status is now protected due to ABTestQueueItem