developer_uid: rirv938
submission_id: function_norek_2026-01-05
model_name: dpo_data_collection
model_group:
status: protected
timestamp: 2026-01-05T22:13:50+00:00
num_battles: 9056
num_wins: 4670
celo_rating: 1308.47
family_friendly_score: 0.5234
family_friendly_standard_error: 0.007063319899310805
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-01-05
win_ratio: 0.5156802120141343
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 512, 'best_of': 8, 'max_output_tokens': 64}
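The derived fields in the metadata above can be cross-checked from the raw counts. The sketch below recomputes `win_ratio` from `num_wins` and `num_battles`. It also shows that `family_friendly_standard_error` is consistent with a binomial standard error, but only under an assumed sample size of 5000; that sample size does not appear anywhere in the log and is purely an assumption, as is the choice of the binomial formula.

```python
import math

# Values copied from the submission metadata.
num_battles = 9056
num_wins = 4670
family_friendly_score = 0.5234

# win_ratio is the fraction of battles won.
win_ratio = num_wins / num_battles

# ASSUMPTION: the log does not report how many samples were scored.
# 5000 is a hypothetical value chosen because it makes the binomial
# standard error sqrt(p * (1 - p) / n) line up with the reported figure.
assumed_samples = 5000
standard_error = math.sqrt(
    family_friendly_score * (1.0 - family_friendly_score) / assumed_samples
)

print(f"win_ratio = {win_ratio}")
print(f"standard_error (assuming n={assumed_samples}) = {standard_error}")
```

If the real sample count differs, the standard error line will not match, which is a quick way to test the assumption.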
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.857495069503784s
Received healthy response to inference request in 3.5986032485961914s
Received healthy response to inference request in 5.11647367477417s
Received healthy response to inference request in 4.217697858810425s
Received healthy response to inference request in 5.636442422866821s
Received healthy response to inference request in 2.9517593383789062s
Received healthy response to inference request in 3.80100679397583s
Received healthy response to inference request in 3.6791515350341797s
Received healthy response to inference request in 1.6281719207763672s
Received healthy response to inference request in 3.8503918647766113s
10 requests
0 failed requests
5th percentile: 2.2237862586975097
10th percentile: 2.819400596618652
20th percentile: 3.4692344665527344
30th percentile: 3.6549870491027834
40th percentile: 3.75226469039917
50th percentile: 3.8256993293762207
60th percentile: 3.8532331466674803
70th percentile: 3.9655559062957764
80th percentile: 4.397453022003174
90th percentile: 5.168470549583435
95th percentile: 5.402456486225128
99th percentile: 5.589645235538483
mean time: 3.8337193727493286
Pipeline stage StressChecker completed in 39.64s
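The StressChecker percentiles and mean above can be reproduced from the ten logged response times. The sketch below uses linear interpolation between the closest ranks. The log does not say which routine computed the figures, so the interpolation method is an assumption, and the helper name `percentile` is hypothetical.

```python
import math

# Response times in seconds, copied from the StressChecker log lines.
latencies = [
    3.857495069503784,
    3.5986032485961914,
    5.11647367477417,
    4.217697858810425,
    5.636442422866821,
    2.9517593383789062,
    3.80100679397583,
    3.6791515350341797,
    1.6281719207763672,
    3.8503918647766113,
]


def percentile(values, pct):
    """Return the pct-th percentile using linear interpolation (assumed method)."""
    ordered = sorted(values)
    # Fractional position of the requested percentile in the sorted list.
    rank = (len(ordered) - 1) * pct / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for pct in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{pct}th percentile: {percentile(latencies, pct)}")
print(f"mean time: {mean_time}")
```

With only ten samples, the high percentiles are interpolated almost entirely between the two slowest requests, so the 95th and 99th values say little about tail latency under sustained load.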
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_norek_2026-01-05 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2767.60s
Shutdown handler de-registered
function_norek_2026-01-05 status is now protected due to ABTestQueueItem