developer_uid: rirv938
submission_id: function_kihir_2026-01-05
model_name: dpo_data_collection
model_group:
status: protected
timestamp: 2026-01-05T22:13:40+00:00
num_battles: 9771
num_wins: 5089
celo_rating: 1311.37
family_friendly_score: 0.5346
family_friendly_standard_error: 0.007054117095710844
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-01-05
win_ratio: 0.5208269368539555
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['</s>', '\n'], 'max_input_tokens': 512, 'best_of': 8, 'max_output_tokens': 64}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.823514938354492s
Received healthy response to inference request in 3.4116225242614746s
Received healthy response to inference request in 5.7171711921691895s
Received healthy response to inference request in 4.574156761169434s
Received healthy response to inference request in 3.9612956047058105s
Received healthy response to inference request in 4.5246803760528564s
Received healthy response to inference request in 3.370941638946533s
Received healthy response to inference request in 4.317189931869507s
Received healthy response to inference request in 5.371129751205444s
Received healthy response to inference request in 2.431945323944092s
10 requests
0 failed requests
5th percentile: 2.608151650428772
10th percentile: 2.7843579769134523
20th percentile: 3.261456298828125
30th percentile: 3.399418258666992
40th percentile: 3.741426372528076
50th percentile: 4.139242768287659
60th percentile: 4.4001861095428465
70th percentile: 4.539523291587829
80th percentile: 4.733551359176636
90th percentile: 5.405733895301819
95th percentile: 5.561452543735504
99th percentile: 5.6860274624824525
mean time: 4.0503648042678835
Pipeline stage StressChecker completed in 41.78s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.56s
Shutdown handler de-registered
function_kihir_2026-01-05 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2801.96s
Shutdown handler de-registered
function_kihir_2026-01-05 status is now protected due to ABTestQueueItem