developer_uid: rirv938
submission_id: function_tigok_2026-01-24
model_name: dpo_data_collection
model_group:
status: torndown
timestamp: 2026-01-27T08:23:20+00:00
num_battles: 10843
num_wins: 5439
celo_rating: 1308.1
family_friendly_score: 0.5246
family_friendly_standard_error: 0.007062504371680062
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-01-24
win_ratio: 0.5016139444803098
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
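The derived metadata fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not the platform's own code: `win_ratio` is taken to be `num_wins / num_battles`, and the standard error is assumed to be the binomial form `sqrt(p * (1 - p) / n)`. The sample count `assumed_samples = 5000` is an inference that happens to reproduce the reported value; it is not stated anywhere in the log.

```python
import math

# Values copied from the metadata above.
num_battles = 10843
num_wins = 5439
family_friendly_score = 0.5246
reported_standard_error = 0.007062504371680062

# win_ratio is wins divided by battles.
win_ratio = num_wins / num_battles

# ASSUMPTION: the score is a proportion over n independent samples, so its
# standard error is sqrt(p * (1 - p) / n). n = 5000 is inferred, not logged.
assumed_samples = 5000
standard_error = math.sqrt(
    family_friendly_score * (1 - family_friendly_score) / assumed_samples
)

print(f"win_ratio = {win_ratio}")
print(f"standard_error = {standard_error}")
print(f"difference from reported = {abs(standard_error - reported_standard_error):.2e}")
```

Under these assumptions both computed values agree with the logged `win_ratio` and `family_friendly_standard_error` to well within floating-point noise.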
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.9080820083618164s
Received healthy response to inference request in 4.605860948562622s
Received healthy response to inference request in 4.425547122955322s
Received healthy response to inference request in 3.5705692768096924s
Received healthy response to inference request in 3.6640853881835938s
Received healthy response to inference request in 3.806751251220703s
Received healthy response to inference request in 4.245641469955444s
Received healthy response to inference request in 4.605475187301636s
Received healthy response to inference request in 4.3836305141448975s
Received healthy response to inference request in 8.565509557723999s
10 requests
0 failed requests
5th percentile: 3.2062012791633605
10th percentile: 3.5043205499649046
20th percentile: 3.6453821659088135
30th percentile: 3.7639514923095705
40th percentile: 4.0700853824615475
50th percentile: 4.314635992050171
60th percentile: 4.400397157669067
70th percentile: 4.479525542259216
80th percentile: 4.605552339553833
90th percentile: 5.001825809478758
95th percentile: 6.783667683601375
99th percentile: 8.209141182899476
mean time: 4.478115272521973
Pipeline stage StressChecker completed in 46.92s
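The StressChecker summary can be reproduced from the ten latencies it logged. The sketch below assumes the percentiles use linear interpolation between order statistics (the default behaviour of many numeric libraries); the log does not say which method the pipeline uses, and the helper `percentile` is a hypothetical name introduced here for illustration.

```python
import math

# Latencies in seconds, copied from the ten "healthy response" lines above.
latencies = [
    2.9080820083618164, 4.605860948562622, 4.425547122955322,
    3.5705692768096924, 3.6640853881835938, 3.806751251220703,
    4.245641469955444, 4.605475187301636, 4.3836305141448975,
    8.565509557723999,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    # Fractional position of the percentile within the sorted sample.
    rank = (len(ordered) - 1) * q / 100
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


mean_time = sum(latencies) / len(latencies)

for q in (5, 50, 90, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With this interpolation rule the computed figures match the logged ones. Note that the single 8.57 s response pulls the 95th and 99th percentiles well above the median of about 4.31 s, which is expected with only ten samples.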
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.02s
Shutdown handler de-registered
function_tigok_2026-01-24 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 3205.62s
Shutdown handler de-registered
function_tigok_2026-01-24 status is now inactive due to auto deactivation removed underperforming models
function_tigok_2026-01-24 status is now torndown due to DeploymentManager action
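For anyone post-processing logs like this one, stage durations and status transitions can be pulled out with two regular expressions. This is a rough sketch that assumes the line shapes seen above stay stable; the pattern names `STAGE_RE` and `STATUS_RE` and the variables `stage_seconds` and `status_history` are hypothetical and not part of any pipeline API.

```python
import re

# Lines copied verbatim from the log above.
log_lines = [
    "Pipeline stage StressChecker completed in 46.92s",
    "Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.02s",
    "function_tigok_2026-01-24 status is now deployed due to DeploymentManager action",
    "Pipeline stage OfflineFamilyFriendlyScorer completed in 3205.62s",
    "function_tigok_2026-01-24 status is now inactive due to auto deactivation removed underperforming models",
    "function_tigok_2026-01-24 status is now torndown due to DeploymentManager action",
]

# ASSUMPTION: these patterns are inferred from the lines above, not from a spec.
STAGE_RE = re.compile(r"^Pipeline stage (\w+) completed in ([\d.]+)s$")
STATUS_RE = re.compile(r"^(\S+) status is now (\w+) due to (.+)$")

stage_seconds = {}   # stage name -> duration in seconds
status_history = []  # statuses in the order they were logged

for line in log_lines:
    stage_match = STAGE_RE.match(line)
    if stage_match:
        stage_seconds[stage_match.group(1)] = float(stage_match.group(2))
        continue
    status_match = STATUS_RE.match(line)
    if status_match:
        status_history.append(status_match.group(2))

print(stage_seconds)
print(" -> ".join(status_history))
```

Run against the lines above, this shows that OfflineFamilyFriendlyScorer dominates the wall-clock time and that the submission moved from deployed to inactive to torndown.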