developer_uid: rirv938
submission_id: function_dakul_2026-01-11
model_name: dpo_data_collection
model_group:
status: torndown
timestamp: 2026-01-14T16:59:50+00:00
num_battles: 11434
num_wins: 5845
celo_rating: 9999.0
family_friendly_score: 0.4918
family_friendly_standard_error: 0.007070116830717863
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-01-10
win_ratio: 0.5111946825258002
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.9063165187835693s
Received healthy response to inference request in 2.746021032333374s
Received healthy response to inference request in 2.9874494075775146s
Received healthy response to inference request in 2.6397173404693604s
Received healthy response to inference request in 2.4403326511383057s
Received healthy response to inference request in 2.3055083751678467s
Received healthy response to inference request in 2.523810863494873s
Received healthy response to inference request in 3.1603355407714844s
Received healthy response to inference request in 2.180610418319702s
Received healthy response to inference request in 2.7225992679595947s
10 requests
0 failed requests
5th percentile: 2.236814498901367
10th percentile: 2.2930185794830322
20th percentile: 2.4133677959442137
30th percentile: 2.4987673997879027
40th percentile: 2.5933547496795653
50th percentile: 2.6811583042144775
60th percentile: 2.7319679737091063
70th percentile: 2.7941096782684327
80th percentile: 2.922543096542358
90th percentile: 3.0047380208969114
95th percentile: 3.082536780834198
99th percentile: 3.144775788784027
mean time: 2.6612701416015625
Pipeline stage StressChecker completed in 27.99s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.65s
Shutdown handler de-registered
function_dakul_2026-01-11 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 3019.10s
Shutdown handler de-registered
function_dakul_2026-01-11 status is now torndown due to DeploymentManager action