developer_uid: rirv938
submission_id: function_retut_2026-01-05
model_name: dpo_data_collection
model_group:
status: inactive
timestamp: 2026-01-05T19:43:15+00:00
num_battles: 12217
num_wins: 6314
celo_rating: 1308.14
family_friendly_score: 0.5214
family_friendly_standard_error: 0.007064588310722713
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-01-05
win_ratio: 0.5168208234427437
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['</s>', '\n'], 'max_input_tokens': 512, 'best_of': 8, 'max_output_tokens': 64}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 5.091272592544556s
Received healthy response to inference request in 4.810091972351074s
Received healthy response to inference request in 3.362736463546753s
Received healthy response to inference request in 3.856189250946045s
Received healthy response to inference request in 4.522874116897583s
Received healthy response to inference request in 3.741581916809082s
Received healthy response to inference request in 3.1441726684570312s
Received healthy response to inference request in 2.9375550746917725s
Received healthy response to inference request in 2.8467702865600586s
Received healthy response to inference request in 4.002380609512329s
10 requests
0 failed requests
5th percentile: 2.88762344121933
10th percentile: 2.928476595878601
20th percentile: 3.1028491497039794
30th percentile: 3.2971673250198363
40th percentile: 3.5900437355041506
50th percentile: 3.7988855838775635
60th percentile: 3.9146657943725587
70th percentile: 4.158528661727905
80th percentile: 4.580317687988281
90th percentile: 4.838210034370422
95th percentile: 4.9647413134574885
99th percentile: 5.065966336727143
mean time: 3.8315624952316285
Pipeline stage StressChecker completed in 46.80s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_retut_2026-01-05 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2903.16s
Shutdown handler de-registered