developer_uid: rirv938
submission_id: function_himur_2026-01-05
model_name: dpo_data_collection
model_group:
status: inactive
timestamp: 2026-01-05T19:23:21+00:00
num_battles: 22291
num_wins: 11037
celo_rating: 1300.74
family_friendly_score: 0.5166
family_friendly_standard_error: 0.007067169730521547
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-01-05
win_ratio: 0.4951325647122157
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 512, 'best_of': 8, 'max_output_tokens': 64}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 5.863984107971191s
Received healthy response to inference request in 3.6717965602874756s
Received healthy response to inference request in 5.713897228240967s
Received healthy response to inference request in 4.801391124725342s
Received healthy response to inference request in 5.538990259170532s
Received healthy response to inference request in 4.06536865234375s
Received healthy response to inference request in 3.126847982406616s
Received healthy response to inference request in 3.676280975341797s
Received healthy response to inference request in 2.856069326400757s
Received healthy response to inference request in 2.5857772827148438s
10 requests
0 failed requests
5th percentile: 2.7074087023735047
10th percentile: 2.8290401220321657
20th percentile: 3.0726922512054444
30th percentile: 3.5083119869232178
40th percentile: 3.6744872093200684
50th percentile: 3.8708248138427734
60th percentile: 4.359777641296386
70th percentile: 5.022670865058899
80th percentile: 5.5739716529846195
90th percentile: 5.728905916213989
95th percentile: 5.79644501209259
99th percentile: 5.8504762887954715
mean time: 4.190040349960327
Pipeline stage StressChecker completed in 43.66s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.63s
Shutdown handler de-registered
function_himur_2026-01-05 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2894.65s
Shutdown handler de-registered