developer_uid: rirv938
submission_id: function_genen_2026-01-12
model_name: dpo_data_collection
model_group:
status: torndown
timestamp: 2026-02-28T19:06:32+00:00
num_battles: 0
num_wins: 0
family_friendly_score: 0.5124
family_friendly_standard_error: 0.007068892982638794
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-02-25
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 512, 'best_of': 8, 'max_output_tokens': 64}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 5.492301940917969s
Received healthy response to inference request in 5.3622355461120605s
Received healthy response to inference request in 4.306252479553223s
Received healthy response to inference request in 4.87659764289856s
Received healthy response to inference request in 5.84809947013855s
Received healthy response to inference request in 4.987878084182739s
Received healthy response to inference request in 5.536242723464966s
Received healthy response to inference request in 5.1776206493377686s
Received healthy response to inference request in 7.274480104446411s
Received healthy response to inference request in 6.39578652381897s
10 requests
0 failed requests
5th percentile: 4.562907803058624
10th percentile: 4.8195631265640255
20th percentile: 4.965621995925903
30th percentile: 5.12069787979126
40th percentile: 5.288389587402344
50th percentile: 5.427268743515015
60th percentile: 5.509878253936767
70th percentile: 5.629799747467041
80th percentile: 5.957636880874634
90th percentile: 6.483655881881713
95th percentile: 6.879067993164061
99th percentile: 7.195397682189942
mean time: 5.525749516487122
Pipeline stage StressChecker completed in 56.61s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.72s
Shutdown handler de-registered
function_genen_2026-01-12 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 5552.58s
Shutdown handler de-registered
function_genen_2026-01-12 status is now inactive due to system request
function_genen_2026-01-12 status is now torndown due to DeploymentManager action