developer_uid: rirv938
submission_id: function_nihem_2026-01-15
model_name: dpo_data_collection
model_group:
status: torndown
timestamp: 2026-01-18T15:18:16+00:00
num_battles: 10631
num_wins: 5566
celo_rating: 1312.29
family_friendly_score: 0.49660000000000004
family_friendly_standard_error: 0.007070904326887757
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-01-15
win_ratio: 0.5235631643307309
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['</s>', '\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.9710335731506348s
Received healthy response to inference request in 2.9371039867401123s
Received healthy response to inference request in 1.9827382564544678s
Received healthy response to inference request in 2.561828851699829s
Received healthy response to inference request in 3.250882625579834s
Received healthy response to inference request in 2.9601457118988037s
Received healthy response to inference request in 4.655827283859253s
Received healthy response to inference request in 2.197323799133301s
Received healthy response to inference request in 3.1103148460388184s
Received healthy response to inference request in 2.6419179439544678s
10 requests
0 failed requests
5th percentile: 1.9763006806373595
10th percentile: 1.9815677881240845
20th percentile: 2.154406690597534
30th percentile: 2.4524773359298706
40th percentile: 2.6098823070526125
50th percentile: 2.78951096534729
60th percentile: 2.946320676803589
70th percentile: 3.005196452140808
80th percentile: 3.1384284019470217
90th percentile: 3.3913770914077754
95th percentile: 4.023602187633513
99th percentile: 4.529382264614106
mean time: 2.826911687850952
Pipeline stage StressChecker completed in 29.93s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.87s
Shutdown handler de-registered
function_nihem_2026-01-15 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 3422.49s
Shutdown handler de-registered
function_nihem_2026-01-15 status is now inactive due to auto deactivation removed underperforming models
admin requested tearing down of function_nihem_2026-01-15
Shutdown handler not registered because Python interpreter is not running in the main thread
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
run pipeline %s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Shutdown handler de-registered
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation