developer_uid: rirv938
submission_id: function_finom_2025-12-29
model_name: dpo_data_collection
model_group:
status: torndown
timestamp: 2026-01-01T22:26:43+00:00
num_battles: 7610
num_wins: 3864
celo_rating: 1298.73
family_friendly_score: 0.5296000000000001
family_friendly_standard_error: 0.007058666162951752
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-29
win_ratio: 0.5077529566360053
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 80, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 512, 'best_of': 8, 'max_output_tokens': 64}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 3.501559257507324s
Received healthy response to inference request in 3.80916690826416s
Received healthy response to inference request in 2.092904806137085s
Received healthy response to inference request in 2.626567840576172s
Received healthy response to inference request in 2.257469654083252s
Received healthy response to inference request in 1.5276060104370117s
Received healthy response to inference request in 3.4203500747680664s
Received healthy response to inference request in 3.0822432041168213s
Received healthy response to inference request in 4.841796398162842s
10 requests
1 failed requests
5th percentile: 1.7819904685020447
10th percentile: 2.0363749265670776
20th percentile: 2.2245566844940186
30th percentile: 2.515838384628296
40th percentile: 2.8999730587005614
50th percentile: 3.251296639442444
60th percentile: 3.4528337478637696
70th percentile: 3.593841552734375
80th percentile: 4.015692806243897
90th percentile: 6.367121195793146
95th percentile: 13.23108278512953
99th percentile: 18.722252056598666
mean time: 4.725470852851868
%s, retrying in %s seconds...
Received healthy response to inference request in 4.485839605331421s
Received healthy response to inference request in 4.392689943313599s
Received healthy response to inference request in 2.9316964149475098s
Received healthy response to inference request in 4.0846474170684814s
Received healthy response to inference request in 1.6405510902404785s
Received healthy response to inference request in 4.678333759307861s
Received healthy response to inference request in 3.2733876705169678s
Received healthy response to inference request in 5.663655996322632s
Received healthy response to inference request in 5.447785139083862s
Received healthy response to inference request in 4.270541667938232s
10 requests
0 failed requests
5th percentile: 2.2215664863586424
10th percentile: 2.8025818824768067
20th percentile: 3.205049419403076
30th percentile: 3.8412694931030273
40th percentile: 4.196183967590332
50th percentile: 4.3316158056259155
60th percentile: 4.429949808120727
70th percentile: 4.543587851524353
80th percentile: 4.832224035263062
90th percentile: 5.469372224807739
95th percentile: 5.566514110565185
99th percentile: 5.644227619171143
mean time: 4.0869128704071045
Pipeline stage StressChecker completed in 90.49s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_finom_2025-12-29 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2894.64s
Shutdown handler de-registered
function_finom_2025-12-29 status is now torndown due to DeploymentManager action