function_nitim_2025-04-10

developer_uid: rirv938

submission_id: function_nitim_2025-04-10

model_name: dpo_data_collection

model_group:

status: torndown

timestamp: 2025-04-10T18:01:29+00:00

num_battles: 7459

num_wins: 3790

celo_rating: 1290.8

family_friendly_score: 0.6536

family_friendly_standard_error: 0.006729146156831489
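
The reported standard error is consistent with the usual binomial formula sqrt(p * (1 - p) / n). The sketch below checks that under the assumption that the score is a proportion over n independently scored samples. The record does not state the formula or the sample size, so the function name, the formula, and the implied n are all inferences, not documented behaviour.

```python
import math


def binomial_standard_error(p: float, n: int) -> float:
    """Standard error of a proportion p estimated from n samples (assumed formula)."""
    return math.sqrt(p * (1.0 - p) / n)


# Values copied from the record above.
family_friendly_score = 0.6536
family_friendly_standard_error = 0.006729146156831489

# Invert the assumed formula to see what sample size it would imply.
implied_n = (
    family_friendly_score
    * (1.0 - family_friendly_score)
    / family_friendly_standard_error ** 2
)

print(f"implied sample size: {implied_n:.1f}")
print(f"recomputed standard error: {binomial_standard_error(family_friendly_score, round(implied_n))}")
```

If the assumption holds, the implied sample size comes out at roughly 5000 scored samples.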

submission_type: function

display_name: dpo_data_collection

is_internal_developer: True

ranking_group: single

us_pacific_date: 2025-04-10

win_ratio: 0.5081110068373776
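
The win_ratio field can be reproduced from num_wins and num_battles. A minimal check, assuming the ratio is plain wins divided by battles with no smoothing or weighting:

```python
# Values copied from the record above.
num_battles = 7459
num_wins = 3790

# Assumption: win_ratio is simply wins / battles.
win_ratio = num_wins / num_battles

print(win_ratio)
```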

generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['</s>', '\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}

formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
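
The formatter templates above can be illustrated with a short sketch of how a prompt might be assembled from them. The assembly order (persona, scenario prompt, chat turns, then the response prefix), the build_prompt helper, and the sample names and messages are assumptions for illustration. Token-based truncation (max_input_tokens, truncate_by_message) is not modelled.

```python
# Templates copied from the formatter field above.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates in an assumed order.

    turns is a list of (speaker, message) pairs, speaker being "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response prefix cues the model to continue speaking as the bot.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Made-up sample conversation.
text = build_prompt(
    bot_name="Ava",
    user_name="Sam",
    memory="A friendly guide.",
    prompt="Ava greets Sam.",
    turns=[("user", "Hi"), ("bot", "Hello!")],
)
print(text)
```

Because every turn template ends in a newline and '\n' is listed in stopping_words, generation would stop at the end of a single bot line under this layout.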

Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 7.067456960678101s
Received healthy response to inference request in 9.474692821502686s
Received healthy response to inference request in 3.360745668411255s
Received healthy response to inference request in 3.6178760528564453s
Received healthy response to inference request in 4.930642604827881s
5 requests
0 failed requests
5th percentile: 3.412171745300293
10th percentile: 3.463597822189331
20th percentile: 3.5664499759674073
30th percentile: 3.8804293632507325
40th percentile: 4.4055359840393065
50th percentile: 4.930642604827881
60th percentile: 5.785368347167968
70th percentile: 6.6400940895080565
80th percentile: 7.548904132843018
90th percentile: 8.511798477172851
95th percentile: 8.993245649337767
99th percentile: 9.378403387069701
mean time: 5.690282821655273
%s, retrying in %s seconds...
Received healthy response to inference request in 3.352304458618164s
Received healthy response to inference request in 2.6226255893707275s
Received healthy response to inference request in 4.321777582168579s
Received healthy response to inference request in 3.2989306449890137s
Received healthy response to inference request in 2.3781940937042236s
5 requests
0 failed requests
5th percentile: 2.4270803928375244
10th percentile: 2.475966691970825
20th percentile: 2.5737392902374268
30th percentile: 2.757886600494385
40th percentile: 3.0284086227416993
50th percentile: 3.2989306449890137
60th percentile: 3.320280170440674
70th percentile: 3.341629695892334
80th percentile: 3.546199083328247
90th percentile: 3.933988332748413
95th percentile: 4.127882957458496
99th percentile: 4.282998657226562
mean time: 3.194766473770142
Pipeline stage StressChecker completed in 46.76s
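
The percentile and mean figures in the second StressChecker run can be reproduced from its five latencies using linear interpolation between sorted values. The sketch below is a pure-Python version of that calculation. The percentile helper is hypothetical, and the log does not say which library or method StressChecker actually uses.

```python
def percentile(values, q):
    """q-th percentile (0-100) using linear interpolation between sorted values."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100  # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


# Latencies in seconds, copied from the second run in the log above.
latencies = [
    3.352304458618164,
    2.6226255893707275,
    4.321777582168579,
    3.2989306449890137,
    2.3781940937042236,
]

for q in (5, 50, 90, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")

mean_time = sum(latencies) / len(latencies)
print(f"mean time: {mean_time}")
```

With only five requests per run, the upper percentiles are interpolated almost entirely from the single slowest request, so they are a rough indication rather than a stable tail-latency estimate.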
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.69s
Shutdown handler de-registered
function_nitim_2025-04-10 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4041.01s
Shutdown handler de-registered
function_nitim_2025-04-10 status is now inactive due to auto deactivation removed underperforming models
function_nitim_2025-04-10 status is now torndown due to DeploymentManager action