developer_uid: rirv938
submission_id: function_jenir_2025-03-30
model_name: dpo_data_collection
model_group:
status: inactive
timestamp: 2025-03-30T00:56:05+00:00
num_battles: 8712
num_wins: 5163
celo_rating: 1348.54
family_friendly_score: 0.4928
family_friendly_standard_error: 0.007070334645545429
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-03-29
win_ratio: 0.5926308539944903
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 8.844380378723145s
Received healthy response to inference request in 6.916195869445801s
Received healthy response to inference request in 3.191157341003418s
Received healthy response to inference request in 3.614424705505371s
Received healthy response to inference request in 3.4779305458068848s
5 requests
0 failed requests
5th percentile: 3.2485119819641115
10th percentile: 3.3058666229248046
20th percentile: 3.4205759048461912
30th percentile: 3.505229377746582
40th percentile: 3.5598270416259767
50th percentile: 3.614424705505371
60th percentile: 4.935133171081542
70th percentile: 6.255841636657714
80th percentile: 7.30183277130127
90th percentile: 8.073106575012208
95th percentile: 8.458743476867676
99th percentile: 8.76725299835205
mean time: 5.208817768096924
%s, retrying in %s seconds...
Received healthy response to inference request in 3.801901340484619s
Received healthy response to inference request in 4.195447683334351s
Received healthy response to inference request in 2.7874248027801514s
Received healthy response to inference request in 3.8348429203033447s
Received healthy response to inference request in 3.6119282245635986s
5 requests
0 failed requests
5th percentile: 2.9523254871368407
10th percentile: 3.11722617149353
20th percentile: 3.4470275402069093
30th percentile: 3.6499228477478027
40th percentile: 3.725912094116211
50th percentile: 3.801901340484619
60th percentile: 3.8150779724121096
70th percentile: 3.8282546043395995
80th percentile: 3.906963872909546
90th percentile: 4.051205778121949
95th percentile: 4.12332673072815
99th percentile: 4.18102349281311
mean time: 3.646308994293213
%s, retrying in %s seconds...
Received healthy response to inference request in 3.8063831329345703s
Received healthy response to inference request in 2.9988842010498047s
Received healthy response to inference request in 3.1501986980438232s
Received healthy response to inference request in 2.964853048324585s
Received healthy response to inference request in 2.9143760204315186s
5 requests
0 failed requests
5th percentile: 2.9244714260101317
10th percentile: 2.934566831588745
20th percentile: 2.954757642745972
30th percentile: 2.971659278869629
40th percentile: 2.9852717399597166
50th percentile: 2.9988842010498047
60th percentile: 3.059409999847412
70th percentile: 3.1199357986450194
80th percentile: 3.2814355850219727
90th percentile: 3.5439093589782713
95th percentile: 3.675146245956421
99th percentile: 3.7801357555389403
mean time: 3.1669390201568604
Pipeline stage StressChecker completed in 63.34s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.68s
Shutdown handler de-registered
function_jenir_2025-03-30 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4226.21s
Shutdown handler de-registered
function_jenir_2025-03-30 status is now inactive due to auto deactivation removed underperforming models
ChatRequest
Generation Params
Prompt Formatter
Chat History
ChatMessage 1