developer_uid: rirv938
submission_id: function_ramit_2025-06-25
model_name: dpo_data_collection
model_group:
status: torndown
timestamp: 2025-06-25T16:02:34+00:00
num_battles: 6474
num_wins: 3309
celo_rating: 1288.74
family_friendly_score: 0.54
family_friendly_standard_error: 0.0070484040746824385
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-06-25
win_ratio: 0.5111214087117701
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.8552708625793457s
Received healthy response to inference request in 3.147488832473755s
Received healthy response to inference request in 16.71318531036377s
Received healthy response to inference request in 3.7616171836853027s
Received healthy response to inference request in 4.849827527999878s
5 requests
0 failed requests
5th percentile: 2.9137144565582274
10th percentile: 2.9721580505371095
20th percentile: 3.0890452384948732
30th percentile: 3.2703145027160643
40th percentile: 3.5159658432006835
50th percentile: 3.7616171836853027
60th percentile: 4.196901321411133
70th percentile: 4.632185459136963
80th percentile: 7.222499084472658
90th percentile: 11.967842197418214
95th percentile: 14.340513753890988
99th percentile: 16.238650999069215
mean time: 6.26547794342041
%s, retrying in %s seconds...
Received healthy response to inference request in 2.5653839111328125s
Received healthy response to inference request in 9.07992959022522s
Received healthy response to inference request in 3.9640586376190186s
Received healthy response to inference request in 3.5704731941223145s
Received healthy response to inference request in 3.8063061237335205s
5 requests
0 failed requests
5th percentile: 2.766401767730713
10th percentile: 2.9674196243286133
20th percentile: 3.369455337524414
30th percentile: 3.6176397800445557
40th percentile: 3.711972951889038
50th percentile: 3.8063061237335205
60th percentile: 3.86940712928772
70th percentile: 3.932508134841919
80th percentile: 4.987232828140259
90th percentile: 7.03358120918274
95th percentile: 8.05675539970398
99th percentile: 8.875294752120972
mean time: 4.597230291366577
%s, retrying in %s seconds...
Received healthy response to inference request in 9.09247899055481s
Received healthy response to inference request in 2.4722864627838135s
Received healthy response to inference request in 2.7522995471954346s
Received healthy response to inference request in 3.0758705139160156s
Received healthy response to inference request in 2.629467248916626s
5 requests
0 failed requests
5th percentile: 2.503722620010376
10th percentile: 2.5351587772369384
20th percentile: 2.5980310916900633
30th percentile: 2.6540337085723875
40th percentile: 2.703166627883911
50th percentile: 2.7522995471954346
60th percentile: 2.881727933883667
70th percentile: 3.0111563205718994
80th percentile: 4.279192209243775
90th percentile: 6.685835599899292
95th percentile: 7.88915729522705
99th percentile: 8.851814651489258
mean time: 4.00448055267334
Pipeline stage StressChecker completed in 77.94s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.70s
Shutdown handler de-registered
function_ramit_2025-06-25 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4163.10s
Shutdown handler de-registered
function_ramit_2025-06-25 status is now inactive due to auto deactivation removed underperforming models
function_ramit_2025-06-25 status is now torndown due to DeploymentManager action