developer_uid: rirv938
submission_id: function_lujeb_2025-02-27
model_name: retune_with_base
model_group:
status: torndown
timestamp: 2025-02-27T20:30:53+00:00
num_battles: 5681
num_wins: 2935
celo_rating: 1293.4
family_friendly_score: 0.5347999999999999
family_friendly_standard_error: 0.007053920328441483
submission_type: function
display_name: retune_with_base
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-02-27
win_ratio: 0.5166343953529309
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
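The formatter entry above only lists templates; the log does not show how they are combined. The following is a minimal sketch that assumes they are concatenated in the order persona, prompt, chat turns, response prefix. The `build_prompt` helper, its arguments, and the sample names are hypothetical. Truncation to `max_input_tokens` is not modelled.

```python
# Templates copied from the formatter entry in the metadata above.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs where speaker is
    either "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response prefix leaves the model to complete the bot's next line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Sample values are invented for illustration only.
example = build_prompt(
    bot_name="Ava",
    user_name="Sam",
    memory="friendly",
    prompt="A chat.",
    turns=[("user", "Hi"), ("bot", "Hello")],
)
print(example)
```

Because the response template ends with `{bot_name}:` and the stopping words include `'\n'`, a completion under this layout would plausibly end at the first line break, which fits the short `max_output_tokens` of 64.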
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 7.306220054626465s
Received healthy response to inference request in 3.88982892036438s
Received healthy response to inference request in 3.5918924808502197s
Received healthy response to inference request in 2.3761305809020996s
Received healthy response to inference request in 4.14841628074646s
5 requests
0 failed requests
5th percentile: 2.6192829608917236
10th percentile: 2.8624353408813477
20th percentile: 3.3487401008605957
30th percentile: 3.6514797687530516
40th percentile: 3.7706543445587157
50th percentile: 3.88982892036438
60th percentile: 3.993263864517212
70th percentile: 4.096698808670044
80th percentile: 4.779977035522461
90th percentile: 6.043098545074463
95th percentile: 6.674659299850464
99th percentile: 7.179907903671264
mean time: 4.262497663497925
%s, retrying in %s seconds...
Received healthy response to inference request in 2.699906349182129s
Received healthy response to inference request in 1.8375358581542969s
Received healthy response to inference request in 3.055574417114258s
Received healthy response to inference request in 3.9262256622314453s
Received healthy response to inference request in 5.4687113761901855s
5 requests
0 failed requests
5th percentile: 2.0100099563598635
10th percentile: 2.1824840545654296
20th percentile: 2.5274322509765623
30th percentile: 2.7710399627685547
40th percentile: 2.9133071899414062
50th percentile: 3.055574417114258
60th percentile: 3.403834915161133
70th percentile: 3.7520954132080075
80th percentile: 4.2347228050231935
90th percentile: 4.851717090606689
95th percentile: 5.160214233398437
99th percentile: 5.407011947631836
mean time: 3.397590732574463
%s, retrying in %s seconds...
Received healthy response to inference request in 3.4199938774108887s
Received healthy response to inference request in 2.4954450130462646s
Received healthy response to inference request in 2.812007188796997s
Received healthy response to inference request in 3.400930643081665s
Received healthy response to inference request in 3.4833567142486572s
5 requests
0 failed requests
5th percentile: 2.558757448196411
10th percentile: 2.6220698833465574
20th percentile: 2.7486947536468507
30th percentile: 2.929791879653931
40th percentile: 3.165361261367798
50th percentile: 3.400930643081665
60th percentile: 3.4085559368133547
70th percentile: 3.416181230545044
80th percentile: 3.4326664447784423
90th percentile: 3.45801157951355
95th percentile: 3.4706841468811036
99th percentile: 3.4808222007751466
mean time: 3.1223466873168944
Pipeline stage StressChecker completed in 57.13s
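The percentile and mean lines reported by StressChecker can be reproduced from the five latencies of each batch. The sketch below uses linear interpolation between order statistics, which matches the logged figures for the first batch. Whether the pipeline itself computes them this way is an assumption, and the `percentile` helper is hypothetical.

```python
import math


def percentile(values, q):
    """Linear-interpolation percentile (q in 0..100) over a small sample."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted data
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


# Latencies (seconds) from the first batch of healthy responses in the log.
latencies = [
    7.306220054626465,
    3.88982892036438,
    3.5918924808502197,
    2.3761305809020996,
    4.14841628074646,
]

p5 = percentile(latencies, 5)
p50 = percentile(latencies, 50)
p90 = percentile(latencies, 90)
mean_time = sum(latencies) / len(latencies)

print(f"5th percentile: {p5}")
print(f"50th percentile: {p50}")
print(f"90th percentile: {p90}")
print(f"mean time: {mean_time}")
```

With only five samples per batch, the upper percentiles are dominated by the single slowest request, so the 99th percentile of the first batch sits close to its 7.3 s maximum.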
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
function_lujeb_2025-02-27 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3448.80s
Shutdown handler de-registered
function_lujeb_2025-02-27 status is now inactive due to auto deactivation removed underperforming models
function_lujeb_2025-02-27 status is now torndown due to DeploymentManager action