developer_uid: rirv938
submission_id: function_pumus_2025-02-13
model_name: retune_with_base
model_group:
status: torndown
timestamp: 2025-02-13T19:25:23+00:00
num_battles: 6792
num_wins: 3389
celo_rating: 1249.67
family_friendly_score: 0.579
family_friendly_standard_error: 0.006982248921371967
submission_type: function
display_name: retune_with_base
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-02-13
win_ratio: 0.4989693757361602
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.6385469436645508s
Received healthy response to inference request in 2.2660179138183594s
Received healthy response to inference request in 1.8840739727020264s
Received healthy response to inference request in 1.6925106048583984s
Received healthy response to inference request in 1.8793349266052246s
5 requests
0 failed requests
5th percentile: 1.6493396759033203
10th percentile: 1.6601324081420898
20th percentile: 1.681717872619629
30th percentile: 1.7298754692077636
40th percentile: 1.804605197906494
50th percentile: 1.8793349266052246
60th percentile: 1.8812305450439453
70th percentile: 1.883126163482666
80th percentile: 1.9604627609252931
90th percentile: 2.1132403373718263
95th percentile: 2.1896291255950926
99th percentile: 2.250740156173706
mean time: 1.872096872329712
Pipeline stage StressChecker completed in 10.47s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.64s
Shutdown handler de-registered
function_pumus_2025-02-13 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2403.19s
Shutdown handler de-registered
function_pumus_2025-02-13 status is now inactive due to auto deactivation removed underperforming models
function_pumus_2025-02-13 status is now torndown due to DeploymentManager action
function_pumus_2025-02-13 status is now torndown due to DeploymentManager action
function_pumus_2025-02-13 status is now torndown due to DeploymentManager action
ChatRequest
Generation Params
Prompt Formatter
Chat History
ChatMessage 1