submission_id: function_hamat_2024-11-29
developer_uid: chai_backend_admin
celo_rating: 1234.68
display_name: retune_with_base
family_friendly_score: 0.629
family_friendly_standard_error: 0.0068316762218360435
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
is_internal_developer: True
model_group:
model_name: retune_with_base
num_battles: 12792
num_wins: 5969
ranking_group: single
status: inactive
submission_type: function
timestamp: 2024-11-29T18:44:39+00:00
us_pacific_date: 2024-11-29
win_ratio: 0.4666197623514697
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.86777663230896s
Received healthy response to inference request in 3.730053424835205s
Received healthy response to inference request in 3.9173617362976074s
Received healthy response to inference request in 3.398275852203369s
Received healthy response to inference request in 3.238455057144165s
5 requests
0 failed requests
5th percentile: 2.941912317276001
10th percentile: 3.016048002243042
20th percentile: 3.164319372177124
30th percentile: 3.270419216156006
40th percentile: 3.3343475341796873
50th percentile: 3.398275852203369
60th percentile: 3.5309868812561036
70th percentile: 3.6636979103088376
80th percentile: 3.7675150871276855
90th percentile: 3.8424384117126467
95th percentile: 3.879900074005127
99th percentile: 3.9098694038391115
mean time: 3.4303845405578612
Pipeline stage StressChecker completed in 18.67s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.38s
Shutdown handler de-registered
function_hamat_2024-11-29 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4381.48s
Shutdown handler de-registered
function_hamat_2024-11-29 status is now inactive due to auto deactivation removed underperforming models