developer_uid: chai_backend_admin
submission_id: function_jikib_2024-11-18
model_name: retune_with_base
model_group:
status: inactive
timestamp: 2024-11-18T18:09:24+00:00
num_battles: 13830
num_wins: 7088
celo_rating: 1262.62
family_friendly_score: 0.5916
family_friendly_standard_error: 0.006951394680206268
submission_type: function
display_name: retune_with_base
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-11-18
win_ratio: 0.5125090383224874
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
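The formatter entry above only lists templates; it does not say how they are combined. A minimal sketch, assuming the pieces are concatenated in the order memory, prompt, chat turns, response cue. The function `build_prompt`, the `turns` structure, and the sample names are hypothetical and are not taken from the submission. Truncation to `max_input_tokens` is not modeled here.

```python
# Templates copied from the formatter entry in the metadata.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical assembly order: memory, prompt, chat turns, response cue.

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". This layout is an assumption for illustration.
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response cue ends with "Name:" so the model continues as the bot.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    bot_name="Ava",
    user_name="Sam",
    memory="A friendly guide.",
    prompt="Ava greets a traveler.",
    turns=[("user", "Hi"), ("bot", "Hello!"), ("user", "How are you?")],
)
print(example)
```

Because `stopping_words` in generation_params includes '\n', a prompt ending in the bare response cue would yield a single-line reply under this layout.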
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.547508955001831s
Received healthy response to inference request in 3.6664652824401855s
Received healthy response to inference request in 3.146693229675293s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 2.683847188949585s
Received healthy response to inference request in 2.7451136112213135s
5 requests
0 failed requests
5th percentile: 2.6961004734039307
10th percentile: 2.7083537578582764
20th percentile: 2.7328603267669678
30th percentile: 2.8254295349121095
40th percentile: 2.986061382293701
50th percentile: 3.146693229675293
60th percentile: 3.307019519805908
70th percentile: 3.4673458099365235
80th percentile: 3.571300220489502
90th percentile: 3.618882751464844
95th percentile: 3.6426740169525145
99th percentile: 3.6617070293426512
mean time: 3.1579256534576414
Pipeline stage StressChecker completed in 17.41s
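The StressChecker percentiles above can be reproduced from the five logged response times. A minimal sketch, assuming linear interpolation between sorted samples (the same scheme NumPy uses by default). The log does not say how the stage computes them, so the helper `percentile` is illustrative only.

```python
# Response times (seconds) copied from the five "healthy response" log lines.
latencies = [
    3.547508955001831,
    3.6664652824401855,
    3.146693229675293,
    2.683847188949585,
    2.7451136112213135,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only five samples, every percentile is an interpolation between two neighbouring measurements, so the tail values (95th, 99th) say little beyond the slowest request.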
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.63s
Shutdown handler de-registered
function_jikib_2024-11-18 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3689.43s
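The log does not state how many samples the OfflineFamilyFriendlyScorer evaluated. A rough check, assuming the reported family_friendly_standard_error is a binomial standard error sqrt(p * (1 - p) / n) of the reported score: under that assumption the figures are consistent with n = 5000. This is an inference from the metadata values, not something the log confirms, and `binomial_se` is a hypothetical helper.

```python
import math

# Values copied from the metadata block.
score = 0.5916
reported_se = 0.006951394680206268


def binomial_se(p, n):
    """Standard error of a proportion p estimated from n independent samples."""
    return math.sqrt(p * (1.0 - p) / n)


# Solve the binomial formula for n (assumption: the SE was computed this way).
implied_n = score * (1.0 - score) / reported_se ** 2

print(f"implied sample count: {implied_n:.2f}")
print(f"binomial SE at n=5000: {binomial_se(score, 5000)}")
```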
Shutdown handler de-registered
function_jikib_2024-11-18 status is now inactive due to auto deactivation removed underperforming models