developer_uid: chai_backend_admin
submission_id: function_fisir_2024-11-27
model_name: retune_with_base
model_group:
status: inactive
timestamp: 2024-11-27T22:10:30+00:00
num_battles: 10005
num_wins: 5036
celo_rating: 1258.9
family_friendly_score: 0.5962000000000001
family_friendly_standard_error: 0.006938956117457438
submission_type: function
display_name: retune_with_base
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-11-27
win_ratio: 0.5033483258370814
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.833080530166626s
Received healthy response to inference request in 4.48120641708374s
Received healthy response to inference request in 2.8255932331085205s
Received healthy response to inference request in 2.798893690109253s
Received healthy response to inference request in 3.793701171875s
5 requests
0 failed requests
5th percentile: 2.8042335987091063
10th percentile: 2.80957350730896
20th percentile: 2.820253324508667
30th percentile: 3.0192148208618166
40th percentile: 3.4064579963684083
50th percentile: 3.793701171875
60th percentile: 4.068703269958496
70th percentile: 4.343705368041992
80th percentile: 4.551581239700317
90th percentile: 4.6923308849334715
95th percentile: 4.762705707550049
99th percentile: 4.819005565643311
mean time: 3.746495008468628
%s, retrying in %s seconds...
Received healthy response to inference request in 3.455453872680664s
Received healthy response to inference request in 3.87576961517334s
Received healthy response to inference request in 2.6821651458740234s
Received healthy response to inference request in 2.9149091243743896s
Failed to get response for submission function_nupus_2024-11-27: no entry with id "fake_submission_id_for_testing" found on database!
Received healthy response to inference request in 2.8418493270874023s
5 requests
0 failed requests
5th percentile: 2.7141019821166994
10th percentile: 2.746038818359375
20th percentile: 2.8099124908447264
30th percentile: 2.8564612865448
40th percentile: 2.8856852054595947
50th percentile: 2.9149091243743896
60th percentile: 3.1311270236968993
70th percentile: 3.347344923019409
80th percentile: 3.5395170211791993
90th percentile: 3.7076433181762694
95th percentile: 3.7917064666748046
99th percentile: 3.858956985473633
mean time: 3.1540294170379637
Pipeline stage StressChecker completed in 36.93s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.47s
Shutdown handler de-registered
function_fisir_2024-11-27 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4443.31s
Shutdown handler de-registered
function_fisir_2024-11-27 status is now inactive due to auto deactivation removed underperforming models