Publication number: US11861039B1
Publication date: 2024-01-02
Application number: US17035437
Filing date: 2020-09-28
Applicant: Amazon Technologies, Inc.
Inventors: Yahor Pushkin, Sravan Babu Bodapati, Sunil Mallya Kasaragod, Sameer Karnik, Abhinav Goyal, Yaser Al-Onaizan, Ravindra Manjunatha, Kalpit Dixit, Alok Kumar Parmesh, Syed Kashif Hussain Shah
IPC: G06F21/62, G06F16/903, G06F3/06, G06N20/00
CPC classification number: G06F21/6245, G06F3/0619, G06F3/0623, G06F3/0683, G06F16/90344, G06N20/00
Abstract: Various embodiments of a hierarchical system or method for identifying sensitive content in data are described. In some embodiments, sensitive data classifiers local to a data storage system analyze a plurality of data items and classify at least some of the data items as potentially containing sensitive data. The sensitive data classifiers provide the classified data items to a separate sensitive data discovery component. In some embodiments, the sensitive data discovery component obtains the classified data items, performs a sensitive data location analysis on them to identify the location of sensitive data within some of the classified data items, and generates location information for the sensitive data within the data items that contain it. In some embodiments, the sensitive data discovery component provides this location information to a destination, which might redact, tokenize, highlight, or perform other actions on the located sensitive data.
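
The two-stage flow in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how the classifiers or the location analysis work, so the regular expressions, the cheap first-stage check, and every name below (`classify_local`, `discover_locations`, `redact`, `Location`, `PATTERNS`) are assumptions chosen only to show the division of labor between a local classifier, a separate discovery component, and a destination.

```python
import re
from dataclasses import dataclass

# Hypothetical detectors; the abstract does not specify any detection method.
PATTERNS = {
    "EMAIL": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


@dataclass(frozen=True)
class Location:
    """Location information for one sensitive span inside a data item."""
    item_id: str
    kind: str
    start: int  # inclusive character offset
    end: int    # exclusive character offset


def classify_local(items):
    """Stage 1 (local to storage): keep items that *might* hold sensitive data.

    Assumed heuristic: any '@' or digit marks an item as a candidate.
    """
    return {
        item_id: text
        for item_id, text in items.items()
        if "@" in text or any(ch.isdigit() for ch in text)
    }


def discover_locations(flagged):
    """Stage 2 (separate component): locate sensitive spans in flagged items."""
    locations = []
    for item_id, text in flagged.items():
        for kind, pattern in PATTERNS.items():
            for match in pattern.finditer(text):
                locations.append(Location(item_id, kind, match.start(), match.end()))
    return locations


def redact(text, locations):
    """Destination action: replace each located span with a '[KIND]' tag.

    Spans are applied right to left so earlier offsets stay valid;
    this sketch assumes the spans do not overlap.
    """
    for loc in sorted(locations, key=lambda l: l.start, reverse=True):
        text = text[:loc.start] + f"[{loc.kind}]" + text[loc.end:]
    return text


items = {
    "a": "Contact bob@example.com today",
    "b": "No secrets here",
    "c": "Order 66 shipped",
}
flagged = classify_local(items)          # cheap pass over every item
locations = discover_locations(flagged)  # costlier pass over candidates only
redacted_a = redact(items["a"], [l for l in locations if l.item_id == "a"])
```

In this sketch the first stage deliberately over-selects (item "c" is flagged but yields no locations), which reflects the abstract's wording that items are classified as "potentially" containing sensitive data before the discovery component confirms and locates it.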