Publication No.: US11093641B1
Publication Date: 2021-08-17
Application No.: US16219742
Application Date: 2018-12-13
Applicant: Amazon Technologies, Inc.
Inventor: Michael William Whalen, Carsten Varming, Neha Rungta, Andrew Judge Gacek, Murphy Berzish
IPC: G06F21/62, G06N5/00, G06F16/903, G06K9/00, H04L29/06, G06F16/906
Abstract: A document anonymization system transforms structured documents, such as security policies, that contain user-specific and other sensitive data, producing encoded logic problems in the format or language of one or more constraint solvers; the logic problems do not contain any of the sensitive data. The system may perform a one- or two-stage anonymization process: in a first stage, the electronic document is analyzed according to its document type to identify parameters likely to contain sensitive data, and the associated values are replaced with arbitrary values; in a second stage, after the anonymized electronic document is converted into logic formulae representing the data, the system performs replacements of string constants in the logic formulae with arbitrary strings to further anonymize the sensitive data. The system may confirm that anonymization preserves the document structure, difficulty level, and satisfiability of the original document by executing the constraint solver against the anonymized logic problem.
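The two-stage process described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `SENSITIVE_KEYS` set, the `Renamer` class, and both function names are hypothetical, the document is assumed to be a JSON-like dictionary, and the logic formula is assumed to be plain text whose string constants are double-quoted with no embedded quotes. The sketch only shows consistent replacement of values; it does not run a constraint solver to confirm that structure, difficulty, and satisfiability are preserved.

```python
import itertools
import re

# Hypothetical parameter names treated as likely to hold sensitive data.
SENSITIVE_KEYS = {"Principal", "Resource"}


class Renamer:
    """Maps each distinct original string to one stable arbitrary replacement."""

    def __init__(self, prefix):
        self.prefix = prefix
        self.mapping = {}
        self.counter = itertools.count()

    def rename(self, value):
        # Reuse the earlier replacement so equal inputs stay equal after anonymization.
        if value not in self.mapping:
            self.mapping[value] = f"{self.prefix}{next(self.counter)}"
        return self.mapping[value]


def anonymize_document(node, renamer, sensitive=False):
    """Stage 1: replace string values under sensitive keys, keeping the document shape."""
    if isinstance(node, dict):
        return {
            key: anonymize_document(value, renamer, sensitive or key in SENSITIVE_KEYS)
            for key, value in node.items()
        }
    if isinstance(node, list):
        return [anonymize_document(item, renamer, sensitive) for item in node]
    if sensitive and isinstance(node, str):
        return renamer.rename(node)
    return node


def anonymize_formula(formula, renamer):
    """Stage 2: replace every double-quoted string constant in a logic formula."""
    return re.sub(
        r'"([^"]*)"',
        lambda match: '"' + renamer.rename(match.group(1)) + '"',
        formula,
    )


policy = {
    "Effect": "Allow",
    "Principal": "alice",
    "Resource": ["bucket/a", "bucket/b", "bucket/a"],
}
anonymized_policy = anonymize_document(policy, Renamer("v"))
anonymized_formula = anonymize_formula(
    '(and (= principal "v0") (= owner "v0") (= effect "Allow"))', Renamer("s")
)
```

Because each `Renamer` keeps a mapping, repeated values receive the same replacement, so equalities between constants survive both stages. Values with special meaning to the solver, such as wildcard patterns, would need separate handling that this sketch omits.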