- 专利标题: Automated masking of confidential information in unstructured computer text using artificial intelligence
-
申请号: US16248175申请日: 2019-01-15
-
公开(公告)号: US11727245B2公开(公告)日: 2023-08-15
- 发明人: Yu Zhang , Pu Li , Hua Hao , Liang Chen , Dong Han
- 申请人: FMR LLC
- 申请人地址: US MA Boston
- 专利权人: FMR LLC
- 当前专利权人: FMR LLC
- 当前专利权人地址: US MA Boston
- 代理机构: Cesari & McKenna, LLP
- 主分类号: G06N3/04
- IPC分类号: G06N3/04 ; G06F21/62 ; G06F40/295 ; G06F40/30 ; G06F40/247 ; G06N3/02
摘要:
Methods and apparatuses are described for unstructured computer text is analyzed for masking of confidential information using artificial intelligence. A client device generates a message comprising unstructured computer text including confidential information. A server trains a word embedding model using the unstructured text. The server generates a multidimensional vector for each word in the unstructured text, generates a mapping table comprising a predetermined set of words corresponding to confidential information from the unstructured text, and determines one or more neighboring words in the trained word embedding model using the predetermined set of words. The server updates the mapping table to incorporate the one or more neighboring words and executes rules on the unstructured text that filter out one or more words, and applies the updated mapping table to match words in the updated mapping table with words in the filtered text and mask the matching words in the unstructured text.
公开/授权文献
信息查询