Invention Grant
- Patent Title: Systems and methods for discovering synonymous elements using context over multiple similar addresses
- Patent Title (中): 使用上下文发现多个相似地址的同义元素的系统和方法
- Application No.: US12771543
- Application Date: 2010-04-30
- Publication No.: US08682898B2
- Publication Date: 2014-03-25
- Inventor: Sachindra Joshi, Tanveer Faruquie, Hima Prasad Karanam, Marvin Mendelssohn, Mukesh Kumar Mohania, Angel Marie Smith, L Venkata Subramaniam, Girish Venkatachaliah
- Applicant: Sachindra Joshi, Tanveer Faruquie, Hima Prasad Karanam, Marvin Mendelssohn, Mukesh Kumar Mohania, Angel Marie Smith, L Venkata Subramaniam, Girish Venkatachaliah
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Ference & Associates LLC
- Main IPC: G06F7/00
- IPC: G06F7/00; G06F17/00

Abstract:
A clustering-based approach to data standardization is provided. Certain embodiments take as input a plurality of addresses, identify one or more features of the addresses, cluster the addresses based on the one or more features, utilize the cluster(s) to provide a data-based context useful in identifying one or more synonyms for elements contained in the address(es), and standardize the address(es) to an acceptable format, with one or more synonyms and/or other elements being added to or taken away from the input address(es) as part of the standardization process.
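The pipeline the abstract describes (cluster similar addresses, use each cluster as context to spot synonymous elements, then standardize) can be illustrated with a minimal sketch. This is not the patented method: the function names, the Jaccard token-overlap similarity, the greedy clustering, the 0.5 threshold, and the "one differing token" rule are all assumptions chosen only to make the idea concrete.

```python
from collections import defaultdict
from itertools import combinations


def tokenize(address):
    """Lowercase an address and split it into alphanumeric tokens."""
    cleaned = "".join(ch if ch.isalnum() else " " for ch in address.lower())
    return cleaned.split()


def jaccard(a, b):
    """Token-set overlap between two token lists (assumed similarity feature)."""
    sa, sb = set(a), set(b)
    return len(sa & sb) / len(sa | sb) if sa | sb else 0.0


def cluster_addresses(addresses, threshold=0.5):
    """Greedy clustering: join the first cluster whose first member is
    similar enough, otherwise start a new cluster."""
    clusters = []
    for addr in addresses:
        tokens = tokenize(addr)
        for cluster in clusters:
            if jaccard(tokens, cluster[0]) >= threshold:
                cluster.append(tokens)
                break
        else:
            clusters.append([tokens])
    return clusters


def synonym_candidates(clusters):
    """Count token pairs that differ while every other token of two
    same-length addresses in one cluster matches (the shared context)."""
    counts = defaultdict(int)
    for cluster in clusters:
        for a, b in combinations(cluster, 2):
            if len(a) != len(b):
                continue
            diffs = [(x, y) for x, y in zip(a, b) if x != y]
            if len(diffs) != 1:
                continue
            x, y = diffs[0]
            # Skip tokens containing digits: differing house numbers are not synonyms.
            if any(ch.isdigit() for ch in x + y):
                continue
            counts[tuple(sorted((x, y)))] += 1
    return dict(counts)


def build_synonym_map(candidates, min_count=1):
    """Map the shorter form of each pair to the longer one (assumed canonical)."""
    mapping = {}
    for (x, y), count in candidates.items():
        if count >= min_count:
            short, long_form = sorted((x, y), key=len)
            mapping[short] = long_form
    return mapping


def standardize(address, synonyms):
    """Rewrite an address with every known synonym replaced by its canonical form."""
    return " ".join(synonyms.get(tok, tok) for tok in tokenize(address))


# Hypothetical sample data, for illustration only.
addresses = [
    "12 Park Street, Kolkata",
    "12 Park St, Kolkata",
    "45 Park St Kolkata",
    "7 Lake Road, Pune",
    "7 Lake Rd, Pune",
]
candidates = synonym_candidates(cluster_addresses(addresses))
synonyms = build_synonym_map(candidates)
standardized = standardize("45 Park St Kolkata", synonyms)
```

In this sketch the cluster supplies the context: "st" and "street" are proposed as synonyms only because the two addresses agree on every other token. A production system would likely need alignment for addresses of different lengths and a higher `min_count` to suppress accidental pairs.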
Public/Granted literature
- US20110270808A1 Systems and Methods for Discovering Synonymous Elements Using Context Over Multiple Similar Addresses (Public/Granted day: 2011-11-03)