Invention Application
US20130238610A1 Automatically Mining Patterns For Rule Based Data Standardization Systems
Under Examination - Published
- Patent Title: Automatically Mining Patterns For Rule Based Data Standardization Systems
- Application No.: US13414374
- Application Date: 2012-03-07
- Publication No.: US20130238610A1
- Publication Date: 2013-09-12
- Inventor: Snigdha Chaturvedi, Tanveer A. Faruquie, Hima P. Karanam, Marvin Mendelssohn, Mukesh K. Mohania, L. Venkata Subramaniam
- Applicant: Snigdha Chaturvedi, Tanveer A. Faruquie, Hima P. Karanam, Marvin Mendelssohn, Mukesh K. Mohania, L. Venkata Subramaniam
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
Computer program products and systems are provided for mining for sub-patterns within a text data set. The embodiments facilitate finding a set of N frequently occurring sub-patterns within the data set, extracting the N sub-patterns from the data set, and clustering the extracted sub-patterns into K groups, where each extracted sub-pattern is placed within the same group with other extracted sub-patterns based upon a distance value D that determines a degree of similarity between the sub-pattern and every other sub-pattern within the same group.
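
The three steps named in the abstract (find N frequent sub-patterns, extract them, cluster them into K groups by a distance D) could be sketched roughly as follows. This is an illustrative sketch only, not the method claimed in the application: the abstract does not specify how sub-patterns are enumerated, how D is computed, or which clustering scheme is used. The sketch assumes that each record has already been reduced to a space-separated string of token classes, that a sub-pattern is a contiguous run of tokens, that D is a token-level edit distance, and that grouping uses complete-linkage agglomerative clustering. The function names `frequent_subpatterns`, `edit_distance`, and `cluster` are hypothetical.

```python
from collections import Counter
from itertools import combinations


def frequent_subpatterns(patterns, n, min_len=2, max_len=3):
    """Return the n most frequent contiguous sub-patterns as token tuples.

    Assumption: each input pattern is a space-separated string of token classes.
    """
    counts = Counter()
    for pattern in patterns:
        tokens = pattern.split()
        for size in range(min_len, max_len + 1):
            for start in range(len(tokens) - size + 1):
                counts[tuple(tokens[start:start + size])] += 1
    # Sort by descending count, then by the tuple itself, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [sub for sub, _ in ranked[:n]]


def edit_distance(a, b):
    """Token-level Levenshtein distance; stands in for the distance value D."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,              # delete x
                            curr[j - 1] + 1,          # insert y
                            prev[j - 1] + (x != y)))  # substitute or match
        prev = curr
    return prev[-1]


def cluster(subpatterns, k):
    """Complete-linkage agglomerative clustering down to k groups.

    Complete linkage scores a candidate merge by the largest distance between
    any two members, so every member of a group is compared with every other.
    """
    groups = [[s] for s in subpatterns]
    while len(groups) > max(k, 1):
        # Merge the pair of groups whose farthest members are closest.
        i, j = min(
            combinations(range(len(groups)), 2),
            key=lambda pair: max(
                edit_distance(a, b)
                for a in groups[pair[0]]
                for b in groups[pair[1]]
            ),
        )
        groups[i].extend(groups[j])
        del groups[j]  # safe: combinations guarantees i < j
    return groups
```

A call such as `cluster(frequent_subpatterns(patterns, n=50), k=5)` would then yield five lists of sub-patterns. The values 50 and 5 are placeholders; the abstract leaves N and K as parameters.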
Public/Granted literature
- US10163063B2 Automatically mining patterns for rule based data standardization systems (Public/Granted day: 2018-12-25)