Invention Grant
US08996524B2 Automatically mining patterns for rule based data standardization systems
Status: Active
- Patent Title: Automatically mining patterns for rule based data standardization systems
- Application No.: US13415144
- Application Date: 2012-03-08
- Publication No.: US08996524B2
- Publication Date: 2015-03-31
- Inventor: Snigdha Chaturvedi, Tanveer A Faruquie, Hima P. Karanam, Marvin Mendelssohn, Mukesh K. Mohania, L. Venkata Subramaniam
- Applicant: Snigdha Chaturvedi, Tanveer A Faruquie, Hima P. Karanam, Marvin Mendelssohn, Mukesh K. Mohania, L. Venkata Subramaniam
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Edell, Shapiro & Finnan, LLC
- Agent: Susan Murray
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/30

Abstract:
Methods, computer program products and systems are provided for mining for sub-patterns within a text data set. The embodiments facilitate finding a set of N frequently occurring sub-patterns within the data set, extracting the N sub-patterns from the data set, and clustering the extracted sub-patterns into K groups, where each extracted sub-pattern is placed within the same group with other extracted sub-patterns based upon a distance value D that determines a degree of similarity between the sub-pattern and every other sub-pattern within the same group.
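The three steps named in the abstract (find N frequent sub-patterns, extract them, cluster them into K groups by a distance D) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: sub-patterns are modeled as contiguous runs of whitespace-separated tokens, D is a token-level edit distance, and grouping uses complete-linkage agglomerative clustering, chosen because it compares a sub-pattern against every other member of a group. The function names and the `max_len` parameter are hypothetical.

```python
from collections import Counter
from itertools import combinations


def frequent_subpatterns(records, n, max_len=3):
    """Return the n most frequent contiguous token runs of length 1..max_len.

    Assumption: a "sub-pattern" is a run of whitespace-separated tokens.
    """
    counts = Counter()
    for record in records:
        tokens = record.split()
        for size in range(1, max_len + 1):
            for start in range(len(tokens) - size + 1):
                counts[tuple(tokens[start:start + size])] += 1
    # Most frequent first; ties are broken by the pattern itself so the
    # result is deterministic.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [pattern for pattern, _ in ranked[:n]]


def edit_distance(a, b):
    """Token-level Levenshtein distance, standing in for the distance D."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        cur = [i]
        for j, y in enumerate(b, 1):
            cur.append(min(prev[j] + 1,              # delete x
                           cur[j - 1] + 1,           # insert y
                           prev[j - 1] + (x != y)))  # substitute
        prev = cur
    return prev[-1]


def cluster(patterns, k):
    """Merge groups until k remain, using complete linkage.

    The distance between two groups is the largest pairwise distance
    between their members, so a merge only happens when every member
    of one group is close to every member of the other.
    """
    groups = [[p] for p in patterns]

    def linkage(pair):
        i, j = pair
        return max(edit_distance(a, b) for a in groups[i] for b in groups[j])

    while len(groups) > max(k, 1):
        i, j = min(combinations(range(len(groups)), 2), key=linkage)
        groups[i].extend(groups[j])
        del groups[j]  # j > i, so index i is unaffected
    return groups


if __name__ == "__main__":
    # Hypothetical address-like records, used only to exercise the sketch.
    records = ["12 MAIN ST", "48 OAK ST", "7 MAIN RD"]
    top = frequent_subpatterns(records, n=4, max_len=2)
    for group in cluster(top, k=2):
        print(group)
```

A real standardization system would more likely mine patterns over token classes (for example, digit run or street-type keyword) than over raw tokens, and would pick N, K, and the distance function to suit the data; the sketch only shows how the three steps fit together.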
Published/Granted literature:
- US20130238611A1 Automatically Mining Patterns for Rule Based Data Standardization Systems, published 2013-09-12