Invention Grant
US08996524B2 Automatically mining patterns for rule based data standardization systems 有权
自动挖掘基于规则的数据标准化系统的模式

Automatically mining patterns for rule based data standardization systems
Abstract:
Methods, computer program products and systems are provided for mining for sub-patterns within a text data set. The embodiments facilitate finding a set of N frequently occurring sub-patterns within the data set, extracting the N sub-patterns from the data set, and clustering the extracted sub-patterns into K groups, where each extracted sub-pattern is placed within the same group with other extracted sub-patterns based upon a distance value D that determines a degree of similarity between the sub-pattern and every other sub-pattern within the same group.
Information query
Patent Agency Ranking
0/0