Invention Grant
- Patent Title: Statistical stemming
- Patent Title (中): 统计词干
-
Application No.: US13453473Application Date: 2012-04-23
-
Publication No.: US08352247B2Publication Date: 2013-01-08
- Inventor: Evgeny A. Cherepanov , Oleksandr Grushetskyy , Dmitry N. Orlov
- Applicant: Evgeny A. Cherepanov , Oleksandr Grushetskyy , Dmitry N. Orlov
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Main IPC: G06F17/27
- IPC: G06F17/27

Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating suffix rewriting rules. A method includes obtaining a plurality of canonical suffix-rewriting rules each associated with one or more words, generating a suffix tree from the words, selecting a minimum colored subset of the nodes and leaves in the suffix tree, and generating a plurality of final suffix-rewriting rules from the nodes in the minimum colored subset. Another method includes receiving applicable and non-applicable words for a suffix-rewriting rule, generating a suffix tree from the applicable words and the non-applicable words, selecting a minimum colored subset of the nodes and leaves in the suffix tree, and generating a plurality of suffix-rewriting rules, wherein each rule corresponds to a node in the minimum colored subset with a valid status.
Public/Granted literature
- US20120209592A1 STATISTICAL STEMMING Public/Granted day:2012-08-16
Information query