Invention Grant
- Patent Title: Document-based synonym generation
- Patent Title (中): 基于文档的同义词生成
-
Application No.: US12027559Application Date: 2008-02-07
-
Publication No.: US07890521B1Publication Date: 2011-02-15
- Inventor: Oleksandr Grushetskyy , Steven D. Baker
- Applicant: Oleksandr Grushetskyy , Steven D. Baker
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Main IPC: G06F7/00
- IPC: G06F7/00

Abstract:
One embodiment of the present invention provides a system that automatically generates synonyms for words from documents. During operation, this system determines co-occurrence frequencies for pairs of words in the documents. The system also determines closeness scores for pairs of words in the documents, wherein a closeness score indicates whether a pair of words are located so close to each other that the words are likely to occur in the same sentence or phrase. Finally, the system determines whether pairs of words are synonyms based on the determined co-occurrence frequencies and the determined closeness scores. While making this determination, the system can additionally consider correlations between words in a title or an anchor of a document and words in the document as well as word-form scores for pairs of words in the documents.
Information query