Invention Application
- Patent Title: EXTRACTION OF CERTAIN TYPES OF ENTITIES
- Patent Title (中): 提取某些类型的实体
- Application No.: US12626905
- Application Date: 2009-11-29
- Publication No.: US20110131244A1
- Publication Date: 2011-06-02
- Inventor: Amir J. Padovitz, Matthew F. Hurst
- Applicant: Amir J. Padovitz, Matthew F. Hurst
- Applicant Address: US WA Redmond
- Assignee: MICROSOFT CORPORATION
- Current Assignee: MICROSOFT CORPORATION
- Current Assignee Address: US WA Redmond
- Main IPC: G06F17/30
- IPC: G06F17/30; G06F15/18

Abstract:
Certain types of entities may be extracted from a document. In one example, the entities to be recognized are cultural entities, such as the names of movies, video games, books, etc. For each such entity, a concept graph may be built that shows the relationship between the entity itself and other entities, such as the relationship between a movie and the actor(s) who act in the movie. When a candidate entity name is detected in the document, the concept graph may be used to look for other entities that appear in the context of the candidate entity. The presence of related entities in the context of the candidate may be used to disambiguate the meaning of the candidate. For example, a common word like “up” might be recognized as the name of a movie if the names of actors or characters in that movie appear near the word “up”.
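
The disambiguation idea in the abstract can be illustrated with a minimal Python sketch. It is not the method claimed in the application. The graph contents, the function name `find_entities`, the token window size, and the evidence threshold are all assumptions made for illustration, and only single-word candidate names are handled.

```python
import re

# Hypothetical concept graph: each candidate entity name maps to its type
# and to the names of related entities (actors, characters, studio).
# These entries are illustrative assumptions, not data from the application.
CONCEPT_GRAPH = {
    "up": {
        "type": "movie",
        "related": {"ed asner", "carl fredricksen", "pixar"},
    },
}


def find_entities(text, graph, window=10, min_related=1):
    """Return (candidate, type, evidence) for each candidate name that has
    at least `min_related` related entities within `window` tokens."""
    tokens = re.findall(r"\w+", text.lower())
    results = []
    for i, token in enumerate(tokens):
        entry = graph.get(token)
        if entry is None:
            continue  # not a known candidate entity name
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        # Pad with spaces so multi-word names match on whole tokens only.
        context = " " + " ".join(tokens[lo:hi]) + " "
        evidence = sorted(
            name for name in entry["related"] if f" {name} " in context
        )
        if len(evidence) >= min_related:
            results.append((token, entry["type"], evidence))
    return results


if __name__ == "__main__":
    print(find_entities(
        "We watched Up last night; Ed Asner is wonderful as Carl Fredricksen.",
        CONCEPT_GRAPH,
    ))
    print(find_entities(
        "Please pick up the groceries on your way home.",
        CONCEPT_GRAPH,
    ))
```

In this sketch the first sentence yields a movie match for "up" because two related names appear nearby, while the second sentence yields no match. A fuller system would presumably weight different kinds of related entities and handle multi-word candidate names, which this sketch does not attempt.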