Invention Application
- Patent Title: METHODOLOGY TO ESTABLISH TERM CO-RELATIONSHIP USING SENTENCE BOUNDARY DETECTION
- Patent Title (中): 使用边界边界检测建立时间相关性的方法
-
Application No.: US13044873Application Date: 2011-03-10
-
Publication No.: US20120233132A1Publication Date: 2012-09-13
- Inventor: Dnyanesh Rajpathak
- Applicant: Dnyanesh Rajpathak
- Applicant Address: US MI Detroit
- Assignee: GM GLOBAL TECHNOLOGY OPERATIONS LLC
- Current Assignee: GM GLOBAL TECHNOLOGY OPERATIONS LLC
- Current Assignee Address: US MI Detroit
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
A method and system for splitting a text document into individual sentences using sentence boundary detection, and establishing co-relationships between terms which are present in the same sentence. A document corpus, or collection of text records, is provided, containing text with terms to be extracted. The text records in the document corpus are divided into individual sentences, using a set of rules for sentence boundary detection. The individual sentences are then analyzed to extract and correlate terms, such as parts and symptoms, symptoms and actions, or parts and failure modes. The correlated terms are then validated based on frequency of occurrence, with term pairs being considered valid if their frequency of occurrence exceeds a minimum frequency threshold. The validated term correlations can be used for fault model development, document classification, and document clustering.
Public/Granted literature
- US08452774B2 Methodology to establish term co-relationship using sentence boundary detection Public/Granted day:2013-05-28
Information query