Abstract:
A method and system for splitting a text document into individual sentences using sentence boundary detection, and establishing co-relationships between terms which are present in the same sentence. A document corpus, or collection of text records, is provided, containing text with terms to be extracted. The text records in the document corpus are divided into individual sentences, using a set of rules for sentence boundary detection. The individual sentences are then analyzed to extract and correlate terms, such as parts and symptoms, symptoms and actions, or parts and failure modes. The correlated terms are then validated based on frequency of occurrence, with term pairs being considered valid if their frequency of occurrence exceeds a minimum frequency threshold. The validated term correlations can be used for fault model development, document classification, and document clustering.
Abstract:
A method and system for developing reliability models from unstructured text documents, such as text verbatim descriptions from service technicians. An ontology, or data model, and heuristic rules are used to identify and extract failure modes and parts from the text verbatim comments associated with specific labor codes from service events. Like-meaning but differently-worded terms are then merged using text similarity scoring techniques. The resultant failure modes are used to create enhanced reliability models, where component reliability is predicted in terms of individual failure modes instead of aggregated for the component. The enhanced reliability models provide improved reliability prediction for the component, and also provides insight into aspects of the component design which can be improved in the future.
Abstract:
A method and system for developing fault models from unstructured text documents, such as text verbatim descriptions from customers and service technicians. An ontology, or data model, and heuristic rules are used to identify and extract descriptive terms from the text verbatim document. The descriptive terms are then classified into types, including symptoms, failure modes, and parts. Like-meaning but differently-worded descriptive terms are then merged using text similarity scoring techniques. The resultant symptoms, failure modes, parts, and correlations are then assembled into a fault model, which can be used for real-time fault diagnosis onboard a vehicle, or off-board at service shops.
Abstract:
A document may be received at a processing module. One or more tags may be applied to the document, each tag applied to a term, each tag representing a part of speech. One or more terms may be extracted from the document based on the tag. A weighting assignment parameter may be determined for each of the one or more extracted terms. Based on the weighting assignment parameter associated with each of the extracted terms, it may be determined whether the domain ontology includes the one or more extracted terms. If the domain ontology does not include the one or more extracted terms, the domain ontology may be augmented such that the domain ontology comprises the one or more extracted terms.
Abstract:
A method for extracting data from service repair verbatims in a vehicle service reporting system. Each service repair verbatim includes a technician's comments concerning a part, a symptom associated with the part, and a repair action associated with the symptom. Each service repair verbatim includes information relating to an identified problem with at least one vehicle part. A diagnostic and prognostic ontology database is provided that is structured by vehicle part classification, a vehicle part sub-class classification, and a relationship classification, wherein the relationship classification includes symptom relationships and action relationships. Each of the service repair verbatims are reconstructed utilizing the diagnostic and prognostic ontology database. Combinations of information are extracted from the reconstructed service repair verbatims as a function of user input criteria. A frequency is determined of each combination extracted in the reconstructed service repair verbatims. The service repair verbatims are clustered for each combination.
Abstract:
A method and system for splitting a text document into individual sentences using sentence boundary detection, and establishing co-relationships between terms which are present in the same sentence. A document corpus, or collection of text records, is provided, containing text with terms to be extracted. The text records in the document corpus are divided into individual sentences, using a set of rules for sentence boundary detection. The individual sentences are then analyzed to extract and correlate terms, such as parts and symptoms, symptoms and actions, or parts and failure modes. The correlated terms are then validated based on frequency of occurrence, with term pairs being considered valid if their frequency of occurrence exceeds a minimum frequency threshold. The validated term correlations can be used for fault model development, document classification, and document clustering.
Abstract:
A method is provided for extracting data from service repair verbatims in a vehicle service reporting system. Each service repair verbatim includes a technician's comments concerning a part, a symptom associated with the part, and a repair action associated with the symptom. Each service repair verbatim includes information relating to an identified problem with at least one vehicle part. A diagnostic and prognostic ontology database is provided that is structured by vehicle part classification, a vehicle part sub-class classification, and a relationship classification, wherein the relationship classification includes symptom relationships and action relationships. Each of the service repair verbatims are reconstructed utilizing the diagnostic and prognostic ontology database. Combinations of information are extracted from the reconstructed service repair verbatims as a function of user input criteria. A frequency is determined of each combination extracted in the reconstructed service repair verbatims. The service repair verbatims are clustered for each combination.