Abstract:
A technique for generating a new equivalent phrase for an input phrase includes receiving a first input phrase for natural language expansion. Tokens that correspond to parts of speech are generated for the first input phrase. An original grammar tree is generated using at least some of the tokens. One or more alternate grammar trees are generated that are different from the original grammar tree but substantially equivalent to the original grammar tree. One or more synonyms for at least one of the tokens are generated. Finally, one or more new phrases are generated based on the one or more alternate grammar trees and the one or more synonyms.
Abstract:
Textual Geographical Location relates a placename, which is a set of terms, from one to any maximum as defined in an individual language, to a unique point or area (many points) as found on a map or other coordinate system, such as the map of the United States as used in global positioning system (GPS).
Abstract:
The invention relate to a computer implemented document retrieval method comprising the steps of: a) allowing a user to input a search term in a first language, b) applying a phonetic algorithm to the search term, so that a phonetic version of the search term is obtained, c) using the output from step b) to perform a search in a plurality of electronic documents in the first language where said search identifies the most relevant document based upon the phonetic version of the search term, d) selecting a translated document that represents the document identified in step c), translated into a second language, and f) returning, to the user, the translated document.
Abstract:
A natural-language processing system includes a registration-candidate storage section that stores therein registration-candidate dictionary data, a judgment means that compares input data against the registration-candidate dictionary data to thereby judge whether or not the input data includes a word corresponding to the registration-candidate dictionary data, an inquiry means that inquires to a user whether or not corresponding dictionary data is to be registered in a dictionary storage section to accept a user's instruction if it is judged that a corresponding word exists, a dictionary registration means that registers the corresponding dictionary data in the dictionary storage section based on the input instruction, and a natural-language processing means that executes a natural-language processing onto the input data by using the dictionary data registered in the dictionary storage section.
Abstract:
Apparatus for determining a meaning of an ontology term, the apparatus including an electronic processing device that compares the ontology term to a number of potential ontology term meanings, generates a matching score for the results of each comparison, the matching score being based on a similarity of the ontology term and the potential ontology term meaning and selects a potential ontology term as the ontology term meaning using the matching scores.
Abstract:
Mechanisms are provided for processing a knowledge canvassing request. The mechanisms receive a request specifying an entity of interest from an originator of the request and analyze the request to extract a feature of the request. The mechanisms determine whether the request is a targeted natural language question to be answered or a knowledge canvassing request, based on the extracted feature. In response to determining that the request is a knowledge canvassing request, the mechanisms process the request by identifying entities represented in a knowledge graph data structure as being related to the entity of interest. The mechanisms output results of the processing of the request to the originator of the request.
Abstract:
Computer-program products and methods for automatically annotating terms, such as ambiguous terms, in an electronic text document are disclosed. In one embodiment, a method of annotating a text document includes determining, by a computing device, a term of interest within the text document. The method further includes searching a data structure including incongruous term pairs (tx, tt) determined from a controlled vocabulary for the term of interest appearing as a term tt, wherein the term tt is a linguistic head of a term tx of the incongruous term pairs (tx, tt). The method further includes annotating the term of interest with a meaning provided by the controlled vocabulary only if a term tx of the incongruous term pairs (tx, tt) associated with the term of interest in the data structure is not present within a predetermined textual distance of the term of interest in the text document.
Abstract:
The integrity of an existing tag set associated with a web module is protected by reviewing user-generated proposed additions to the tag set to determine whether the proposed addition is appropriate. An initial test is made to determine whether a proposed addition is trivial; e.g., includes strings of the same character. If the proposed addition is not found to be trivial, it is assessed to determine its relatedness to the existing tag set. The proposed addition is made a part of the tag set only if it is found to be sufficiently related to the existing tag set.
Abstract:
Contextual adaptation of documents automatically replaces words for synonyms that appear within context or topic whey they are being used. A machine learned topic modeling, trained by a set of documents representative of a target user is executed to determine topics of an input document, and to determine words in the document to replace based on determining the relevance of the words to the topics in the documents. An output document is generated based on the input document with the replaced words.
Abstract:
Methods and apparatuses for managing rule based on informal data are provided, one of methods comprises, receiving, by a rule management apparatus, informal data representing a rule, analyzing, by the rule management apparatus, the informal data, generating, by the rule management apparatus, formal data that can be processed by a rule engine of the rule management apparatus, using results of the analysis of the informal data, selecting, by the rule management apparatus, one or more items to be corrected for setting the rule from the formal data with reference to a target thesaurus relevant to the rule and processing, by the rule management apparatus, the formal data with the selected items corrected, using the rule engine.