Abstract:
A method for specifically converting non-phonetic characters representing vocabulary in languages into surrogate words for inputting into a computer includes the steps of (a) alphabetizing a pictographic/ideographic radical of each character according to its pronunciation in a respective language, with the resulting spelling then being used as a prefix for a newly created surrogate word; (b) alphabetizing a phonetic radical according to its pronunciation in the respective language, with the resulting spelling then being used as a suffix for the newly created surrogate word; and (c) combining the prefix and suffix together and separate them with a separate mark such as a hyphen "-", thus creating the surrogate word for each specific "character" used in the written form of the respective language. Such new created surrogate words are unique for and can precisely represent the non-phonetic characters used in the written oriental languages such as Chinese, Japanese, Korean language, and Indian language, thus facilitating the easy input of the information in these languages into a computer.
Abstract:
A dynamic pattern matching system (DPRS) for a communication system is disclosed. The DPRS enables the command processor within the communication system to be flexible, permitting commands and their associated responses to be added, deleted and changed on the fly while the communication system is operative. A user designs a command set and constructs a trap definition file. This trap definition file is read by the DPRS during initialization and a trap chain is constructed. The trap chain is a linked list of all the traps constituting the command set. Traps, however, can be added and deleted on the fly via commands from the remote host or entered by a user via the keyboard or menus. The DPRS can be used as a tool to dynamically alter the behavior of an existing host application communication system. The DPRS can function as a stand alone flexible command processor or can work in parallel or serial fashion with a conventional command processor.
Abstract:
A content processing system determines the content of input discourse. The content processing system includes a lexicon, which contains definitional characteristics for words and terms, and a knowledge catalog. The knowledge catalog provides independent and parallel static ontologies to accurately represent a broad coverage of concepts that define knowledge. The knowledge catalog also includes a dynamic level for expanding concepts in the static ontologies. A theme parsing system determines the thematic context of input discourse including identifying the topics of the discourse, generating summarized versions of the discourse, and identifying the key content of the discourse. A knowledge catalog processor processes the extracted content for use with the knowledge catalog. A theme vector processor identifies the most important themes in the input discourse, and maps these themes into the knowledge catalog. The content indexing processor extends the static ontologies into the dynamic level based on the specific terminology contained in the input discourse, and attaches the dynamic level into concepts in the static ontologies that were mapped by the theme vector processor.
Abstract:
Application programs compiled for a first, "source", computer are translated, from their object form, for execution on a section, "target", computer. The translated application programs are linked or otherwise bound with a translation of the source computer system software. The translated system software operates on the image of the source computer address space in the target computer exactly as it did in the source computer. The semantics of the source computer system softwae are thus preserved identically. In addition, a virtual hardware environment is provided in the target computer to manage events and to deal with differences in the address space layouts between the source and target computers.
Abstract:
A method and apparatus for providing computer-assisted implicit and explicit assistance. If no implicit assist actions are desired or indicated, a logical process is initiated to determine whether explicit assistance should be undertaken. If implicit assistance is indicated, a list of action alternatives is displayed for the user. If explicit assistance is desired by the user, particular object(s) from which the assistance may be inferred are entered into an assistance operation. An attempt is made to recognize possible intents expressed by the objects entered into the assistance process. If no user intent is, in fact, recognized, the assist operation is usually terminated. If a possible intent is recognized, the actual intent is hypothesized. A check is further undertaken, to determine whether a hypothesis is in fact available. If no hypothesis is found, the process permits the user to supply a proposed action. If no hypothesis is found and no user action is proposed, assistance efforts terminate. However, if a hypothesis is available, preparations for execution are undertaken. A final inquiry is made as to whether to undertake the hypothesized assist. If the response to an inquiry whether to assist as hypothesized is affirmative, execution of the hypothesized action is undertaken. A pen-based computer preferably implements the indicated functions.
Abstract:
An improved multi-unit analyzer is provided by virtue of the present invention. An analog-to-digital converter (A/D) digitizes a multi-unit signal, a pulse detector triggers when the multi-unit signal reaches a specified threshold, and a pulse sampler outputs a packet of samples of the signal around the point where the signal passes the threshold. The packet is then transformed into a point on a plane, the P-plane. The transformed point is compared to one or more rectangles in the P-plane, to determine if the point falls within any rectangle. If a point falls within a rectangle, a pulse is output on a line corresponding to the rectangle. This compare and output process is given a high priority. A low priority process displays the packet of samples in a window corresponding to the rectangle which encloses the point in the P-plane which results from the transformation of the packet of samples. A display also exists to display the points in the P-plane and the rectangles, thus allowing a user to view the relationship between the points and the rectangles. The points can be rotated to fit into the rectangles. The output pulses reflect the separation of multiple spikes of the multi-unit signal into individual single-unit signals.
Abstract:
A language processing system includes a mechanism for measuring the syntax trees of sentences of material to be translated and a mechanism for truncating syntax trees in response to the measuring mechanism. In a particular embodiment, a Score Function is provided for disambiguating or truncating ambiguities on the basis of composite scores, generated at different stages of the processing.
Abstract:
Sentences from a source language having first rules of grammar are translated into a target language having second rules of grammar; the resulting sentences are meaningful but ungrammatical. The ungrammatical sentences are changed into grammatical sentences by a method that includes identifying grammar markers and grammar marker patterns in the translated put ungrammatical sentence. A first database includes grammar rules and is consulted for each grammar marker that appears in the ungrammatical sentence. A second database includes self-correction rules and is consulted for each grammar marker pattern that appears. The grammar rules and self-correction rules are applied by a correction scheme. The resulting sentences are substantially perfectly grammatical.
Abstract:
Documents represented as bitmap images are transformed into coded textual data and coded graphics data by graphics and textual recognizers, which use a standard notation for recording the results of the document recognition processes, including any ambiguities, in a document description language. Recognized portions of the document, represented as editable coded data, such as for example ASCII, are placed in elements, defined in the document description language, with all contents of an element sharing some common characteristic. Elements can include, for example: character-string-elements, questionable-character-elements, questionable-word-elements, verified-word-elements, alternate-word-elements, segment-elements, and arc-elements.
Abstract:
A morphological analyzer for analyzing the morphology of an input word, the analyzer including a recognition engine which includes means for finding a stem within the input word; and means for identifying suffixes attached to the stem, wherein the finding means and suffix identifying means cooperate to conduct morphological analysis of the input word from the root to the affix and wherein the recognition engine performs inflectional and derivational analysis, the derivational analysis using more than two derivational suffixes.