Abstract:
Provided is a device including: a display unit configured to display handwritten content based on an analog handwritten input of a user; a user input unit that receives a user input of selecting a portion of the handwritten content displayed on the display unit; and a control unit reproduces a segment of multimedia content, which corresponds to the portion of the handwritten content, from the multimedia content synchronized with the handwritten content.
Abstract:
The invention relates to a method for recognizing handwriting on a physical surface on the basis of three-dimensional signals originating from sensors of a terminal, the method being characterized in that the signals are obtained on the basis of at least 3 different types of sensors, and in that it comprises steps of sampling, according to 3 axes and over a sliding time window, of inertial signals originating from the sensors, fusing the sampled signals into a 9-dimensional vector for each sampling period, converting the fused signals into a sequence of characteristic 9-dimensional vectors, and, when a signal characteristic of an input start has been detected, storing the sequence of characteristic vectors in a list of sequences of characteristic vectors, the preceding steps being repeated until the detection of a signal characteristic of an input end, the method furthermore comprising, on detection of said signal characteristic of an input end, a step of recognizing a word on the basis of the list of sequences of characteristic vectors created over the time window.
Abstract:
Process, and related apparatus, that exploits psycho-physiological aspects involved in generation and perception of handwriting for directly inferring from the trace on the paper (or any other means on which the author writes by hand) the interpretation of writing, i.e. the sequence of characters that the trace is intended to represent.
Abstract:
A system and method for computing confidence in an output of a text recognition system includes performing character recognition on an input text image with a text recognition system to generate a candidate string of characters. A first representation is generated, based on the candidate string of characters, and a second representation is generated based on the input text image. A confidence in the candidate string of characters is computed based on a computed similarity between the first and second representations in a common embedding space.
Abstract:
A method of identifying a string formed from a number of hand-written characters is disclosed. The method starts by determining character probabilities for each hand-written character in the string. Each character probability represents the likelihood of the respective hand-written character being a respective one of a number of predetermined characters. Next, template probabilities for the string are determined. Each template probability represents the likelihood of the string corresponding to a respective one of a number of templates. Each template represents a respective combination of character types. The step of determining the template probabilities for the string includes the sub-steps of determining the number of characters in the string, selecting templates having an identical number of characters, and obtaining a template probability for each selected template.
Abstract:
Systems and methods are provided for recognizing handwritten characters drawn on a paper form using a digital pen that records stroke coordinates corresponding to respective pen strokes. In one embodiment, a field on the paper form is assigned a lexical inference level. For example, the field may be assigned a word level, a word prefix level, and/or a word stem level. The assigned lexical inference level is used to recognize one or more stroke coordinates corresponding to pen strokes written in the field. Recognized characters are then used to create or modify an inference lexicon used to perform handwriting recognition for the entire field. In one embodiment, the inference lexicon is used for handwriting recognition in the same field on subsequently processed forms.
Abstract:
A system for automatically recognizing a handwriting image and converting such image to text data including a sequence of validated words, has an image input device, a number of handwriting recognition engines, and control unit. A first handwriting recognition engine is responsive to the image input device, for analyzing the data file and providing one or more possible text words for each successive word in the data file. The first handwriting recognition engine further provides a resemblance indication for each possible text word indicating a level of resemblance between its appearance and the appearance of the handwritten word in the data file. In the event that there is not a high level of confidence in the selection of the first handwriting recognition engine, a selection of a validated word is based on the selections of one or more of the other handwriting recognition engines.
Abstract:
There are provided: an n-fold-character recognizing part for collectively recognizing an unmatched portion without segmenting character candidate patterns character by character for an image of a read-wise skipped portion, i.e., the unmatched portion upon word verification; and an n-fold-character recognizing dictionary referred to by the n-fold-character recognizing part upon recognition; to thereby conduct re-recognition independent of instability of character segmentation even when the portion read-wise skipped by the word verification includes two or more characters.
Abstract:
A method and system for recognizing user input information including cursive handwriting and spoken words. A time-delayed neural network having an improved architecture is trained at the word level with an improved method, which, along with preprocessing improvements, results in a recognizer with greater recognition accuracy. Preprocessing is performed on the input data and, for example, may include resampling the data with sample points based on the second derivative to focus the recognizer on areas of the input data where the slope change per time is greatest. The input data is segmented, featurized and fed to the time-delayed neural network which outputs a matrix of character scores per segment. The neural network architecture outputs a separate score for the start and the continuation of a character. A dynamic time warp (DTW) is run against dictionary words to find the most probable path through the output matrix for that word, and each word is assigned a score based on the least costly path that can be traversed through the output matrix. The word (or words) with the overall lowest score (or scores) are returned. A DTW is similarly used in training, whereby the sample ink only need be labeled at the word level.
Abstract:
In word recognition using the character recognition result, recognition processing is performed for an input character string that corresponds to a word to be recognized, a probability at which characteristics obtained as the result of character recognition are generated by conditioning characters of words contained in a word dictionary that stores in advance candidates of words to be recognized. The thus obtained probability is divided by a probability at which characteristics obtained as the result of character recognition are generated, and each of the division results obtained relevant to the characters of the words contained in the word dictionary is multiplied relevant to all the characters. The recognition results of the above words are obtained based on the multiplication results.