摘要:
A character recognizer recognizes a handwritten input character. A sequence of points in two dimensional space representative of a stroke trajectory forming the input character is gathered. An input Directed Acyclic Graph is built with nodes representative of singular points at the beginning, end, and along the trajectory of the input character and with edges between nodes representative of an edge trajectory formed by the sequence of points of the input character between the singular points. Each edge in the input graph is described based on the shape, orientation and pen lift of the edge trajectory that the edge represents. The input graph is evaluated against model graphs, which are also Directed Acyclic Graphs, for all possible characters to find a path through a model graph that produces a best path similarity score with a corresponding path through the input graph. The input character is identified as an answer character represented by the model graph producing the best path similarity score. Each model graph for a reference character has nodes representative of singular points at the beginning, end, and along the stroke trajectory of the reference character. Edges between nodes in the model graph are representative of an edge trajectory formed by the sequence of points of the reference character between the singular points. Further each edge of the model graph has an “i” vector, a “j” vector and a “k” vector, and the “i,j,k” vectors indicate similarity values associated with the edge trajectory of the reference character and typical shapes, shape rotation and pen lift, respectively.
摘要:
Character model graphs are created, and the parameters of the model graphs are adjusted to optimize character recognition performed with the model graphs. In effect the character recognizer using the model graphs is trained. The model graphs are created in three stages. First, a vector quantization process is used on a set of raw samples of handwriting symbols to create a smaller set of generalized reference characters or symbols. Second, a character reference model graph structure is created by merging each generalized form model graph of the same character into a single character reference model graph. The merging is based on weighted Euclidian distance between parts of trajectory assigned to graph edges. As a last part of this second stage “type-similarity” vectors are assigned to model edges to describe similarities of given model edge to each shape and to each possible quantized value of other input graph edge parameters. Thus, similarity functions, or similarity values, are defined by different tables on different model edges. In the third stage, model creation further consists of minimizing recognition error by adjusting model graphs parameters. An appropriate smoothing approximation is used in the calculation of similarity score between input graph and model graphs. The input graph represents a word from a work sample set used for training, i.e. adjusting the model graph parameters. A recognition error is calculated as a function of the difference between similarity scores for best answers and the one correct answer for the word being recognized. The gradient of the recognition error as a function of change in parameters is computed and used to adjust the parameters. Model graphs with adjusted parameters are then used to recognize the words in a test set, and a percent of correct recognitions in the test set is calculated. The recognition error calculation with the work set, the parameter adjustment process, and the calculation of the percent of correct recognitions with the test set is repeated. After a number of iterations of this process, the optimum set of parameters for the model graphs will be found.
摘要:
The invention improves the appearance of freehand drawn lines and shapes in an electronic document by first recognizing freehand drawn lines and shapes and generating a line made up of sequential straight line segments for the freehand drawn line when the line does not form a closed line and generating a multiple straight-line-segment shape when the line forms a closed line. If a multiple segment shape is being reshaped, a basic shape is selected from reference ideal shapes as the basic shape of the multiple segment shape. The basic shape is adjusted to provide a specific shape as an improved shape for the freehand drawn shape. The recognition of the freehand drawn lines and shapes is accomplished by comparing source segments of a source freehand drawn line to a straight line and substituting a straight line segment for a source segment if the deviation between the source segment and the straight line is below a predetermined value. Sequential source segments of a source freehand drawn line are evaluated to determine if they are an angle segment. A segment pair of sequential source line segments are defined as an angle segment having two straight line segments intersecting at a common end point if both source segments of the segment pair are within a predetermined deviation value from a straight line. The straight line segments and angle segments are combined to form a multiple segment line if the line does not close on itself and a multiple segment shape if the line does close on itself.
摘要:
The invention improves the appearance of freehand drawn lines and shapes in an electronic document by first recognizing freehand drawn lines and shapes and generating a line made up of sequential straight line segments for the freehand drawn line when the line does not form a closed line and generating a multiple straight-line-segment shape when the line forms a closed line. If a multiple segment shape is being reshaped, a basic shape is selected from reference ideal shapes as the basic shape of the multiple segment shape. The basic shape is adjusted to provide a specific shape as an improved shape for the freehand drawn shape. The recognition of the freehand drawn lines and shapes is accomplished by comparing source segments of a source freehand drawn line to a straight line and substituting a straight line segment for a source segment if the deviation between the source segment and the straight line is below a predetermined value. Sequential source segments of a source freehand drawn line are evaluated to determine if they are an angle segment. A segment pair of sequential source line segments are defined as an angle segment having two straight line segments intersecting at a common end point if both source segments of the segment pair are within a predetermined deviation value from a straight line. The straight line segments and angle segments are combined to form a multiple segment line if the line does not close on itself and a multiple segment shape if the line does close on itself.
摘要:
Methods and systems are disclosed to aid in the detection of cancer or lesion in a mammogram images. Two mammogram images are input into an application that aids in determining the probability of a cancer or lesion being present in one or both of the images. The images are divided into different nodes and labels are applied to the nodes. The first node is compared to different variants of corresponding nodes on the second image as well as neighboring nodes on the first image. Based upon the comparisons, a unary and binary potential is calculated for the label that is applied to the node. The process is repeated for every possible label and for every node. Once the unary and binary potentials have been calculated, the potentials are input into a Conditional Random Field model to determine the probability of cancer for each node of the images.
摘要:
A method and apparatus for cursive script recognition employs an intermediate cursive words description language constructed of elements in the form of metastrokes wherein each of the metastrokes is a member of a metric space, the metric space being expressible as a matrix of likelihood of matching between input metastrokes and predefined metastrokes forming a vocabulary. Typically, a minimum of twenty metastrokes is used to form the vocabulary or set of allowable stroke elemental shapes, although as many as seventy may be used, with a correlation factor or measurement of similarity being defined between strings of metastrokes. Techniques for interpreting the metastrokes as words include substituting, adding, and deleting metastrokes in an input sequence, measuring similarity and comparing with entries in a dictionary of words constructed of metastroke sequences and variants of metastroke sequences. Weighting of similarity may include penalties for lack of similarity.
摘要:
The invention improves the appearance of freehand drawn lines and shapes in an electronic document by first recognizing freehand drawn lines and shapes and generating a line made up of sequential straight line segments for the freehand drawn line when the line does not form a closed line and generating a multiple straight-line-segment shape when the line forms a closed line. If a multiple segment shape is being reshaped, a basic shape is selected from reference ideal shapes as the basic shape of the multiple segment shape. The basic shape is adjusted to provide a specific shape as an improved shape for the freehand drawn shape. The recognition of the freehand drawn lines and shapes is accomplished by comparing source segments of a source freehand drawn line to a straight line and substituting a straight line segment for a source segment if the deviation between the source segment and the straight line is below a predetermined value. Sequential source segments of a source freehand drawn line are evaluated to determine if they are an angle segment. A segment pair of sequential source line segments are defined as an angle segment having two straight line segments intersecting at a common end point if both source segments of the segment pair are within a predetermined deviation value from a straight line. The straight line segments and angle segments are combined to form a multiple segment line if the line does not close on itself and a multiple segment shape if the line does close on itself.
摘要:
The invention improves the appearance of freehand drawn lines and shapes in an electronic document by first recognizing freehand drawn lines and shapes and generating a line made up of sequential straight line segments for the freehand drawn line when the line does not form a closed line and generating a multiple straight-line-segment shape when the line forms a closed line. If a multiple segment shape is being reshaped, a basic shape is selected from reference ideal shapes as the basic shape of the multiple segment shape. The basic shape is adjusted to provide a specific shape as an improved shape for the freehand drawn shape. The recognition of the freehand drawn lines and shapes is accomplished by comparing source segments of a source freehand drawn line to a straight line and substituting a straight line segment for a source segment if the deviation between the source segment and the straight line is below a predetermined value. Sequential source segments of a source freehand drawn line are evaluated to determine if they are an angle segment. A segment pair of sequential source line segments are defined as an angle segment having two straight line segments intersecting at a common end point if both source segments of the segment pair are within a predetermined deviation value from a straight line. The straight line segments and angle segments are combined to form a multiple segment line if the line does not close on itself and a multiple segment shape if the line does close on itself.
摘要:
Methods and systems are disclosed to aid in the detection of cancer or lesion in a mammogram images. Two mammogram images are input into an application that aids in determining the probability of a cancer or lesion being present in one or both of the images. The images are divided into different nodes and labels are applied to the nodes. The first node is compared to different variants of corresponding nodes on the second image as well as neighboring nodes on the first image. Based upon the comparisons, a unary and binary potential is calculated for the label that is applied to the node. The process is repeated for every possible label and for every node. Once the unary and binary potentials have been calculated, the potentials are input into a Conditional Random Field model to determine the probability of cancer for each node of the images.
摘要:
The invention improves the appearance of freehand drawn lines and shapes in an electronic document by first recognizing freehand drawn lines and shapes and generating a line made up of sequential straight line segments for the freehand drawn line when the line does not form a closed line and generating a multiple straight-line-segment shape when the line forms a closed line. If a multiple segment shape is being reshaped, a basic shape is selected from reference ideal shapes as the basic shape of the multiple segment shape. The basic shape is adjusted to provide a specific shape as an improved shape for the freehand drawn shape. The recognition of the freehand drawn lines and shapes is accomplished by comparing source segments of a source freehand drawn line to a straight line and substituting a straight line segment for a source segment if the deviation between the source segment and the straight line is below a predetermined value. Sequential source segments of a source freehand drawn line are evaluated to determine if they are an angle segment. A segment pair of sequential source line segments are defined as an angle segment having two straight line segments intersecting at a common end point if both source segments of the segment pair are within a predetermined deviation value from a straight line. The straight line segments and angle segments are combined to form a multiple segment line if the line does not close on itself and a multiple segment shape if the line does close on itself.