METHOD AND DEVICE FOR INPUTTING HANDWRITING CHARACTER

    Publication No.: US20170344817A1

    Publication date: 2017-11-30

    Application No.: US15535190

    Filing date: 2015-01-28

    IPC classification: G06K9/00 G06F3/0488 G06K9/34

    Abstract: A method and an electronic device for inputting handwriting character are provided. The electronic device comprises a touch screen, a memory, and a processor. The processor is configured to perform the functions of the method. The method comprises steps of: adding a handwriting input on the touch screen; detecting a position of an initial point of the handwriting input; determining an input area for the handwriting input among the plurality of input areas of the touch screen based on the position of the initial point of the handwriting input; determining an operation of the handwriting input based on the position of the initial point of the handwriting input and performing the determined operation; and upon completion of the handwriting input, recognizing the input as a character and displaying the recognized character in the determined input area on the touch screen.
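
The area-selection step in this abstract can be illustrated with a minimal Python sketch. It assumes rectangular input areas and a stroke given as a list of (x, y) points; the names `InputArea` and `find_input_area` are hypothetical and do not come from the patent, and the operation-selection and recognition steps are not modelled.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple

Point = Tuple[int, int]


@dataclass(frozen=True)
class InputArea:
    """Hypothetical rectangular input area; right and bottom are exclusive."""
    name: str
    left: int
    top: int
    right: int
    bottom: int

    def contains(self, x: int, y: int) -> bool:
        return self.left <= x < self.right and self.top <= y < self.bottom


def find_input_area(areas: Sequence[InputArea],
                    stroke: List[Point]) -> Optional[InputArea]:
    """Pick the area containing the stroke's initial point.

    Only the first point matters here: later points may leave the area
    without changing the result. Returns None for an empty stroke or when
    the initial point lies outside every area.
    """
    if not stroke:
        return None
    x, y = stroke[0]
    for area in areas:
        if area.contains(x, y):
            return area
    return None
```

In this sketch a stroke that starts in one area and drifts into a neighbouring one is still assigned to the area where it started, which mirrors the abstract's reliance on the initial point alone.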

    4. Text recognition based on recognition units
    Granted invention patent (in force)

    Publication No.: US09536180B2

    Publication date: 2017-01-03

    Application No.: US14142967

    Filing date: 2013-12-30

    Applicant: Google Inc.

    IPC classification: G06K9/62 G06K9/72 G06F17/22

    Abstract: Methods and systems for grapheme splitting of text input for recognition are provided. A method may include receiving a text input in a script and segmenting the text input into one or more graphemes. Each of the one or more graphemes may be split into one or more recognition units based on one or more recognition unit identification criteria associated with the script. Next, a text recognition system may be trained using the recognition units. Text input may be handwritten text input received from a user or a scanned image of text.
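
A rough Python sketch of the two steps named in this abstract (segmenting into graphemes, then splitting graphemes into recognition units) follows. It approximates a grapheme as a base character plus any following combining marks, which is simpler than full Unicode grapheme clustering, and it uses a per-script set of "separable" marks as the splitting criterion. Both choices, and the function names, are assumptions made for illustration rather than the patent's method.

```python
import unicodedata
from typing import List, Set


def segment_graphemes(text: str) -> List[str]:
    """Group each base character with the combining marks that follow it.

    Approximation: any character whose Unicode category starts with "M"
    (Mn, Mc, Me) is attached to the preceding grapheme.
    """
    graphemes: List[str] = []
    for ch in text:
        if graphemes and unicodedata.category(ch).startswith("M"):
            graphemes[-1] += ch
        else:
            graphemes.append(ch)
    return graphemes


def split_grapheme(grapheme: str, separable_marks: Set[str]) -> List[str]:
    """Split one grapheme into recognition units.

    Hypothetical criterion: marks listed in separable_marks become their
    own units; all other marks stay attached to the base character.
    """
    if not grapheme:
        return []
    units = [grapheme[0]]
    for ch in grapheme[1:]:
        if ch in separable_marks:
            units.append(ch)
        else:
            units[0] += ch
    return units


def recognition_units(text: str, separable_marks: Set[str]) -> List[str]:
    """Flatten the text into the units a recognizer would be trained on."""
    units: List[str] = []
    for grapheme in segment_graphemes(text):
        units.extend(split_grapheme(grapheme, separable_marks))
    return units
```

Splitting off frequently reused marks keeps the set of distinct recognition units small, since the recognizer then learns each base shape and each mark once instead of every combination.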

    6. Method of handling complex variants of words through prefix-tree based decoding for Devanagiri OCR
    Granted invention patent (in force)

    Publication No.: US09262699B2

    Publication date: 2016-02-16

    Application No.: US13828060

    Filing date: 2013-03-14

    Abstract: An electronic device and method identify a block of text in a portion of an image of real world captured by a camera of a mobile device, slice sub-blocks from the block and identify characters in the sub-blocks that form a first sequence to a predetermined set of sequences to identify a second sequence therein. The second sequence may be identified as recognized (as a modifier-absent word) when not associated with additional information. When the second sequence is associated with additional information, a check is made on pixels in the image, based on a test specified in the additional information. When the test is satisfied, a copy of the second sequence in combination with the modifier is identified as recognized (as a modifier-present word). Storage and use of modifier information in addition to a set of sequences of characters enables recognition of words with or without modifiers.
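
The lookup flow described in this abstract can be sketched in Python as a prefix tree whose terminal nodes optionally carry modifier information. The class names, the representation of the modifier information as a (modifier, pixel test) pair, and the choice to return `None` when the pixel test fails are all assumptions for illustration; the abstract does not specify what happens in that last case.

```python
from typing import Callable, Dict, List, Optional, Tuple

Image = List[List[int]]
# Hypothetical "additional information": the modifier to append and a test
# that inspects image pixels to decide whether the modifier is present.
ModifierInfo = Tuple[str, Callable[[Image], bool]]


class TrieNode:
    def __init__(self) -> None:
        self.children: Dict[str, "TrieNode"] = {}
        self.is_end = False
        self.modifier: Optional[ModifierInfo] = None


class PrefixTree:
    def __init__(self) -> None:
        self.root = TrieNode()

    def insert(self, sequence: str,
               modifier: Optional[ModifierInfo] = None) -> None:
        """Store a character sequence, optionally with modifier information."""
        node = self.root
        for ch in sequence:
            node = node.children.setdefault(ch, TrieNode())
        node.is_end = True
        node.modifier = modifier

    def decode(self, sequence: str, image: Image) -> Optional[str]:
        """Return the recognized word, or None if nothing is recognized."""
        node = self.root
        for ch in sequence:
            child = node.children.get(ch)
            if child is None:
                return None
            node = child
        if not node.is_end:
            return None
        if node.modifier is None:
            return sequence  # modifier-absent word
        modifier, pixel_test = node.modifier
        if pixel_test(image):
            return sequence + modifier  # modifier-present word
        return None  # assumption: a failed pixel test means no recognition
```

Storing the modifier test on the node means one stored sequence can stand for a word with its modifier, so the tree does not need a separate entry for every variant.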

    7. Lower modifier detection and extraction from devanagari text images to improve OCR performance
    Granted invention patent (in force)

    Publication No.: US09064191B2

    Publication date: 2015-06-23

    Application No.: US13791188

    Filing date: 2013-03-08

    IPC classification: G06K9/78 G06K9/32

    Abstract: Systems, apparatus and methods for extracting lower modifiers from a word image, before performing optical character recognition (OCR), based on a plurality of tests comprising a first test, a second test and a third test are presented. The method obtains the word image and performs a plurality of tests (e.g., a first test, a second test and a third test). The first test determines whether a vertical line spanning the height of the word image exists. The second test determines whether a jump of a number of components in the lower portion of the word image exists. The third test determines sparseness in a lower portion of the word image. The plurality of tests may run sequentially and/or in parallel. Results from the plurality of tests are used to decide whether a lower modifier exists by comparing and accumulating test results from the plurality of tests.
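
The three tests can be sketched in Python on a binary word image (a non-empty rectangular list of rows, 1 for ink). Several details are assumptions not stated in the abstract: "components" are approximated by horizontal ink runs per row, the lower portion is the bottom 30% of rows, the thresholds are arbitrary, and the results are accumulated by a simple two-out-of-three vote in which a missing full-height line, a component jump, and a sparse lower portion each count toward a lower modifier.

```python
from typing import List

Image = List[List[int]]  # binary word image, 1 = ink; assumed non-empty


def has_full_height_line(img: Image) -> bool:
    """First test: is some column inked in every row?"""
    return any(all(row[x] for row in img) for x in range(len(img[0])))


def count_runs(row: List[int]) -> int:
    """Count horizontal ink runs in one row (a stand-in for components)."""
    runs, prev = 0, 0
    for px in row:
        if px and not prev:
            runs += 1
        prev = px
    return runs


def lower_rows(img: Image, lower_fraction: float = 0.3) -> Image:
    """Return the bottom rows that make up the assumed lower portion."""
    n = max(1, round(len(img) * lower_fraction))
    return img[-n:]


def has_component_jump(img: Image, min_jump: int = 2) -> bool:
    """Second test: does the run count change sharply between lower rows?"""
    counts = [count_runs(r) for r in lower_rows(img)]
    return any(abs(b - a) >= min_jump for a, b in zip(counts, counts[1:]))


def is_lower_sparse(img: Image, max_density: float = 0.2) -> bool:
    """Third test: is the ink density of the lower portion small?"""
    rows = lower_rows(img)
    ink = sum(sum(r) for r in rows)
    total = sum(len(r) for r in rows)
    return ink / total <= max_density


def has_lower_modifier(img: Image) -> bool:
    """Accumulate the three results with a hypothetical 2-of-3 vote."""
    votes = [not has_full_height_line(img),
             has_component_jump(img),
             is_lower_sparse(img)]
    return sum(votes) >= 2
```

Because the three functions are independent of one another, they could be run sequentially or in parallel before voting, as the abstract allows.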

    9. System and methods for arabic text recognition and arabic corpus building
    Granted invention patent (lapsed)

    Publication No.: US08761500B2

    Publication date: 2014-06-24

    Application No.: US13892289

    Filing date: 2013-05-12

    IPC classification: G06K9/62

    Abstract: A method for automatically recognizing Arabic text includes building an Arabic corpus comprising Arabic text files written in different writing styles and ground truths corresponding to each of the Arabic text files, storing writing-style indices in association with the Arabic text files, digitizing a line of Arabic characters to form an array of pixels, dividing the line of the Arabic characters into line images, forming a text feature vector from the line images, training a Hidden Markov Model using the Arabic text files and ground truths in the Arabic corpus in accordance with the writing-style indices, and feeding the text feature vector into a Hidden Markov Model to recognize the line of Arabic characters.
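
The feature-extraction part of this abstract (pixel array, line images, feature vector) can be sketched in Python as follows. The sketch assumes the line images are fixed-width vertical strips and uses ink density per strip as the feature; both are illustrative assumptions, since the abstract does not say how the line is divided or which features are used. Corpus building and Hidden Markov Model training and decoding are not shown.

```python
from typing import List

Image = List[List[int]]  # binary pixel array for one text line, 1 = ink


def split_into_strips(line: Image, strip_width: int) -> List[Image]:
    """Divide the line into vertical strips, left to right.

    The last strip may be narrower when the line width is not a multiple
    of strip_width.
    """
    width = len(line[0])
    return [[row[x:x + strip_width] for row in line]
            for x in range(0, width, strip_width)]


def feature_vector(line: Image, strip_width: int) -> List[float]:
    """Return one ink-density value per strip (hypothetical feature)."""
    features: List[float] = []
    for strip in split_into_strips(line, strip_width):
        total = sum(len(row) for row in strip)
        ink = sum(sum(row) for row in strip)
        features.append(ink / total)
    return features
```

A sequence of per-strip values like this is the kind of observation sequence a Hidden Markov Model consumes, one observation per strip.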

    10. REDUNDANT ASPECT RATIO DECODING OF DEVANAGARI CHARACTERS
    Invention application (under examination, published)

    Publication No.: US20140023275A1

    Publication date: 2014-01-23

    Application No.: US13844641

    Filing date: 2013-03-15

    IPC classification: G06K9/58

    Abstract: An electronic device and method receive a block sliced from a rectangular portion of an image of a scene of real world captured by a camera and use a property of the block to operate one of multiple optical character recognition (OCR) decoders. In an illustrative aspect, a first OCR decoder is configured to recognize characters whose property satisfies the test based on a first limit, the first limit being obtained by reducing a predetermined limit by an overlap amount. In this illustrative aspect, a second OCR decoder is configured to recognize characters whose property does not satisfy the test based on a second limit, the second limit being obtained by increasing the predetermined limit by the overlap amount. When the property of the block satisfies the test, the first OCR decoder is operated and alternatively the second OCR decoder is operated, resulting in candidates for a character being identified.
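
The overlapping limits in this abstract can be made concrete with a small Python sketch. It assumes the block property is the aspect ratio (width divided by height, as the title suggests) and that the test is "property is at least the limit"; the direction of the test, the class name `DecoderRanges`, and the string labels are assumptions for illustration.

```python
from dataclasses import dataclass


def aspect_ratio(width: int, height: int) -> float:
    """Assumed block property: width divided by height."""
    return width / height


@dataclass(frozen=True)
class DecoderRanges:
    limit: float    # predetermined limit used to pick a decoder at run time
    overlap: float  # amount by which each decoder's range is widened

    @property
    def first_limit(self) -> float:
        return self.limit - self.overlap

    @property
    def second_limit(self) -> float:
        return self.limit + self.overlap

    def first_covers(self, value: float) -> bool:
        """Characters the first decoder is configured to recognize."""
        return value >= self.first_limit

    def second_covers(self, value: float) -> bool:
        """Characters the second decoder is configured to recognize."""
        return value < self.second_limit

    def select(self, value: float) -> str:
        """Pick the decoder to operate for a block with this property."""
        return "first" if value >= self.limit else "second"
```

Under these assumptions both decoders cover values between `limit - overlap` and `limit + overlap`, so a block whose measured property lands slightly on the wrong side of the limit is still sent to a decoder configured for it.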
