Detecting errors in recognized text
    1.
    Granted invention patent
    Status: in force

    Publication No.: US09384389B1

    Publication date: 2016-07-05

    Application No.: US13612273

    Filing date: 2012-09-12

    IPC classification: G06K9/00

    Abstract: Some examples include detecting errors in text that has been recognized using automated text recognition technology. For instance, errors in the recognized text may be detected based on glyph image similarity and the use of a language model, dictionary information, or the like. Some implementations may group together glyphs based on association of the glyphs with the same glyph identifier and a similarity of the appearance of the glyphs. Furthermore, the words associated with each glyph may be checked against a language model, such as to check a spelling or other validity of the words, and a score may be assigned to each group of glyphs based on the validity of the words corresponding to the glyphs in that group. Groups that have a score that fails to meet a threshold may be reviewed by a person or may undergo automated correction techniques.
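
    The grouping and scoring idea in this abstract can be illustrated with a minimal Python sketch. It is not the patented method: the `Glyph` fields, the Euclidean shape distance, the greedy grouping, the plain word set standing in for a language model, and both thresholds are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Glyph:
    glyph_id: str   # character the recognizer assigned to this glyph
    shape: tuple    # hypothetical appearance feature vector
    word: str       # recognized word that contains this glyph


def distance(a, b):
    """Euclidean distance between two feature vectors (an assumed metric)."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def group_glyphs(glyphs, max_dist=1.0):
    """Greedy grouping: same glyph_id and a shape close to the group's first member."""
    groups = []
    for g in glyphs:
        for grp in groups:
            rep = grp[0]
            if rep.glyph_id == g.glyph_id and distance(rep.shape, g.shape) <= max_dist:
                grp.append(g)
                break
        else:
            groups.append([g])
    return groups


def score_group(group, dictionary):
    """Fraction of glyphs in the group whose word is found in the dictionary."""
    valid = sum(1 for g in group if g.word.lower() in dictionary)
    return valid / len(group)


def flag_suspect_groups(glyphs, dictionary, threshold=0.5, max_dist=1.0):
    """Return the groups whose score fails to meet the threshold."""
    return [
        grp for grp in group_glyphs(glyphs, max_dist)
        if score_group(grp, dictionary) < threshold
    ]
```

    In this sketch, glyphs labeled "m" that look alike but mostly occur in non-words (for example "bam" where the page says "barn") end up in one low-scoring group, which can then be sent to review or automated correction as a batch.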

    System and method for increasing the available workspace of a graphical user interface
    2.
    Invention patent application
    Status: under examination (published)

    Publication No.: US20060048067A1

    Publication date: 2006-03-02

    Application No.: US10930365

    Filing date: 2004-08-31

    IPC classification: G06F17/00

    CPC classification: G06F3/0481 G06F2203/04804

    Abstract: An improved system and method for increasing the available workspace of a graphical user interface by providing reduced opacity of an element in the graphical user interface to make the workspace beneath the semi-transparent element visible. Later, the semi-transparent element may be made opaque again for better visibility to a user. An opacity manager may be operably coupled to a graphics interface of an operating system to change the opacity of an element of the graphical user interface. Any type of element of a graphical user interface may have its opacity reduced, including a window, a dialog box, a message box, a toolbar, a control, a button, a menu, and so forth. The system and method may reduce or increase the opacity of an element of the graphical user interface in response to any event including a system event, an application event, or a user interface event.
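
    As a rough illustration of the event-driven behavior described above, the following Python sketch models an opacity manager as a small state holder. The class layout, the event names `focus_lost` and `focus_gained`, and the default reduced opacity of 0.3 are hypothetical; a real implementation would call into the operating system's graphics interface, which is not modeled here.

```python
class OpacityManager:
    """Tracks per-element opacity in the range 0.0 (transparent) to 1.0 (opaque)."""

    # Hypothetical event names; the abstract only says "any event".
    REDUCE_EVENTS = {"focus_lost"}
    RESTORE_EVENTS = {"focus_gained"}

    def __init__(self, reduced=0.3):
        self.reduced = reduced
        self.opacity = {}  # element name -> current opacity

    def register(self, element):
        """Start tracking an element (window, toolbar, menu, ...) as fully opaque."""
        self.opacity[element] = 1.0

    def handle_event(self, element, event):
        """Reduce or restore the element's opacity depending on the event."""
        if event in self.REDUCE_EVENTS:
            self.opacity[element] = self.reduced
        elif event in self.RESTORE_EVENTS:
            self.opacity[element] = 1.0
        return self.opacity[element]
```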

    Identification of text-block frames
    3.
    Granted invention patent
    Status: in force

    Publication No.: US08515176B1

    Publication date: 2013-08-20

    Application No.: US13332120

    Filing date: 2011-12-20

    IPC classification: G06K9/18

    Abstract: Determination of an underlying grid structure that facilitates layout of East Asian text is disclosed. The underlying grid structure includes both a size of character frames and a size of a text block frame. The East Asian text may be obtained from a scan of printed material that has the text formatted according to layout conventions established by the publisher. The text may be reformatted to appear on a display of an electronic device in a manner similar to the formatting in the original scanned document. Reformatting may include reflowing the text in order to fit a greater or lesser number of characters on a line. The reflowing may maintain character spacing from the original document and follow formatting rules against locating certain characters at the start or end of a line.
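
    The reflow step with line-start and line-end restrictions can be sketched as follows. This covers only the line-breaking rule, not the grid detection or character spacing; the two character sets and the "push the offending character to the next line" strategy are assumptions chosen for illustration, not taken from the patent.

```python
# Assumed example sets; real layout conventions list many more characters.
NO_LINE_START = set("。、，！？」』）")  # closing punctuation must not begin a line
NO_LINE_END = set("「『（")              # opening brackets must not end a line


def reflow(text, width):
    """Split text into lines of at most `width` characters, honoring the rules above."""
    lines = []
    i, n = 0, len(text)
    while i < n:
        end = min(i + width, n)
        # Shorten the line while the break would put a forbidden character
        # at the start of the next line or at the end of this one.
        while end < n and end - i > 1 and (
            text[end] in NO_LINE_START or text[end - 1] in NO_LINE_END
        ):
            end -= 1
        lines.append(text[i:end])
        i = end
    return lines
```

    For example, `reflow("今日は晴れ。明日は雨。", 5)` breaks the first line after four characters so that "。" does not start the second line.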

    Adaptive editing in user interface applications
    6.
    Granted invention patent
    Status: in force

    Publication No.: US09268754B1

    Publication date: 2016-02-23

    Application No.: US13565544

    Filing date: 2012-08-02

    Abstract: Systems and methods for improving automated processing of electronic media items are disclosed. In one embodiment, a computer system identifies a first set of regions of a page of an electronic media item, and a respective region type for at least one region of the first set, where the identification of the respective region type is based on one or more typographical features, historical data, and, optionally, the position and/or dimensions of the region. The computer system receives an identification by a user of a second set of regions of the page and a respective region type for at least one region of the second set, and then modifies the historical data when there is a difference between the regions and respective region types of the first set, and the regions and respective region types of the second set.
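
    One simple way to picture the feedback loop in this abstract is a table of counts that is updated whenever the user's labeling disagrees with the automatic one. The sketch below is an assumption-laden illustration: the class name, the use of a tuple of typographical features as a key, the default type "body", and the majority-vote prediction are all hypothetical.

```python
from collections import Counter, defaultdict


class RegionTypeHistory:
    """Historical data: how often users assigned each region type to a feature tuple."""

    def __init__(self):
        self.counts = defaultdict(Counter)  # features -> Counter of region types

    def predict(self, features, default="body"):
        """Return the region type most often chosen for these features so far."""
        seen = self.counts.get(features)
        if not seen:
            return default
        return seen.most_common(1)[0][0]

    def reconcile(self, features, predicted, user_type):
        """Update the history only when the user's type differs from the prediction."""
        if predicted != user_type:
            self.counts[features][user_type] += 1
            return True
        return False
```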

    Skew detection for vertical text
    7.
    Granted invention patent
    Status: in force

    Publication No.: US09110926B1

    Publication date: 2015-08-18

    Application No.: US13671495

    Filing date: 2012-11-07

    IPC classification: G06K9/18 G06F17/30 G06K9/00

    Abstract: A method for detecting and correcting skew in scanned vertical text includes identifying an image of vertically oriented characters, and identifying a plurality of vertical lines corresponding to character positions of the vertically oriented characters in the image. The method further includes generating an average slope of a subset of the plurality of lines, and causing the image to be deskewed based on the average slope.
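
    The slope-averaging step can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the patented procedure: each text column is given as a list of character-center `(x, y)` points, the "subset" is taken to be columns with at least `min_chars` characters, and the slope is fitted as dx/dy so that near-vertical lines do not produce infinite values. The sign of the result depends on the image coordinate convention.

```python
import math


def fit_dx_dy(points):
    """Least-squares slope of x as a function of y for one text column.

    Returns None when all points share the same y (no slope can be fitted).
    """
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    num = sum((y - mean_y) * (x - mean_x) for x, y in points)
    den = sum((y - mean_y) ** 2 for _, y in points)
    return num / den if den else None


def estimate_skew_degrees(columns, min_chars=3):
    """Average the slopes of sufficiently long columns and convert to an angle."""
    slopes = []
    for points in columns:
        if len(points) < min_chars:
            continue  # too few characters for a reliable fit (assumed subset rule)
        slope = fit_dx_dy(points)
        if slope is not None:
            slopes.append(slope)
    if not slopes:
        return 0.0
    return math.degrees(math.atan(sum(slopes) / len(slopes)))
```

    The returned angle would then be passed, with the opposite sign, to whatever image rotation routine is in use; that step is left out here.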
