On-Screen Guideline-Based Selective Text Recognition
    81.
    发明申请
    On-Screen Guideline-Based Selective Text Recognition 有权
    基于屏幕指导的选择性文本识别

    公开(公告)号:US20110123115A1

    公开(公告)日:2011-05-26

    申请号:US12626520

    申请日:2009-11-25

    摘要: A live video stream captured by an on-device camera is displayed on a screen with an overlaid guideline. Video frames of the live video stream are analyzed for a video frame with acceptable quality. A text region is identified in the video frame approximate to the on-screen guideline and cropped from the video frame. The cropped image is transmitted to an optical character recognition (OCR) engine, which processes the cropped image and generates text in an editable symbolic form (the OCR'ed text). A confidence score is determined for the OCR'ed text and compared with a threshold value. If the confidence score exceeds the threshold value, the OCR'ed text is outputted.

    摘要翻译: 由设备上相机拍摄的实时视频流将显示在具有重叠指南的屏幕上。 分析实时视频流的视频帧,以获得可接受的质量的视频帧。 在视频帧中识别文本区域,其近似于屏幕上的指南并从视频帧中裁剪。 裁剪的图像被传输到光学字符识别(OCR)引擎,其处理裁剪的图像并以可编辑的符号形式(OCR的文本)生成文本。 确定OCR文本的置信度,并与阈值进行比较。 如果置信度分数超过阈值,则输出OCR的文本。

    Media material analysis of continuing article portions
    82.
    发明申请
    Media material analysis of continuing article portions 有权
    继续文章部分的媒体材料分析

    公开(公告)号:US20080107338A1

    公开(公告)日:2008-05-08

    申请号:US11644009

    申请日:2006-12-22

    IPC分类号: G06K9/34

    CPC分类号: G06K9/00469

    摘要: The present invention relates to systems and methods for analyzing media material having articles continuing across multiple pages. A media material analyzer includes a segmenter and an article composer. The segmenter identifies block segments associated with columnar body test in the media material. The article composer determines which of the identified block segments belong to a continuing article extending across multiple pages in the media material based on language statistics information and continuation transition information.

    摘要翻译: 本发明涉及用于分析介质材料的系统和方法,所述介质材料具有跨越多页的物品。 媒体材料分析器包括分割器和文章作曲者。 分割器识别与介质材料中柱状体测试相关的块段。 文章作曲者基于语言统计信息和连续转换信息确定哪些标识的块段属于在媒体材料中的多个页面上延伸的连续文章。

    DATABASE FOR MIXED MEDIA DOCUMENT SYSTEM
    83.
    发明申请
    DATABASE FOR MIXED MEDIA DOCUMENT SYSTEM 有权
    混合媒体文件系统的数据库

    公开(公告)号:US20070050411A1

    公开(公告)日:2007-03-01

    申请号:US11461164

    申请日:2006-07-31

    IPC分类号: G06F7/00

    CPC分类号: G06F17/3002 G06F17/30047

    摘要: A Mixed Media Reality (MMR) system and associated techniques are disclosed. The MMR system provides mechanisms for forming a mixed media document that includes media of at least two types (e.g., printed paper as a first medium and digital content and/or web link as a second medium). In one particular embodiment, the MMR system includes a content-based retrieval database configured with an index table to represent two-dimensional geometric relationships between objects extracted from a printed document in a way that allows look-up using a text-based index. A ranked set of document, page and location hypotheses can be computed given data from the index table. The techniques effectively transform features detected in an image patch into textual terms (or other searchable features) that represent both the features themselves and the geometric relationship between them. A storage facility can be used to store additional characteristics about each document image patch.

    摘要翻译: 公开了混合媒体现实(MMR)系统及其相关技术。 MMR系统提供用于形成包括至少两种类型的媒体(例如,作为第一媒体的打印纸和作为第二媒体的数字内容和/或web链接)的混合媒体文档的机制。 在一个具体实施例中,MMR系统包括配置有索引表的基于内容的检索数据库,以表示以允许使用基于文本的索引进行查找的方式从打印文档提取的对象之间的二维几何关系。 可以根据索引表中的数据计算一组排序的文档,页面和位置假设。 这些技术有效地将图像补丁中检测到的特征转换成代表特征本身和它们之间的几何关系的文本术语(或其他可搜索特征)。 可以使用存储设备来存储关于每个文档图像补丁的附加特征。

    DATA ORGANIZATION AND ACCESS FOR MIXED MEDIA DOCUMENT SYSTEM
    84.
    发明申请
    DATA ORGANIZATION AND ACCESS FOR MIXED MEDIA DOCUMENT SYSTEM 有权
    混合媒体文件系统的数据组织和访问

    公开(公告)号:US20070047819A1

    公开(公告)日:2007-03-01

    申请号:US11461147

    申请日:2006-07-31

    IPC分类号: G06K9/46 G06F7/00

    摘要: A Mixed Media Reality (MMR) system and associated techniques are disclosed. The MMR system provides mechanisms for forming a mixed media document that includes media of at least two types (e.g., printed paper as a first medium and digital content and/or web link as a second medium). In one particular embodiment, the MMR system includes a content-based retrieval database configured with an index table to represent two-dimensional geometric relationships between objects extracted from a printed document in a way that allows look-up using a text-based index. A ranked set of document, page and location hypotheses can be computed given data from the index table. The techniques effectively transform features detected in an image patch into textual terms (or other searchable features) that represent both the features themselves and the geometric relationship between them. A storage facility can be used to store additional characteristics about each document image patch.

    摘要翻译: 公开了混合媒体现实(MMR)系统及其相关技术。 MMR系统提供用于形成包括至少两种类型的媒体(例如,作为第一媒体的打印纸和作为第二媒体的数字内容和/或web链接)的混合媒体文档的机制。 在一个具体实施例中,MMR系统包括配置有索引表的基于内容的检索数据库,以表示以允许使用基于文本的索引进行查找的方式从打印文档提取的对象之间的二维几何关系。 可以根据索引表中的数据计算一组排序的文档,页面和位置假设。 这些技术有效地将图像补丁中检测到的特征转换成代表特征本身和它们之间的几何关系的文本术语(或其他可搜索特征)。 可以使用存储设备来存储关于每个文档图像补丁的附加特征。

    GESTURE-BASED SELECTIVE TEXT RECOGNITION
    88.
    发明申请
    GESTURE-BASED SELECTIVE TEXT RECOGNITION 有权
    基于GESTURE的选择性文本识别

    公开(公告)号:US20110081083A1

    公开(公告)日:2011-04-07

    申请号:US12575015

    申请日:2009-10-07

    IPC分类号: G06K9/18

    摘要: An image is displayed on a touch screen. A user's underline gesture on the displayed image is detected. The area of the image touched by the underline gesture and a surrounding region approximate to the touched area are identified. Skew for text in the surrounding region is determined and compensated. A text region including the text is identified in the surrounding region and cropped from the image. The cropped image is transmitted to an optical character recognition (OCR) engine, which processes the cropped image and returns OCR'ed text. The OCR'ed text is outputted.

    摘要翻译: 图像显示在触摸屏上。 检测到用户在显示图像上的下划线手势。 识别由下划线手势触摸的图像的区域和接近触摸区域的周边区域。 确定并补偿周边地区的文字偏差。 在周围区域中识别包括文本的文本区域,并从图像中裁剪。 裁剪的图像被传输到光学字符识别(OCR)引擎,其处理裁剪的图像并返回OCR的文本。 输出OCR的文本。

    AUTOMATED TECHNIQUES FOR COMPARING CONTENTS OF IMAGES
    89.
    发明申请
    AUTOMATED TECHNIQUES FOR COMPARING CONTENTS OF IMAGES 有权
    自动化技术用于比较图像的内容

    公开(公告)号:US20070269139A1

    公开(公告)日:2007-11-22

    申请号:US11749606

    申请日:2007-05-16

    IPC分类号: G06K9/54

    摘要: Automated techniques for comparing contents of images. For a given image (referred to as an “input image”), a set of images (referred to as “a set of candidate images”) are processed to determine if the set of candidate images comprises an image whose contents or portions thereof match contents included in a region of interest in the input image.

    摘要翻译: 用于比较图像内容的自动化技术。 对于给定图像(称为“输入图像”),处理一组图像(称为“一组候选图像”),以确定候选图像的集合是否包括其内容或部分匹配的图像 包含在输入图像中的感兴趣区域中的内容。