Patent search ap:("Amazon Technologies Page Inc.") AND inv:"Yue Liu"

21.

Granted invention patent
Using a front-facing camera to improve OCR with a rear-facing camera (in force)

Publication number: US09269009B1

Publication date: 2016-02-23

Application number: US14283115

Application date: 2014-05-20

Applicant: Amazon Technologies, Inc.

Inventor: Yue Liu, Sonjeev Jahagirdar, Matthew Joseph Cole, Utkarsh Prateek, Emilie Noelle McConville, Daniel Makoto Wilenson, Avnish Sikka

IPC: G06K9/18, G06K9/00

CPC classification number: G06K9/18, G06K9/00302, G06K9/00664, G06K9/033, G06K2209/01

Abstract: Various embodiments enable a computing device to incorporate frame selection or preprocessing techniques into a text recognition pipeline in an attempt to improve text recognition accuracy in various environments and situations. For example, a mobile computing device can capture images of text using a first camera, such as a rear-facing camera, while capturing images of the environment or a user with a second camera, such as a front-facing camera. Based on the images captured of the environment or user, one or more image preprocessing parameters can be determined and applied to the captured images in an attempt to improve text recognition accuracy.
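
As a rough illustration of the idea in this abstract, the sketch below picks a preprocessing parameter for the rear-camera image from a front-camera frame. It is not the patented method: the function names, the brightness thresholds (64 and 192), and the choice of gamma correction as the preprocessing step are all assumptions made for the example. Frames are plain lists of 8-bit grayscale rows.

```python
def mean_brightness(gray_frame):
    """Average pixel value (0-255) of a grayscale frame given as rows of ints."""
    total = sum(sum(row) for row in gray_frame)
    count = sum(len(row) for row in gray_frame)
    return total / count


def choose_gamma(ambient_brightness):
    """Map ambient brightness to a gamma value (thresholds are assumed)."""
    if ambient_brightness < 64:    # dark surroundings: brighten the text image
        return 0.5
    if ambient_brightness > 192:   # very bright surroundings: darken it
        return 1.5
    return 1.0                     # otherwise leave the image unchanged


def apply_gamma(gray_frame, gamma):
    """Apply out = 255 * (in / 255) ** gamma to every pixel."""
    return [[round(255 * (p / 255) ** gamma) for p in row] for row in gray_frame]


def preprocess_for_ocr(rear_frame, front_frame):
    """Choose a parameter from the front-camera frame, apply it to the rear one."""
    gamma = choose_gamma(mean_brightness(front_frame))
    return apply_gamma(rear_frame, gamma)
```

A call such as `preprocess_for_ocr(rear_frame, front_frame)` would run before handing the rear-camera image to whatever OCR engine is in use; a real pipeline could derive several parameters instead of a single gamma value.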

22.

Granted invention patent
Text recognition near an edge (in force)

Publication number: US09239961B1

Publication date: 2016-01-19

Application number: US14495589

Application date: 2014-09-24

Applicant: Amazon Technologies, Inc.

Inventor: Matthew Joseph Cole, Yue Liu, David Paul Ramos, Avnish Sikka

IPC: G06K9/00, G06K9/18, G06K9/32

CPC classification number: G06K9/00456, G06K9/2081, G06K9/325

Abstract: The recognition of text in an acquired image is improved by using general and type-specific heuristics that can determine the likelihood that a portion of the text is truncated at an edge of an image, frame, or screen. Truncated text can be filtered such that the user is not provided with an option to perform an undesirable task, such as to dial an incorrect number or connect to an incorrect Web address, based on recognizing an incomplete text string. The general and type-specific heuristics can be combined to improve confidence, and the image data can be pre-processed on the device before processing with an optical character recognition (OCR) engine. Multiple frames can be analyzed to attempt to recognize words or characters that might have been truncated in one or more of the frames.
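
The combination of a general heuristic and a type-specific heuristic described in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the `TextBox` type, the 4-pixel edge margin, the 10-digit phone-number rule, and the equal 0.5 weights are all hypothetical and do not come from the patent.

```python
import re
from dataclasses import dataclass


@dataclass
class TextBox:
    """A recognized string and its bounding box in frame pixel coordinates."""
    text: str
    left: int
    top: int
    right: int
    bottom: int


def touches_edge(box, frame_w, frame_h, margin=4):
    """General heuristic: the box lies within `margin` pixels of a frame edge."""
    return (box.left <= margin or box.top <= margin
            or box.right >= frame_w - margin
            or box.bottom >= frame_h - margin)


def looks_incomplete_phone(text):
    """Type-specific heuristic (assumed rule): fewer than 10 digits."""
    return len(re.sub(r"\D", "", text)) < 10


def truncation_score(box, frame_w, frame_h):
    """Combine both heuristics with equal, assumed weights."""
    score = 0.0
    if touches_edge(box, frame_w, frame_h):
        score += 0.5
    if looks_incomplete_phone(box.text):
        score += 0.5
    return score


def filter_truncated(boxes, frame_w, frame_h, threshold=1.0):
    """Drop boxes whose combined score reaches the threshold."""
    return [b for b in boxes
            if truncation_score(b, frame_w, frame_h) < threshold]
```

With a 640x480 frame, a complete number in the middle of the frame is kept, while a short digit string whose box reaches the right edge is filtered out, so no "dial" action would be offered for it. Requiring both heuristics to agree is one possible design choice; a lower threshold would filter more aggressively.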

