Patent search ap:("Amazon Technologies Page Inc.") AND inv:"Utkarsh Prateek"

1.

发明授权
Item recognition using context data 有权

公开(公告)号：US10043069B1

公开(公告)日：2018-08-07

申请号：US14196669

申请日：2014-03-04

Applicant: Amazon Technologies, Inc.

Inventor： Yue Liu , Utkarsh Prateek , Avnish Sikka , Matthew Daniel Hart , Emilie Noelle McConville , Sonjeev Jahagirdar

IPC: G06K9/00

Abstract: A system for recognizing objects and/or text in image data may use context data to perform object/text recognition. The system may also use context data when determining potential functions to execute in response to recognizing the object/text. Context data may be gathered based on device sensor data, user profile data such as the behavior of a user or the behavior of those in a user's social network, or other factors. Recognition processing and/or function selection may be configured to account for context data when operating to improve output results.

2.

发明授权
Recognizing text from frames of image data using contextual information 有权
Title translation: 使用上下文信息识别来自图像数据帧的文本

公开(公告)号：US09355336B1

公开(公告)日：2016-05-31

申请号：US14259905

申请日：2014-04-23

Applicant: Amazon Technologies, Inc.

Inventor： Sonjeev Jahagirdar , Matthew Joseph Cole , David Paul Ramos , Utkarsh Prateek , Emilie Noelle McConville , Ankur Datta , Laura Varnum Finney , Yue Liu , Bhavesh Anil Doshi , Avnish Sikka , Michael Vanne

IPC: G06K9/00 , G06K9/62

CPC classification number: G06K9/6217 , G06K9/00979 , G06K9/723 , G06K2209/01

Abstract: Disclosed are techniques for recognizing text from one or more frames of image data using contextual information. In some implementations, image data including a captured textual item is processed to identify an entity in the image data. A context can be selected using the entity, where the context corresponds to a dictionary. Text in the captured textual item can be identified using the dictionary. The identified text can be output to a display device.

Abstract translation: 公开了使用上下文信息从一个或多个图像数据帧识别文本的技术。在一些实现中，处理包括捕获的文本项的图像数据以识别图像数据中的实体。可以使用实体选择上下文，其中上下文对应于字典。捕获的文本项目中的文本可以使用字典来识别。识别的文本可以输出到显示设备。

3.

发明授权
Using a front-facing camera to improve OCR with a rear-facing camera 有权
Title translation: 使用前置摄像头，使用后置摄像头改善OCR

公开(公告)号：US09269009B1

公开(公告)日：2016-02-23

申请号：US14283115

申请日：2014-05-20

Applicant: Amazon Technologies, Inc.

Inventor： Yue Liu , Sonjeev Jahagirdar , Matthew Joseph Cole , Utkarsh Prateek , Emilie Noelle McConville , Daniel Makoto Wilenson , Avnish Sikka

IPC: G06K9/18 , G06K9/00

CPC classification number: G06K9/18 , G06K9/00302 , G06K9/00664 , G06K9/033 , G06K2209/01

Abstract: Various embodiments enable a computing device to incorporate frame selection or preprocessing techniques into a text recognition pipeline in an attempt to improve text recognition accuracy in various environments and situations. For example, a mobile computing device can capture images of text using a first camera, such as a rear-facing camera, while capturing images of the environment or a user with a second camera, such as a front-facing camera. Based on the images captured of the environment or user, one or more image preprocessing parameters can be determined and applied to the captured images in an attempt to improve text recognition accuracy.

Abstract translation: 各种实施例使得计算设备能够将帧选择或预处理技术合并到文本识别流水线中，以试图改善各种环境和情况下的文本识别精度。例如，移动计算设备可以使用诸如后置摄像机之类的第一照相机捕获文本的图像，同时利用诸如前置摄像机的第二照相机拍摄环境图像或用户。基于捕获的环境或用户的图像，可以确定一个或多个图像预处理参数并将其应用于捕获的图像，以提高文本识别精度。

4.

发明授权
Merging optical character recognized text from frames of image data 有权

公开(公告)号：US09659224B1

公开(公告)日：2017-05-23

申请号：US14230471

申请日：2014-03-31

Applicant: Amazon Technologies, Inc.

Inventor： Matthew Joseph Cole , Sonjeev Jahagirdar , Matthew Daniel Hart , David Paul Ramos , Ankur Datta , Utkarsh Prateek , Emilie Noelle McConville , Prashant Hegde , Avnish Sikka

IPC: G06K9/18 , G06K9/00

CPC classification number: G06K9/18 , G06K9/00979 , G06K9/6292 , G06K9/72 , G06K2209/01 , G06K9/00449 , G06K9/00463 , G06K9/00442

Abstract: Disclosed are techniques for merging optical character recognized (OCR'd) text from frames of image data. In some implementations, a device sends frames of image data to a server, where each frame includes at least a portion of a captured textual item. The server performs optical character recognition (OCR) on the image data of each frame. When OCR'd text from respective frames is returned to the device from the server, the device can perform matching operations on the text, for instance, using bounding boxes and/or edit distance processing. The device can merge any identified matches of OCR'd text from different frames. The device can then display the merged text with any corrections.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification