-
1.
公开(公告)号:US08965117B1
公开(公告)日:2015-02-24
申请号:US14109204
申请日:2013-12-17
Applicant: Amazon Technologies, Inc.
Inventor: Oleg Rybakov , Christopher John Lish , Chang Yuan , Junxiong Jia , Rakesh Madhavan Nambiar , Matias Omar Gregorio Benitez
CPC classification number: G06K9/18 , G06K9/00456 , G06K9/183 , G06K9/4604 , G06K9/6227 , G06K9/6267 , G06T5/002
Abstract: Embodiments of the subject technology provide methods and systems of image pre-processing for improving the accuracy of optical character recognition (OCR) and reducing the power consumption on a given computing device (e.g., mobile computing device). The subject technology, in some examples, classifies an image received from a camera of a mobile computing device into one or more classes: 1) normal background, 2) textured background, 3) image with text, 4) image with barcode, 5) image with QR code, and/or 6) image with clutter or “garbage.” Based on the classes associated with the image, the subject technology may forgo certain image processing operations, when the image is not associated with a particular class, in order to save resources (e.g., CPU cycles, battery power, memory usage, etc.) on the mobile computing device.
Abstract translation: 主题技术的实施例提供了用于提高光学字符识别(OCR)的精度并降低给定计算设备(例如,移动计算设备)的功耗的图像预处理的方法和系统。 在一些示例中,主题技术将从移动计算设备的相机接收的图像分类为一个或多个类别:1)正常背景,2)纹理背景,3)具有文本的图像,4)具有条形码的图像,5) 具有QR码的图像和/或6)具有杂波或“垃圾”的图像。基于与图像相关联的类别,当图像与特定类别不相关时,主题技术可以放弃某些图像处理操作 以在移动计算设备上节省资源(例如,CPU周期,电池电量,存储器使用等)。
-
公开(公告)号:US09536161B1
公开(公告)日:2017-01-03
申请号:US14307090
申请日:2014-06-17
Applicant: Amazon Technologies, Inc.
Inventor: Christopher John Lish , Oleg Rybakov , Sonjeev Jahagirdar , Junxiong Jia , Neil David Cooper , Avnish Sikka
CPC classification number: H04N5/23245 , G01S3/00 , G06K9/00664 , G06K2009/3291 , H04N5/232 , H04N5/23219 , H04N5/247
Abstract: Various embodiments describe systems and methods for utilizing a reduced amount of processing capacity for incoming data over time, and, in response to detecting a scene-change-event, notify one or more data processors that a scene-change-event has occurred, and cause incoming data to be processed as new data. In some embodiments, an incoming frame can be compared with a reference frame to determine a difference between the reference frame and the incoming frame. The reference frame may be correlated to a latest scene-change-event. In response to a determination that the difference does not meet one or more difference criteria, a user interface or at least one processor of the computing device can be notified to reduce processing of incoming data over time. In response to a determination that the difference meets the one or more difference criteria, the user interface or the at least one processor can be notified that a scene-change-event has occurred. Incoming data to the computing device is then treated as new and processed as those under an active condition. The current incoming frame can be selected as a new reference frame for detecting next scene-change-event.
Abstract translation: 各种实施例描述了随着时间的推移对于输入数据利用减少量的处理能力的系统和方法,并且响应于检测到场景改变事件,通知一个或多个数据处理器已经发生场景变化事件,以及 将传入的数据作为新数据进行处理。 在一些实施例中,输入帧可以与参考帧进行比较,以确定参考帧和输入帧之间的差异。 参考帧可以与最新的场景变化事件相关联。 响应于差异不符合一个或多个差异标准的确定,可以通知用户界面或计算设备的至少一个处理器以减少输入数据随时间的处理。 响应于差异满足一个或多个差异标准的确定,可以向用户界面或至少一个处理器通知场景变化事件已经发生。 然后将接收到计算设备的数据视为新的,并处理为处于活动状态的数据。 可以将当前输入帧选择为用于检测下一个场景改变事件的新参考帧。
-
公开(公告)号:US09160993B1
公开(公告)日:2015-10-13
申请号:US13945823
申请日:2013-07-18
Applicant: Amazon Technologies, Inc.
Inventor: Christopher John Lish , Geoffrey Scott Heller , Jim Oommen Thomas , Chang Yuan , Oleg Rybakov
CPC classification number: H04N9/3185 , G06F3/0425 , G06F3/0488 , H04N5/23219 , H04N5/23229 , H04N5/23293 , H04N9/3194
Abstract: Approaches enable the projection of one or more visual elements, such as one or more dynamically changing graphical elements, that can substantially bound, or otherwise at least partially surround or identify, an object recognized by a computing device. The computing device can project the graphical elements to collectively appear as a bounding element for the recognized/actionable object or object portion. As such, the graphical elements can appear as a bounding element that adorns, decorates, highlights, and/or emphasizes, etc., the recognized/actionable object or object portion. The graphical elements to be dynamic. For example, the graphical elements can be projected to move around individually over time, while still appearing to at least partially surround the recognized/actionable object or object portion. Further, the graphical elements can be used to improve various object recognition approaches.
Abstract translation: 方法使得能够基本上绑定或以其他方式至少部分地围绕或识别由计算设备识别的对象的一个或多个可视元素的投影,诸如一个或多个动态变化的图形元素。 计算设备可以投影图形元素以集体显示为识别/可操作的对象或对象部分的边界元素。 因此,图形元素可以显示为对已识别/可操作的对象或对象部分进行装饰,装饰,突出显示和/或强调等的边界元素。 图形元素是动态的。 例如,图形元素可以被投影以随着时间逐渐移动,同时仍然显示为至少部分地围绕识别/可操作的对象或对象部分。 此外,图形元素可以用于改进各种对象识别方法。
-
-