-
Publication No.: US10438262B1
Publication date: 2019-10-08
Application No.: US14740045
Application date: 2015-06-15
Applicant: Amazon Technologies, Inc.
Inventor: Scott Daniel Helmer, Junxiong Jia
Abstract: A virtual browsing experience may be implemented that allows a user to move a mobile device within a physical environment in order to control browser navigation to different items on an associated display. The virtual browsing experience improves the user's ability to recall where previously-viewed items are located in the virtual browsing environment. In some embodiments, a mobile device may determine its position and/or orientation in a physical environment, and when movement of the mobile device is detected, a user interface on an associated display may digitally navigate through multiple items according to the position and/or orientation of the mobile device. The position and orientation of the mobile device may be determined from position information or data obtained by a sensor device of the mobile device, and appropriate subsets of items can be determined for display based on detected movement of the mobile device.
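The mapping from device position to a displayed subset of items can be illustrated with a minimal sketch. It is not taken from the patent: the row-by-row grid layout, the function name `visible_items`, and the parameters `columns`, `cell_size_m`, and `window` are all assumptions chosen for illustration.

```python
from typing import List, Sequence, Tuple


def visible_items(
    items: Sequence[str],
    position_m: Tuple[float, float],
    columns: int = 4,
    cell_size_m: float = 0.10,
    window: int = 2,
) -> List[str]:
    """Return the window x window block of items selected by device position.

    Hypothetical layout: items fill a virtual grid row by row, and each grid
    cell corresponds to cell_size_m metres of physical movement.
    """
    rows = (len(items) + columns - 1) // columns
    # Convert physical offsets to grid coordinates, clamped so the window
    # always stays inside the grid.
    col = min(max(int(position_m[0] // cell_size_m), 0), max(columns - window, 0))
    row = min(max(int(position_m[1] // cell_size_m), 0), max(rows - window, 0))
    selected = []
    for r in range(row, min(row + window, rows)):
        for c in range(col, min(col + window, columns)):
            index = r * columns + c
            if index < len(items):
                selected.append(items[index])
    return selected
```

Because the same physical position always selects the same block, returning the device to a spot brings back the items seen there, which is the recall benefit the abstract describes.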
-
Publication No.: US08965117B1
Publication date: 2015-02-24
Application No.: US14109204
Application date: 2013-12-17
Applicant: Amazon Technologies, Inc.
Inventor: Oleg Rybakov, Christopher John Lish, Chang Yuan, Junxiong Jia, Rakesh Madhavan Nambiar, Matias Omar Gregorio Benitez
CPC classification number: G06K9/18, G06K9/00456, G06K9/183, G06K9/4604, G06K9/6227, G06K9/6267, G06T5/002
Abstract: Embodiments of the subject technology provide methods and systems of image pre-processing for improving the accuracy of optical character recognition (OCR) and reducing the power consumption on a given computing device (e.g., mobile computing device). The subject technology, in some examples, classifies an image received from a camera of a mobile computing device into one or more classes: 1) normal background, 2) textured background, 3) image with text, 4) image with barcode, 5) image with QR code, and/or 6) image with clutter or “garbage.” Based on the classes associated with the image, the subject technology may forgo certain image processing operations, when the image is not associated with a particular class, in order to save resources (e.g., CPU cycles, battery power, memory usage, etc.) on the mobile computing device.
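The idea of skipping processing steps based on image classes can be sketched as follows. This is an illustrative assumption, not the patented method: the class labels, the step names in `STEP_REQUIREMENTS`, and the rule that clutter frames skip all processing are hypothetical.

```python
from typing import Dict, FrozenSet, List, Set

# Hypothetical mapping from a processing step to the classes that justify it.
STEP_REQUIREMENTS: Dict[str, FrozenSet[str]] = {
    "ocr": frozenset({"text"}),
    "barcode_decode": frozenset({"barcode"}),
    "qr_decode": frozenset({"qr_code"}),
}


def plan_processing(classes: Set[str]) -> List[str]:
    """Return only the steps whose required class is present on the image."""
    if "clutter" in classes:
        # Assumption: frames classified as clutter are not processed at all.
        return []
    return [step for step, needed in STEP_REQUIREMENTS.items() if needed & classes]
```

Steps that are never scheduled cost no CPU cycles or battery, which is where the resource savings would come from under this sketch.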
-
Publication No.: US11238513B1
Publication date: 2022-02-01
Application No.: US16579760
Application date: 2019-09-23
Applicant: Amazon Technologies, Inc.
Inventor: Scott Daniel Helmer, Junxiong Jia
Abstract: A virtual browsing experience may be implemented that allows a user to move a mobile device within a physical environment in order to control browser navigation to different items on an associated display. The virtual browsing experience improves the user's ability to recall where previously-viewed items are located in the virtual browsing environment. In some embodiments, a mobile device may determine its position and/or orientation in a physical environment, and when movement of the mobile device is detected, a user interface on an associated display may digitally navigate through multiple items according to the position and/or orientation of the mobile device. The position and orientation of the mobile device may be determined from position information or data obtained by a sensor device of the mobile device, and appropriate subsets of items can be determined for display based on detected movement of the mobile device.
-
Publication No.: US09305227B1
Publication date: 2016-04-05
Application No.: US14139752
Application date: 2013-12-23
Applicant: Amazon Technologies, Inc.
Inventor: Rakesh Madhavan Nambiar, Sonjeev Jahagirdar, Matthew Joseph Cole, Matias Omar Gregorio Benitez, Junxiong Jia, David Paul Ramos
IPC: G06K9/18
CPC classification number: G06K9/18, G06K9/00979, G06K9/6292, G06K2209/01
Abstract: Embodiments of the subject technology provide for a hybrid OCR approach which combines server and device side processing that can offset disadvantages of performing OCR solely on the server side or the device side. More specifically, the subject technology utilizes image characteristics such as glyph details and image quality measurements to opportunistically schedule OCR processing on the mobile device and/or server. In this regard, text extracted by a “faster” OCR engine (e.g., one with less latency) is displayed to a user, which is then updated by the result of a more accurate OCR engine (e.g., an OCR engine provided by the server). This approach allows factoring in additional parameters such as network latency and user preference for making scheduling decisions. Thus, the subject technology may provide significant gains in terms of reduced latency and increased accuracy by implementing one or more techniques associated with this hybrid OCR approach.
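A scheduling decision of the kind described in this abstract might look like the sketch below. Everything specific here is an assumption made for illustration: the `ImageStats` fields, the threshold values, and the function names `schedule_ocr` and `displayed_text` do not come from the patent.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ImageStats:
    quality: float        # hypothetical score: 0.0 (poor) to 1.0 (sharp)
    glyph_height_px: int  # typical glyph height in pixels


def schedule_ocr(stats: ImageStats, network_latency_ms: float,
                 prefer_accuracy: bool = True) -> List[str]:
    """Choose which OCR engines to run; all thresholds are illustrative."""
    easy_image = stats.quality >= 0.7 and stats.glyph_height_px >= 20
    network_usable = network_latency_ms <= 500
    engines = []
    if easy_image or not network_usable:
        engines.append("device")  # low-latency engine
    if network_usable and (prefer_accuracy or not easy_image):
        engines.append("server")  # more accurate engine
    return engines


def displayed_text(device_text: Optional[str], server_text: Optional[str]) -> str:
    """Show the fast result first and replace it once the server result arrives."""
    if server_text is not None:
        return server_text
    return device_text or ""
```

With both engines scheduled, the user sees the device result immediately and the server result replaces it later, matching the update behaviour in the abstract.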
-
Publication No.: US09536161B1
Publication date: 2017-01-03
Application No.: US14307090
Application date: 2014-06-17
Applicant: Amazon Technologies, Inc.
Inventor: Christopher John Lish, Oleg Rybakov, Sonjeev Jahagirdar, Junxiong Jia, Neil David Cooper, Avnish Sikka
CPC classification number: H04N5/23245, G01S3/00, G06K9/00664, G06K2009/3291, H04N5/232, H04N5/23219, H04N5/247
Abstract: Various embodiments describe systems and methods that utilize a reduced amount of processing capacity for incoming data over time and that, in response to detecting a scene-change-event, notify one or more data processors that a scene-change-event has occurred and cause incoming data to be processed as new data. In some embodiments, an incoming frame can be compared with a reference frame to determine a difference between the reference frame and the incoming frame. The reference frame may be correlated to the latest scene-change-event. In response to a determination that the difference does not meet one or more difference criteria, a user interface or at least one processor of the computing device can be notified to reduce processing of incoming data over time. In response to a determination that the difference meets the one or more difference criteria, the user interface or the at least one processor can be notified that a scene-change-event has occurred. Incoming data to the computing device is then treated as new and processed as it would be under an active condition. The current incoming frame can be selected as a new reference frame for detecting the next scene-change-event.
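The reference-frame comparison can be sketched in a few lines. This is a simplified illustration under stated assumptions: frames are flattened grayscale pixel lists, the difference criterion is a mean absolute difference against a hypothetical `threshold`, and the first frame is treated as a scene change. The patent does not specify any of these choices.

```python
from typing import Optional, Sequence

Frame = Sequence[int]  # flattened grayscale pixels, 0-255 (assumed format)


def mean_abs_diff(a: Frame, b: Frame) -> float:
    """Average per-pixel absolute difference between two equal-sized frames."""
    if len(a) != len(b) or not a:
        raise ValueError("frames must be non-empty and the same size")
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


class SceneChangeDetector:
    def __init__(self, threshold: float = 20.0) -> None:
        self.threshold = threshold  # hypothetical difference criterion
        self.reference: Optional[Frame] = None

    def update(self, frame: Frame) -> bool:
        """Return True when the frame starts a new scene.

        On a scene change the incoming frame becomes the new reference;
        otherwise the caller can reduce processing of incoming data.
        """
        if (self.reference is None
                or mean_abs_diff(self.reference, frame) >= self.threshold):
            self.reference = frame
            return True
        return False
```

A caller would run full processing whenever `update` returns True and throttle processing while it keeps returning False.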