-
公开(公告)号:US09465444B1
公开(公告)日:2016-10-11
申请号:US14319876
申请日:2014-06-30
Applicant: Amazon Technologies, Inc.
Inventor: Sharadh Ramaswamy
CPC classification number: G06F3/017 , G06K9/00389 , G06K9/2018 , G06K9/209 , G06K9/3241 , H04N5/2352
Abstract: The accuracy of object tracking using relatively low power gesture cameras can be improved by adjusting camera settings to ensure a sufficient level of contrast or texture to enable stereo disparity calculations for the representations of the object in the captured images. For general gesture input, for example, a user might use a variety of objects in various orientations, such that conventional face or object recognition processes may not be sufficient. Further, such processes are very resource intensive. By adjusting the camera settings (e.g., exposure, gain, and/or aperture) to ensure an adequate level of contrast, objects of an appropriate size and location can be tracked for purposes such as gesture input. Once such an object is identified, coordinates for a bounding box or other indicator can be transferred to any camera or sensor to be used for the tracking.
Abstract translation: 通过调整相机设置以确保足够的对比度或纹理水平,以便对拍摄图像中的对象的表示进行立体视差计算,可以提高使用相对低功率手势相机的对象跟踪的精度。 对于一般手势输入,例如,用户可以使用各种方向的各种对象,使得传统的面部或对象识别过程可能不足够。 此外,这样的过程是非常耗资源的。 通过调整相机设置(例如,曝光,增益和/或光圈)以确保足够的对比度,可以跟踪诸如手势输入之类的适当尺寸和位置的对象。 一旦识别出这样的物体,边界框或其他指示器的坐标可以传送到任何用于跟踪的相机或传感器。
-
公开(公告)号:US09424461B1
公开(公告)日:2016-08-23
申请号:US13929672
申请日:2013-06-27
Applicant: Amazon Technologies, Inc.
Inventor: Chang Yuan , Geoffrey Scott Heller , Oleg Rybakov , Sharadh Ramaswamy , Jim Oommen Thomas
IPC: G06K9/00
CPC classification number: G06K9/00201 , G06K9/00208 , G06K9/00214
Abstract: Various embodiments utilize two-dimensional (“2D”) and three-dimensional (“3D”) object features for purposes such as object recognition and/or image matching. For example, a user can capture an image (e.g., still images or video) of an object and can receive information about items that are determined to match the object. For example, the image can be analyzed to detect visual features (e.g., corners, edges, etc.) of the object and the detected visual features can be combined to generate a combined visual feature vector which can be used for object recognition, image matching, or other such purposes. Other approaches utilize the image to generate a 3D model of the object represented in the image, which can be used to determine at least one object or types of objects that match the object represented in the image.
Abstract translation: 各种实施例利用二维(“2D”)和三维(“3D”)对象特征用于诸如对象识别和/或图像匹配的目的。 例如,用户可以捕获对象的图像(例如,静止图像或视频),并且可以接收关于被确定为匹配对象的项目的信息。 例如,可以分析图像以检测对象的视觉特征(例如,角,边等),并且可以组合检测到的视觉特征以生成可用于对象识别,图像匹配的组合视觉特征向量 ,或其他此类用途。 其他方法利用图像来生成在图像中表示的对象的3D模型,其可以用于确定与图像中表示的对象匹配的对象的至少一个对象或类型。
-
公开(公告)号:US09417689B1
公开(公告)日:2016-08-16
申请号:US13896998
申请日:2013-05-17
Applicant: Amazon Technologies, Inc.
Inventor: Sharadh Ramaswamy
IPC: G06F3/01
CPC classification number: G06F3/01 , G06F3/017 , G06F3/0386
Abstract: Systems and approaches are provided for robustly determining the motion of a computing device. Multiple cameras on the device can each capture a sequence of images, and the images can be analyzed to determine motion of the device with respect to a user, an object, or scenery captured in the images. The estimated motion may be complemented with measurements from an inertial sensor such as a gyroscope or an accelerometer to provide more accurate estimations of device motion than can be provided by image data or inertial sensor data alone. A computing device can then be configured to detect device motion as user input such as to navigate a user interface or to remotely control movement of another electronic device.
Abstract translation: 提供了用于鲁棒地确定计算设备的运动的系统和方法。 设备上的多个摄像机可以各自捕获一系列图像,并且可以分析图像以确定设备相对于图像中捕获的用户,对象或风景的运动。 估计的运动可以用诸如陀螺仪或加速度计的惯性传感器的测量来补充,以提供比由图像数据或惯性传感器数据单独提供的装置运动更准确的估计。 然后可以将计算设备配置为检测设备运动作为用户输入,例如导航用户界面或远程控制另一电子设备的移动。
-
公开(公告)号:US09384551B2
公开(公告)日:2016-07-05
申请号:US13858813
申请日:2013-04-08
Applicant: Amazon Technologies, Inc.
Inventor: Sharadh Ramaswamy , Matthew Paul Bell , Dmitri Khoklov , David W. Stafford , Isaac Scott Noble
CPC classification number: G06T7/002 , G06T7/85 , H04N13/239 , H04N13/243 , H04N2213/001
Abstract: An electronic device can have two or more pairs of cameras capable of performing three-dimensional imaging. In order to provide accurate disparity information, these cameras should be sufficiently rectified. Automatic rectification can be performed by periodically capturing images with the cameras of interest, and locating matching feature points in corresponding images captured by those cameras. Small misalignment errors can be treated as linear translations, such that a set of linear equations can be used to solve for the misalignments. Another process can process a set of homographies for the cameras until a cost function converges. Various other approaches can be used as well, such as to directly solve for yaw, pitch, and roll errors. Once this information is obtained, the misalignment values (or related values) can be stored for use in correcting images subsequently captured by those cameras.
Abstract translation: 电子设备可以具有能够执行三维成像的两对或更多对相机。 为了提供准确的视差信息,这些摄像机应该被充分纠正。 可以通过周围拍摄感兴趣的摄像机的图像,并将匹配的特征点定位在由这些摄像机拍摄的相应图像中,从而进行自动校正。 小的对准误差可以被视为线性平移,使得可以使用一组线性方程来解决不对准。 另一个过程可以处理一组相机的同画,直到成本函数收敛。 也可以使用各种其他方法,例如直接解决偏航,俯仰和滚动误差。 一旦得到该信息,就可以存储未对准值(或相关值),以用于校正随后由这些相机拍摄的图像。
-
公开(公告)号:US09224060B1
公开(公告)日:2015-12-29
申请号:US14029528
申请日:2013-09-17
Applicant: Amazon Technologies, Inc.
Inventor: Sharadh Ramaswamy
CPC classification number: G06K9/00912 , G06K9/00228 , G06K9/00389 , G06K9/3241 , G06K2009/3291
Abstract: Systems and approaches are provided for tracking an object of interest using depth or disparity information, such as obtained by calculating stereo disparity between a pair of images. The depth or disparity information can be used as an additional signature for a template of the object of interest for tracking the object. A template that includes depth, distance, or disparity information for an object of interest may be invariant to the effects of lighting, such as shadows and changes in illumination conditions. Depth, distance, or disparity information can also provide information regarding shape and size that can be used to differentiate foreground objects. Depth, distance, or disparity information can also better handle occlusion. Depth, distance, or disparity information can also provide an additional disambiguating dimension for tracking an object.
Abstract translation: 提供了系统和方法,用于使用诸如通过计算一对图像之间的立体视差来获得的深度或视差信息来跟踪感兴趣的对象。 深度或视差信息可以用作跟踪对象的感兴趣对象的模板的附加签名。 包括感兴趣对象的深度,距离或视差信息的模板可能对照明的影响是不变的,例如阴影和照明条件的变化。 深度,距离或视差信息还可以提供关于可用于区分前景对象的形状和尺寸的信息。 深度,距离或视差信息也可以更好地处理闭塞。 深度,距离或视差信息还可以为跟踪对象提供额外的消歧尺寸。
-
公开(公告)号:US09111111B1
公开(公告)日:2015-08-18
申请号:US14034067
申请日:2013-09-23
Applicant: Amazon Technologies, Inc.
Inventor: Sharadh Ramaswamy
CPC classification number: G06F21/6218 , G06F2221/2111 , G06F2221/2139
Abstract: Systems and methods for implementing location-based security protocols for information and/or data files are disclosed. The location-based security protocols may be enforced by an organization to protect and/or provide additional levels of security for information and/or data files associated with the organization and stored and/or used by agents of the organization on their user devices, such as personal user devices. Location-based security protocols may be applied to data file functions that include generating, receiving, transmitting, sharing, backing-up, or rendering a data file. Data files that are to adhere to one or more location-based security protocols may be tagged, such as in metadata associated with the data file to indicate that it is subject to adherence to the one or more location-based security protocols.
Abstract translation: 公开了用于实现用于信息和/或数据文件的基于位置的安全协议的系统和方法。 基于位置的安全协议可以由组织来实施,以保护和/或为与组织相关联的信息和/或数据文件提供额外的安全级别,并且在组织的用户设备上存储和/或使用其,例如 作为个人用户设备。 基于位置的安全协议可以应用于包括生成,接收,发送,共享,备份或呈现数据文件的数据文件功能。 可以标记要遵守一个或多个基于位置的安全协议的数据文件,例如在与数据文件相关联的元数据中,以指示其遵守一个或多个基于位置的安全协议。
-
公开(公告)号:US09021020B1
公开(公告)日:2015-04-28
申请号:US13707329
申请日:2012-12-06
Applicant: Amazon Technologies, Inc.
Inventor: Sharadh Ramaswamy , Kenneth Mark Karakotsios
CPC classification number: G06Q30/06
Abstract: A processing device receives, from a user device, image information associated with an image, the image information providing an indication of an application installed on the user device or a second electronic device. The processing device determines a descriptor associated with the application based on analyzing the image information. The processing device compares the descriptor to one or more stored image descriptors associated with each of a plurality of known applications. Based at least in part on the comparing, the processing device determines identifying information associated with the application. The processing device sends the identifying information to the user device.
Abstract translation: 处理设备从用户设备接收与图像相关联的图像信息,所述图像信息提供安装在用户设备上的应用的指示或第二电子设备。 处理装置基于分析图像信息来确定与应用相关联的描述符。 处理装置将描述符与与多个已知应用中的每一个相关联的一个或多个存储的图像描述符进行比较。 至少部分地基于比较,处理设备确定与应用相关联的识别信息。 处理装置将识别信息发送给用户装置。
-
公开(公告)号:US20150077323A1
公开(公告)日:2015-03-19
申请号:US14029612
申请日:2013-09-17
Applicant: Amazon Technologies, Inc.
Inventor: Sharadh Ramaswamy , Jason Robert Weber
CPC classification number: G06F3/012 , G06F3/0304 , G06F3/04815 , G06F2200/1637
Abstract: Systems and approaches provide for user interfaces (UIs) that are based on object tracking. For example, the object may be a user's head or face. As the user moves his head or face and/or tilts a computing device, the content displayed on the computing device will adapt to the user's perspective. The content may include three-dimensional (3D) graphical elements projected onto a two-dimensional (2D) plane and/or the graphical elements can be associated with textural shading, shadowing, or reflections that change according to user or device motion to give the user the impression that the user is interacting with the graphical elements in 3D environment. To enhance the user experience, a state of motion of the device can be determined and jitter and/or latency corresponding to the rendering of content can be altered so as to minimize or decrease jitter when the device is stationary and/or to decrease or minimize latency when the device is in motion.
Abstract translation: 系统和方法提供基于对象跟踪的用户界面(UI)。 例如,对象可以是用户的头部或脸部。 当用户移动其头部或面部和/或倾斜计算设备时,计算设备上显示的内容将适应用户的观点。 内容可以包括投影到二维(2D)平面上的三维(3D)图形元素和/或图形元素可以与根据用户或设备运动而改变的纹理着色,阴影或反射相关联,以给出 使用者在3D环境中与图形元素交互的印象。 为了增强用户体验,可以确定设备的运动状态,并且可以改变与内容的呈现相对应的抖动和/或延迟,以便当设备静止时和/或降低或最小化时最小化或减少抖动 设备运行时的延迟。
-
公开(公告)号:US20140300704A1
公开(公告)日:2014-10-09
申请号:US13858813
申请日:2013-04-08
Applicant: Amazon Technologies, Inc.
Inventor: Sharadh Ramaswamy , Matthew Paul Bell , Dmitri Khoklov , David W. Stafford , Isaac Scott Noble
IPC: H04N13/02
CPC classification number: G06T7/002 , G06T7/85 , H04N13/239 , H04N13/243 , H04N2213/001
Abstract: An electronic device can have two or more pairs of cameras capable of performing three-dimensional imaging. In order to provide accurate disparity information, these cameras should be sufficiently rectified. Automatic rectification can be performed by periodically capturing images with the cameras of interest, and locating matching feature points in corresponding images captured by those cameras. Small misalignment errors can be treated as linear translations, such that a set of linear equations can be used to solve for the misalignments. Another process can process a set of homographies for the cameras until a cost function converges. Various other approaches can be used as well, such as to directly solve for yaw, pitch, and roll errors. Once this information is obtained, the misalignment values (or related values) can be stored for use in correcting images subsequently captured by those cameras.
Abstract translation: 电子设备可以具有能够执行三维成像的两对或更多对相机。 为了提供准确的视差信息,这些摄像机应该被充分纠正。 可以通过周围拍摄感兴趣的摄像机的图像,并将匹配的特征点定位在由这些摄像机拍摄的相应图像中,从而进行自动校正。 小的对准误差可以被视为线性平移,使得可以使用一组线性方程来解决不对准。 另一个过程可以处理一组相机的同画,直到成本函数收敛。 也可以使用各种其他方法,例如直接解决偏航,俯仰和滚动误差。 一旦得到该信息,就可以存储未对准值(或相关值),以用于校正随后由这些相机拍摄的图像。
-
-
-
-
-
-
-
-