-
公开(公告)号:US09536135B2
公开(公告)日:2017-01-03
申请号:US13526501
申请日:2012-06-18
IPC分类号: G06K9/00
CPC分类号: G06F3/017 , G06K9/00355 , G06K9/6277 , G06K9/6297
摘要: The subject disclosure is directed towards a technology by which dynamic hand gestures are recognized by processing depth data, including in real-time. In an offline stage, a classifier is trained from feature values extracted from frames of depth data that are associated with intended hand gestures. In an online stage, a feature extractor extracts feature values from sensed depth data that corresponds to an unknown hand gesture. These feature values are input to the classifier as a feature vector to receive a recognition result of the unknown hand gesture. The technology may be used in real time, and may be robust to variations in lighting, hand orientation, and the user's gesturing speed and style.
摘要翻译: 主题公开涉及一种通过处理深度数据(包括实时)来识别动态手势的技术。 在离线阶段,从与预期的手势相关联的深度数据的帧中提取的特征值训练分类器。 在在线阶段,特征提取器从对应于未知手势的感测深度数据中提取特征值。 将这些特征值作为特征向量输入到分类器,以接收未知手势的识别结果。 该技术可以实时使用,并且对于照明,手取向和用户的手势速度和风格的变化可能是鲁棒的。
-
公开(公告)号:US09055607B2
公开(公告)日:2015-06-09
申请号:US12323570
申请日:2008-11-26
申请人: Michael J. Sinclair , Yuan Kong , Zhengyou Zhang , Behrooz Chitsaz , David W. Williams , Silviu-Petru Cucerzan , Zicheng Liu
发明人: Michael J. Sinclair , Yuan Kong , Zhengyou Zhang , Behrooz Chitsaz , David W. Williams , Silviu-Petru Cucerzan , Zicheng Liu
CPC分类号: H04W88/06 , H04M1/72572 , H04M2250/12 , H04M2250/58 , H04W8/245 , H04W92/02 , H04W92/10
摘要: Multi-modal, multi-lingual devices can be employed to consolidate numerous items including, but not limited to, keys, remote controls, image capture devices, audio recorders, cellular telephone functionalities, location/direction detectors, health monitors, calendars, gaming devices, smart home inputs, pens, optical pointing devices or the like. For example, a corner of a cellular telephone can be used as an electronic pen. Moreover, the device can be used to snap multiple pictures stitching them together to create a panoramic image. A device can automate ignition of an automobile, initiate appliances, etc. based upon relative distance. The device can provide for near to eye capabilities for enhanced image viewing. Multiple cameras/sensors can be provided on a single device to provide for stereoscopic capabilities. The device can also provide assistance to blind, privacy, etc. by consolidating services.
摘要翻译: 可以使用多模式,多语言设备来整合许多项目,包括但不限于键,遥控器,图像捕获设备,音频记录器,蜂窝电话功能,位置/方向检测器,健康监视器,日历,游戏设备 智能家庭输入,笔,光学指向装置等。 例如,蜂窝电话的角落可以用作电子笔。 此外,该设备可以用于将多个图片拼接在一起以创建全景图像。 设备可以基于相对距离自动点火汽车,起动电器等。 该设备可以提供近眼睛的功能,以增强图像观看效果。 可以在单个设备上提供多个摄像机/传感器以提供立体能力。 该设备还可以通过整合服务来提供盲人,隐私等方面的帮助。
-
公开(公告)号:US08941710B2
公开(公告)日:2015-01-27
申请号:US13584633
申请日:2012-08-13
申请人: Christian Huitema , William A. S. Buxton , Jonathan E. Paff , Zicheng Liu , Rajesh Kutpadi Hegde , Zhengyou Zhang , Kori Marie Quinn , Jin Li , Michel Pahud
发明人: Christian Huitema , William A. S. Buxton , Jonathan E. Paff , Zicheng Liu , Rajesh Kutpadi Hegde , Zhengyou Zhang , Kori Marie Quinn , Jin Li , Michel Pahud
IPC分类号: H04N7/15 , H04N7/14 , H04N21/422 , H04N21/4223 , H04N21/442 , H04N21/4788 , H04L12/18
CPC分类号: H04N7/147 , H04L12/1827 , H04N7/142 , H04N7/15 , H04N21/42203 , H04N21/4223 , H04N21/44213 , H04N21/4788 , H04N2007/145
摘要: A system facilitates managing one or more devices utilized for communicating data within a telepresence session. A telepresence session can be initiated within a communication framework that includes a first user and one or more second users. In response to determining a temporary absence of the first user from the telepresence session, a recordation of the telepresence session is initialized to enable a playback of a portion or a summary of the telepresence session that the first user has missed.
摘要翻译: 系统便于管理用于在远程呈现会话内传送数据的一个或多个设备。 可以在包括第一用户和一个或多个第二用户的通信框架内启动远程呈现会话。 响应于从远程呈现会话确定暂时不存在第一用户,初始化远程呈现会话的记录,以便能够播放第一用户已经错过的远程呈现会话的部分或摘要。
-
公开(公告)号:US08797386B2
公开(公告)日:2014-08-05
申请号:US13092276
申请日:2011-04-22
申请人: Philip A. Chou , Zhengyou Zhang , Dinei Florencio
发明人: Philip A. Chou , Zhengyou Zhang , Dinei Florencio
IPC分类号: H04N13/02
CPC分类号: H04N13/271 , A61H3/061 , A61H2201/0157 , A61H2201/165 , A61H2201/501 , A61H2201/5048 , A61H2201/5058 , A61H2201/5092 , G01S15/025 , G01S15/89 , G01S15/93 , H04N13/239 , H04R5/033 , H04R2420/07
摘要: A person is provided with the ability to auditorily determine the spatial geometry of his current physical environment. A spatial map of the current physical environment of the person is generated. The spatial map is then used to generate a spatialized audio representation of the environment. The spatialized audio representation is then output to a stereo listening device which is being worn by the person.
摘要翻译: 一个人被赋予了能够自觉地确定他当前的物理环境的空间几何的能力。 生成人的当前物理环境的空间映射。 然后使用空间映射来生成环境的空间化音频表示。 然后将空间化音频表示输出到由人佩戴的立体声聆听装置。
-
公开(公告)号:US08737648B2
公开(公告)日:2014-05-27
申请号:US12472080
申请日:2009-05-26
申请人: Wei-ge Chen , Zhengyou Zhang
发明人: Wei-ge Chen , Zhengyou Zhang
IPC分类号: H04R5/02
CPC分类号: H04R27/00
摘要: A spatial element is added to communications, including over telephone conference calls heard through headphones or a stereo speaker setup. Functions are created to modify signals from different callers to create the illusion that the callers are speaking from different parts of the room.
摘要翻译: 一个空间元素添加到通信中,包括通过耳机听到的电话会议通话或立体声扬声器设置。 创建功能来修改来自不同呼叫者的信号,以创建呼叫者从房间的不同部分讲话的错觉。
-
公开(公告)号:US08675926B2
公开(公告)日:2014-03-18
申请号:US12796470
申请日:2010-06-08
申请人: Zhengyou Zhang , Qin Cai , Pieter R. Kasselman , Arthur H. Baker
发明人: Zhengyou Zhang , Qin Cai , Pieter R. Kasselman , Arthur H. Baker
IPC分类号: G06K9/00
CPC分类号: G06K9/00228 , G06K9/00906
摘要: Multiple images including a face presented by a user are accessed. One or more determinations are made based on the multiple images, such as a determination of whether the face included in the multiple images is a 3-dimensional structure or a flat surface and/or a determination of whether motion is present in one or more face components (e.g., eyes or mouth). If it is determined that the face included in the multiple images is a 3-dimensional structure or that that motion is present in the one or more face components, then an indication is provided that the user can be authenticated. However, if it is determined that the face included in the multiple images is a flat surface or that motion is not present in the one or more face components, then an indication is provided that the user cannot be authenticated.
摘要翻译: 访问包括用户呈现的脸部的多个图像。 基于多个图像进行一个或多个确定,例如确定包括在多个图像中的面是三维结构还是平面,和/或确定运动是否存在于一个或多个面中 组分(如眼睛或嘴巴)。 如果确定包括在多个图像中的面是三维结构或者该一个或多个面部组件中存在该运动,则提供用户可被认证的指示。 然而,如果确定包括在多个图像中的面是平面或者一个或多个面部组件中不存在运动,则提供用户不能被认证的指示。
-
公开(公告)号:US08670018B2
公开(公告)日:2014-03-11
申请号:US12789055
申请日:2010-05-27
申请人: Sharon K. Cunnington , Rajesh K. Hegde , Kori Quinn , Jin Li , Philip A. Chou , Zhengyou Zhang , Desney S. Tan
发明人: Sharon K. Cunnington , Rajesh K. Hegde , Kori Quinn , Jin Li , Philip A. Chou , Zhengyou Zhang , Desney S. Tan
IPC分类号: H04N7/14
摘要: Reaction information of participants to an interaction may be sensed and analyzed to determine one or more reactions or dispositions of the participants. Feedback may be provided based on the determined reactions. The participants may be given an opportunity to opt in to having their reaction information collected, and may be provided complete control over how their reaction information is shared or used.
摘要翻译: 可以感测和分析参与者对于相互作用的反应信息以确定参与者的一个或多个反应或处置。 可以基于确定的反应来提供反馈。 参与者可能有机会选择收集他们的反应信息,并且可以完全控制他们的反应信息如何共享或使用。
-
公开(公告)号:US08620009B2
公开(公告)日:2013-12-31
申请号:US12140283
申请日:2008-06-17
申请人: Zhengyou Zhang , James D. Johnston
发明人: Zhengyou Zhang , James D. Johnston
CPC分类号: H04S7/302 , H04S2400/11
摘要: Systems and methods for determining a virtual sound source position by determining an output for loudspeakers by the position of the loudspeakers in relation to a listener. The output of respective loudspeakers is generated using aural cues to give the listener knowledge of the virtual position of the virtual sound source. Both a gain in intensity and a delay are simulated.
摘要翻译: 用于通过扬声器相对于收听者的位置确定扬声器的输出来确定虚拟声源位置的系统和方法。 使用听觉提示产生各个扬声器的输出,以使聆听者了解虚拟声源的虚拟位置。 模拟强度和延迟的增益。
-
9.
公开(公告)号:US20130294710A1
公开(公告)日:2013-11-07
申请号:US13463934
申请日:2012-05-04
申请人: Philip Andrew Chou , Cha Zhang , Zhengyou Zhang , Shujie Liu
发明人: Philip Andrew Chou , Cha Zhang , Zhengyou Zhang , Shujie Liu
IPC分类号: G06K9/32
摘要: A temporal information integration dis-occlusion system and method for using historical data to reconstruct a virtual view containing an occluded area. Embodiments of the system and method use temporal information of the scene captured previously to obtain a total history. This total history is warped onto information captured by a camera at a current time in order to help reconstruct the dis-occluded areas. The historical data (or frames) from the total history match only a portion of the frames contained in the captured information. This warping yields warped history information. Warping is performed by using one of two embodiments to match points in an estimation of the current information to points in the captured information. Next, regions of current information are split using a classifier. The warped history information and the captured information then are merged to obtain an estimate for the current information and the reconstructed virtual view.
摘要翻译: 一种用于使用历史数据重建包含遮挡区域的虚拟视图的时间信息整合遮挡系统和方法。 系统和方法的实施例使用先前捕获的场景的时间信息来获得总历史。 这个总历史在当前时间由相机拍摄的信息扭曲,以帮助重建被遮挡的区域。 来自总历史记录的历史数据(或帧)仅匹配捕获信息中包含的帧的一部分。 这种扭曲产生扭曲的历史信息。 通过使用两个实施例中的一个实现扭曲,以将当前信息的估计中的点与捕获的信息中的点进行匹配。 接下来,使用分类器分割当前信息的区域。 然后将翘曲的历史信息和捕获的信息合并,以获得当前信息和重建的虚拟视图的估计。
-
公开(公告)号:US08401979B2
公开(公告)日:2013-03-19
申请号:US12618799
申请日:2009-11-16
申请人: Cha Zhang , Zhengyou Zhang
发明人: Cha Zhang , Zhengyou Zhang
IPC分类号: G06F15/18
CPC分类号: G06N99/005
摘要: Described is multiple category learning to jointly train a plurality of classifiers in an iterative manner. Each training iteration associates an adaptive label with each training example, in which during the iterations, the adaptive label of any example is able to be changed by the subsequent reclassification. In this manner, any mislabeled training example is corrected by the classifiers during training. The training may use a probabilistic multiple category boosting algorithm that maintains probability data provided by the classifiers, or a winner-take-all multiple category boosting algorithm selects the adaptive label based upon the highest probability classification. The multiple category boosting training system may be coupled to a multiple instance learning mechanism to obtain the training examples. The trained classifiers may be used as weak classifiers that provide a label used to select a deep classifier for further classification, e.g., to provide a multi-view object detector.
摘要翻译: 描述了多类学习,以迭代的方式联合训练多个分类器。 每个训练迭代将自适应标签与每个训练示例相关联,其中在迭代期间,任何示例的自适应标签能够由随后的重新分类改变。 以这种方式,任何错误标记的训练示例在训练期间由分类器校正。 训练可以使用维护由分类器提供的概率数据的概率多类别提升算法,或者获胜者全部多类别增强算法基于最高概率分类来选择自适应标签。 多类别增强训练系统可以耦合到多实例学习机制以获得训练示例。 经训练的分类器可以用作弱分类器,其提供用于选择用于进一步分类的深分类器的标签,例如提供多视图对象检测器。
-
-
-
-
-
-
-
-
-