Data buddy
    1.
    发明授权
    Data buddy 有权
    资料好友

    公开(公告)号:US09055607B2

    公开(公告)日:2015-06-09

    申请号:US12323570

    申请日:2008-11-26

    摘要: Multi-modal, multi-lingual devices can be employed to consolidate numerous items including, but not limited to, keys, remote controls, image capture devices, audio recorders, cellular telephone functionalities, location/direction detectors, health monitors, calendars, gaming devices, smart home inputs, pens, optical pointing devices or the like. For example, a corner of a cellular telephone can be used as an electronic pen. Moreover, the device can be used to snap multiple pictures stitching them together to create a panoramic image. A device can automate ignition of an automobile, initiate appliances, etc. based upon relative distance. The device can provide for near to eye capabilities for enhanced image viewing. Multiple cameras/sensors can be provided on a single device to provide for stereoscopic capabilities. The device can also provide assistance to blind, privacy, etc. by consolidating services.

    摘要翻译: 可以使用多模式,多语言设备来整合许多项目,包括但不限于键,遥控器,图像捕获设备,音频记录器,蜂窝电话功能,位置/方向检测器,健康监视器,日历,游戏设备 智能家庭输入,笔,光学指向装置等。 例如,蜂窝电话的角落可以用作电子笔。 此外,该设备可以用于将多个图片拼接在一起以创建全景图像。 设备可以基于相对距离自动点火汽车,起动电器等。 该设备可以提供近眼睛的功能,以增强图像观看效果。 可以在单个设备上提供多个摄像机/传感器以提供立体能力。 该设备还可以通过整合服务来提供盲人,隐私等方面的帮助。

    Hierarchical video sub-volume search
    3.
    发明授权
    Hierarchical video sub-volume search 有权
    分层视频子卷搜索

    公开(公告)号:US08416990B2

    公开(公告)日:2013-04-09

    申请号:US12858301

    申请日:2010-08-17

    IPC分类号: G06K9/00

    摘要: Described is a technology by which video, which may be relatively high-resolution video, is efficiently processed to determine whether the video contains a specified action. The video corresponds to a spatial-temporal volume. The volume is searched with a top-k search that finds a plurality of the most likely sub-volumes simultaneously in a single search round. The score volumes of larger spatial resolution videos may be down-sampled into lower-resolution score volumes prior to searching.

    摘要翻译: 描述了一种技术,通过该技术可以高效地处理可能是相对高分辨率视频的视频,以确定视频是否包含指定的动作。 视频对应于空间 - 时间体积。 使用在单个搜索循环中同时找到多个最可能的子卷的top-k搜索来搜索该卷。 在搜索之前,较大的空间分辨率视频的得分体积可能被下采样为较低分辨率的分数体积。

    Multi-camera head pose tracking
    5.
    发明授权
    Multi-camera head pose tracking 有权
    多摄像机头姿态跟踪

    公开(公告)号:US08339459B2

    公开(公告)日:2012-12-25

    申请号:US12561154

    申请日:2009-09-16

    IPC分类号: H04N5/225

    摘要: Techniques and technologies for tracking a face with a plurality of cameras wherein a geometry between the cameras is initially unknown. One disclosed method includes detecting a head with two of the cameras and registering a head model with the image of the head (as detected by one of the cameras). The method also includes back projecting the other detected face image to the head model and determining a head pose from the back-projected head image. Furthermore, the determined geometry is used to track the face with at least one of the cameras.

    摘要翻译: 用于跟踪具有多个相机的面部的技术和技术,其中相机之间的几何形状最初是未知的。 一种公开的方法包括使用两个摄像机检测头部,并用头部的图像(由相机之一检测到)登记头部模型。 该方法还包括将另一个检测到的脸部图像反投影到头部模型并且从后投影的头部图像确定头部姿势。 此外,所确定的几何形状用于利用至少一个相机跟踪面部。

    AUTOMATIC LABELING OF A VIDEO SESSION
    6.
    发明申请
    AUTOMATIC LABELING OF A VIDEO SESSION 审中-公开
    视频会议的自动标签

    公开(公告)号:US20110096135A1

    公开(公告)日:2011-04-28

    申请号:US12604415

    申请日:2009-10-23

    CPC分类号: H04N7/14 H04N5/23219

    摘要: Described is labeling a video session with metadata representing a recognized person or object, such as to identify a person corresponding to a recognized face when that face is being shown during the video session. The identification may be made by overlaying text on the video session, e.g., the person's name and/or other related information. Facial recognition and/or other (e.g., voice) recognition may be used to identify a person. The facial recognition process may be made more efficient by using known narrowing information, such as calendar information that indicates who the invitees are to a meeting that is being shown in the video session.

    摘要翻译: 描述了用表示识别的人或物体的元数据来标记视频会话,例如当在视频会话期间正在显示该脸部时识别对应于识别的脸部的人物。 可以通过在视频会话上重叠文本,例如该人的姓名和/或其他相关信息来进行识别。 可以使用面部识别和/或其他(例如,语音)识别来识别人。 可以通过使用已知的缩小信息(例如指示被邀请者是谁正在视频会话中显示的会议的日历信息)来使面部识别过程更有效。

    System and Method Providing Improved Head Motion Estimations for Animation
    8.
    发明申请
    System and Method Providing Improved Head Motion Estimations for Animation 有权
    系统和方法为动画提供改进的头部运动估计

    公开(公告)号:US20100189310A1

    公开(公告)日:2010-07-29

    申请号:US12751705

    申请日:2010-03-31

    IPC分类号: G06K9/00

    摘要: The computer-readable media provides improved procedures to estimate head motion between two images of a face. Locations of a number of distinct facial features are determined in two images. The locations are converted into as a set of physical face parameters based on the symmetry of the identified distinct facial features. An estimation objective function is determined by: (a) estimating each of the set of physical parameters, (b) estimating a first head pose transform corresponding to the first image, and (c) estimating a second head pose transform corresponding to the second image. The motion is estimated between the two images based on the set of physical face parameters by multiplying each term of the estimation objective function by a weighted contribution factor based on the confidence of data corresponding to the estimation objective function.

    摘要翻译: 计算机可读介质提供改进的程序来估计脸部的两个图像之间的头部运动。 许多不同的脸部特征的位置在两个图像中确定。 基于识别出的不同面部特征的对称性,这些位置被转换为一组物理面部参数。 通过以下方式确定估计目标函数:(a)估计所述一组物理参数中的每一个,(b)估计与所述第一图像相对应的第一头部姿态变换,以及(c)估计与所述第二图像相对应的第二头部姿态变换 。 通过基于与估计目标函数对应的数据的置信度,将估计目标函数的每个项乘以加权的贡献因子,基于物理面参数的集合来估计两个图像之间的运动。

    System and method for devising a human interactive proof that determines whether a remote client is a human or a computer program
    9.
    发明授权
    System and method for devising a human interactive proof that determines whether a remote client is a human or a computer program 有权
    用于设计确定远程客户端是人类还是计算机程序的人类交互式证明的系统和方法

    公开(公告)号:US07725395B2

    公开(公告)日:2010-05-25

    申请号:US10664657

    申请日:2003-09-19

    申请人: Yong Rui Zicheng Liu

    发明人: Yong Rui Zicheng Liu

    IPC分类号: G06Q99/00

    CPC分类号: G06Q30/02

    摘要: A system and method for automatically determining if a remote client is a human or a computer. A set of HIP design guidelines which are important to ensure the security and usability of a HIP system are described. Furthermore, one embodiment of this new HIP system and method is based on human face and facial feature detection. Because human face is the most familiar object to all human users the embodiment of the invention employing a face is possibly the most universal HIP system so far.

    摘要翻译: 用于自动确定远程客户端是人机还是计算机的系统和方法。 描述了一套重要的HIP设计指南,以确保HIP系统的安全性和可用性。 此外,这种新的HIP系统和方法的一个实施例是基于人脸和面部特征检测。 因为人脸是所有人类用户最熟悉的对象,所以使用脸部的发明的实施方式可能是迄今为止最普遍的HIP系统。

    SPEECH SEPARATION WITH MICROPHONE ARRAYS
    10.
    发明申请
    SPEECH SEPARATION WITH MICROPHONE ARRAYS 有权
    与麦克风阵列的语音分离

    公开(公告)号:US20090214052A1

    公开(公告)日:2009-08-27

    申请号:US12035439

    申请日:2008-02-22

    IPC分类号: H04R3/00

    CPC分类号: H04R27/00 G10L21/0272

    摘要: A system that facilitates blind source separation in a distributed microphone meeting environment for improved teleconferencing. Input sensor (e.g., microphone) signals are transformed to the frequency-domain and independent component analysis is applied to compute estimates of frequency-domain processing matrices. Modified permutations of the processing matrices are obtained based upon a maximum magnitude based de-permutation scheme. Estimates of the plurality of source signals are provided based upon the modified frequency-domain processing matrices and input sensor signals.Optionally, segments during which the set of active sources is a subset of the set of all sources can be exploited to compute more accurate estimates of frequency-domain mixing matrices. Source activity detection can be applied to determine which speaker(s), if any, are active. Thereafter, a least squares post-processing of the frequency-domain independent components analysis outputs can be employed to adjust the estimates of the source signals based on source inactivity.

    摘要翻译: 一种促进分布式麦克风会议环境中盲源分离的系统,用于改进电话会议。 输入传感器(例如麦克风)信号被变换到频域,并且应用独立分量分析来计算频域处理矩阵的估计。 基于最大幅度的去排列方案获得处理矩阵的修改排列。 基于改进的频域处理矩阵和输入传感器信号来提供多个源信号的估计。 可选地,可以利用其中该组活动源是所有源的集合的子集的段来计算频域混合矩阵的更准确的估计。 源活动检测可以应用于确定哪些扬声器(如果有)是活动的。 此后,可以采用频域独立分量分析输出的最小二乘后处理,以基于源不活动来调整源信号的估计。