Content-adaptive systems, methods and apparatus for determining optical flow
    2.
    Granted invention patent
    Status: in force

    Publication No.: US08553943B2

    Publication date: 2013-10-08

    Application No.: US13160457

    Filing date: 2011-06-14

    IPC classification: G06K9/00 H04N7/18

    Abstract: Embodiments include methods and systems which determine pixel displacement between frames based on a respective weighting-value for each pixel or a group of pixels. The weighting-values provide an indication as to which pixels are more pertinent to optical flow computations. Computational resources and effort can be focused on pixels with higher weights, which are generally more pertinent to optical flow determinations.
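
As an illustration of the weighting idea in the abstract, the sketch below estimates a single displacement between two 1-D frames and restricts the computation to high-weight pixels. The abstract does not say how the weighting-values are computed. Using gradient magnitude as the weight, the `threshold` parameter, and the weighted least-squares (Lucas-Kanade style) estimate are all assumptions made for this example, not the patented method.

```python
from typing import List, Sequence


def gradient(frame: Sequence[float]) -> List[float]:
    """Central-difference spatial gradient; the two border pixels get 0."""
    g = [0.0] * len(frame)
    for x in range(1, len(frame) - 1):
        g[x] = (frame[x + 1] - frame[x - 1]) / 2.0
    return g


def weighted_displacement(prev: Sequence[float], curr: Sequence[float],
                          threshold: float = 0.75) -> float:
    """Estimate one displacement (in pixels) from prev to curr.

    Assumption: weight = |spatial gradient|. Pixels whose weight falls
    below `threshold` are skipped, so effort goes to high-weight pixels.
    """
    ix = gradient(prev)
    num = 0.0
    den = 0.0
    for x, gx in enumerate(ix):
        w = abs(gx)                 # hypothetical weighting-value
        if w < threshold:
            continue                # low weight: skip this pixel entirely
        it = curr[x] - prev[x]      # temporal difference at this pixel
        num += w * gx * it
        den += w * gx * gx
    if den == 0.0:
        raise ValueError("no pixel passed the weight threshold")
    return -num / den


prev_frame = [0, 0, 0, 1, 2, 3, 4, 4, 4, 4]
curr_frame = [0, 0, 0, 0, 1, 2, 3, 4, 4, 4]   # prev shifted right by one pixel
shift = weighted_displacement(prev_frame, curr_frame)
```

Only the three pixels on the steep part of the ramp pass the threshold here, and they are enough to recover the one-pixel shift. The flat regions carry no motion information and are never visited.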

    HYBRID REALITY FOR 3D HUMAN-MACHINE INTERFACE
    3.
    Invention application
    Status: under examination (published)

    Publication No.: US20120139906A1

    Publication date: 2012-06-07

    Application No.: US13234028

    Filing date: 2011-09-15

    IPC classification: G06T15/00

    CPC classification: G06T19/006 H04N13/156

    Abstract: A three-dimensional (3D) mixed reality system combines a real 3D image or video, captured by a 3D camera for example, with a virtual 3D image rendered by a computer or other machine to render a 3D mixed-reality image or video. A 3D camera can acquire two separate images (a left and a right) of a common scene, and superimpose the two separate images to create a real image with a 3D depth effect. The 3D mixed-reality system can determine a distance to a zero disparity plane for the real 3D image, determine one or more parameters for a projection matrix based on the distance to the zero disparity plane, render a virtual 3D object based on the projection matrix, and combine the real image and the virtual 3D object to generate a mixed-reality 3D image.
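
One common way to tie a projection to a zero disparity plane is an off-axis (asymmetric) stereo frustum, sketched below. The abstract does not state which projection parameters are derived or how, so the frustum formula, the function names, and the numeric values are assumptions for illustration only. The returned bounds are the kind of parameters an OpenGL-style perspective matrix takes.

```python
import math
from typing import Tuple

Frustum = Tuple[float, float, float, float]  # (left, right, bottom, top)


def stereo_frustum(eye: str, fov_y_deg: float, aspect: float, near: float,
                   eye_separation: float, zero_disparity_dist: float) -> Frustum:
    """Near-plane window for one eye, shifted so that points at
    zero_disparity_dist land on the same screen position for both eyes."""
    if eye not in ("left", "right"):
        raise ValueError("eye must be 'left' or 'right'")
    top = near * math.tan(math.radians(fov_y_deg) / 2.0)
    half_width = aspect * top
    shift = 0.5 * eye_separation * near / zero_disparity_dist
    if eye == "right":
        shift = -shift
    return (-half_width + shift, half_width + shift, -top, top)


def ndc_x(point_x: float, depth: float, eye_x: float, near: float,
          frustum: Frustum) -> float:
    """Horizontal normalized device coordinate of a point seen from one eye."""
    left, right, _, _ = frustum
    x_near = (point_x - eye_x) * near / depth   # project onto the near plane
    return (2.0 * x_near - (right + left)) / (right - left)


SEP, NEAR, ZDP = 0.06, 0.1, 2.0   # hypothetical values, in metres
left_f = stereo_frustum("left", 60.0, 16 / 9, NEAR, SEP, ZDP)
right_f = stereo_frustum("right", 60.0, 16 / 9, NEAR, SEP, ZDP)


def disparity(depth: float) -> float:
    """Right-eye minus left-eye screen position of a point straight ahead."""
    return (ndc_x(0.0, depth, +SEP / 2, NEAR, right_f)
            - ndc_x(0.0, depth, -SEP / 2, NEAR, left_f))
```

With these bounds, a virtual object placed at the zero disparity distance shows no disparity, a nearer one shows negative (crossed) disparity, and a farther one shows positive disparity. Matching this distance to that of the real 3D image is what would let the rendered object sit at a consistent depth in the combined picture.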

    GESTURE-BASED USER INTERFACE
    4.
    Invention application
    Status: under examination (published)

    Publication No.: US20110107216A1

    Publication date: 2011-05-05

    Application No.: US12785709

    Filing date: 2010-05-24

    Applicant: NING BI

    Inventor: NING BI

    IPC classification: G06F3/033 G06F3/01

    Abstract: A gesture-based user interface system that includes a media-capturing device, a processor, and a display device. The media-capturing device captures media associated with a user and his/her surrounding environment. Using the captured media, the processor recognizes gestures the user uses to interact with display virtual objects displayed on the display device, without the user touching the display. A mirror image of the user and the surrounding environment is displayed in 3D on the display device with the display virtual objects in a virtual environment. The interaction between the image of the user and the display virtual objects is also displayed, in addition to an indication of the interaction such as a visual and/or an audio feedback.

    MULTI-STAGE TESSELLATION FOR GRAPHICS RENDERING
    5.
    Invention application
    Status: in force

    Publication No.: US20090237401A1

    Publication date: 2009-09-24

    Application No.: US12052628

    Filing date: 2008-03-20

    IPC classification: G06T17/00

    CPC classification: G06T11/203

    Abstract: This disclosure describes a multi-stage tessellation technique for tessellating a curve during graphics rendering. In particular, a first tessellation stage tessellates the curve into a first set of line segments that each represents a portion of the curve. A second tessellation stage further tessellates the portion of the curve represented by each of the line segments of the first set into additional line segments that more finely represent the shape of the curve. In this manner, each portion of the curve that was represented by only one line segment after the first tessellation stage is represented by more than one line segment after the second tessellation stage. In some instances, more than two tessellation stages may be performed to tessellate the curve.
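
The two stages described in the abstract can be sketched as follows. The quadratic Bezier curve, the uniform parameter splitting, and all function names are assumptions chosen to keep the example small. The abstract does not specify the curve type or how each stage picks its subdivision points.

```python
from typing import Callable, List, Tuple

Point = Tuple[float, float]


def quadratic_bezier(p0: Point, p1: Point, p2: Point) -> Callable[[float], Point]:
    """Return a function mapping t in [0, 1] to a point on the curve."""
    def point_at(t: float) -> Point:
        u = 1.0 - t
        return (u * u * p0[0] + 2 * u * t * p1[0] + t * t * p2[0],
                u * u * p0[1] + 2 * u * t * p1[1] + t * t * p2[1])
    return point_at


def split(t0: float, t1: float, n: int) -> List[Tuple[float, float]]:
    """Divide the parameter interval [t0, t1] into n equal sub-intervals."""
    span = t1 - t0
    return [(t0 + span * i / n, t0 + span * (i + 1) / n) for i in range(n)]


def multi_stage_tessellate(curve: Callable[[float], Point],
                           first: int, second: int) -> List[Point]:
    """Stage 1 cuts the curve into `first` portions; stage 2 cuts each
    portion into `second` finer segments. Returns the polyline vertices."""
    vertices = [curve(0.0)]
    for t0, t1 in split(0.0, 1.0, first):          # first tessellation stage
        for _, t_end in split(t0, t1, second):     # second tessellation stage
            vertices.append(curve(t_end))
    return vertices


curve = quadratic_bezier((0.0, 0.0), (1.0, 2.0), (2.0, 0.0))
coarse = multi_stage_tessellate(curve, 4, 1)   # 4 line segments
fine = multi_stage_tessellate(curve, 4, 3)     # each of the 4 becomes 3
```

Each portion that the first stage covers with one line segment is covered by three after the second stage, so the vertex count goes from 5 to 13. Because the inner loop depends only on its own `(t0, t1)` interval, the second stage could in principle process the first-stage portions independently.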

    Voice recognition system method and apparatus
    6.
    Granted invention patent
    Status: in force

    Publication No.: US06941265B2

    Publication date: 2005-09-06

    Application No.: US10017270

    Filing date: 2001-12-14

    IPC classification: G10L15/28 G10L15/00

    CPC classification: G10L15/28

    Abstract: Generally stated, a method and an accompanying apparatus provide for a voice recognition system (300) with a programmable front end processing unit (400). The front end processing unit (400) requests and receives different configuration files at different times for processing voice data in the voice recognition system (300). The configuration files are communicated to the front end unit via a communication link (310) for configuring the front end processing unit (400). A microprocessor may provide the front end configuration files on the communication link at different times.

    Voice recognition user interface for telephone handsets
    7.
    Granted invention patent
    Status: in force

    Publication No.: US06449496B1

    Publication date: 2002-09-10

    Application No.: US09246499

    Filing date: 1999-02-08

    IPC classification: H04B1/38

    CPC classification: H04M1/271

    Abstract: A method and apparatus providing a user interface within a phone that responds to a limited vocabulary of user-trained voice commands. The interface allows users to perform all phone handset dialing functions using voice commands. Additionally, users will be able to create and modify entries within a voice recognition phonebook, whereby a number within the voice recognition phonebook can be called by saying the name associated with the number. The user interface provides a combination of voice and LCD-displayed user prompts and responses to voice input. The interface responds to user voice commands and performs the command functions based upon matches to previously user-trained voice command vocabulary words stored in memory.

    Noise-compensated speech recognition templates
    8.
    Granted invention patent
    Status: lapsed

    Publication No.: US06381569B1

    Publication date: 2002-04-30

    Application No.: US09018257

    Filing date: 1998-02-04

    IPC classification: G10L15/20

    CPC classification: G10L15/20 G10L21/0216

    Abstract: The speech recognition training unit is modified to store digitized speech samples into a speech database that can be accessed at recognition time. The improved recognition unit comprises a noise analysis, modeling, and synthesis unit which continually analyzes the noise characteristics present in the audio environment and produces an estimated noise signal with similar characteristics. The recognition unit then constructs a noise-compensated template database by adding the estimated noise signal to each of the speech samples in the speech database and performing parameter determination on the resulting sums. This procedure accounts for the presence of noise in the recognition phase by retraining all the templates using an estimated noise signal with characteristics similar to those of the actual noise signal that corrupted the word to be recognized. This method improves the likelihood of a good template match, which increases the recognition accuracy.
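
The template-rebuilding step described above can be sketched as follows. The noise model (zero-mean Gaussian matched only in RMS level), the per-frame log-energy features standing in for "parameter determination", and every name in the code are simplifying assumptions. The abstract does not specify the noise model or the feature set.

```python
import math
import random
from typing import Dict, List, Sequence


def synthesize_noise(observed: Sequence[float], length: int,
                     seed: int = 0) -> List[float]:
    """Analyze observed noise (RMS level only, an assumption) and
    synthesize a signal of the requested length with the same level."""
    rms = math.sqrt(sum(x * x for x in observed) / len(observed))
    rng = random.Random(seed)
    return [rng.gauss(0.0, rms) for _ in range(length)]


def log_energy_features(signal: Sequence[float], frame_len: int = 4) -> List[float]:
    """Stand-in for parameter determination: log energy per frame."""
    feats = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        energy = sum(x * x for x in signal[start:start + frame_len])
        feats.append(math.log(energy + 1e-12))
    return feats


def build_noise_compensated_templates(speech_db: Dict[str, List[float]],
                                      observed_noise: Sequence[float],
                                      frame_len: int = 4) -> Dict[str, List[float]]:
    """Add the estimated noise to every stored sample, then re-extract features."""
    templates = {}
    for word, samples in speech_db.items():
        noise = synthesize_noise(observed_noise, len(samples))
        noisy = [s + n for s, n in zip(samples, noise)]
        templates[word] = log_energy_features(noisy, frame_len)
    return templates


speech_db = {"yes": [0.5, -0.5] * 4, "no": [0.1] * 8}   # toy stored samples
quiet = build_noise_compensated_templates(speech_db, [0.0] * 16)
noisy = build_noise_compensated_templates(speech_db, [0.3, -0.3] * 8)
```

The point of storing raw samples rather than finished templates is visible here: the templates are rebuilt from `speech_db` whenever the observed noise changes, so they are compared against input corrupted by comparable noise.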

    System and method for segmentation and recognition of speech signals
    9.
    Granted invention patent
    Status: in force

    Publication No.: US06278972B1

    Publication date: 2001-08-21

    Application No.: US09225891

    Filing date: 1999-01-04

    IPC classification: G01L15/04

    CPC classification: G10L15/04

    Abstract: A system and method for forming a segmented speech signal from an input speech signal having a plurality of frames. The input speech signal is converted from a time domain signal to a frequency domain signal having a plurality of speech frames, wherein each speech frame in the frequency domain signal is represented by at least one spectral value associated with the speech frame. A spectral difference value is then determined for each pair of adjacent frames in the frequency domain signal, wherein the spectral difference value for each pair of adjacent frames is representative of a difference between the at least one spectral value associated with each frame in the pair of adjacent frames. An initial cluster boundary is set between each pair of adjacent frames in the frequency domain signal, and a variance value is assigned to each cluster in the frequency domain signal, wherein the variance value for each cluster is equal to one of the determined spectral difference values. Next, a plurality of cluster merge parameters is calculated, wherein each of the cluster merge parameters is associated with a pair of adjacent clusters in the frequency domain signal. A minimum cluster merge parameter is selected from the plurality of cluster merge parameters. A merged cluster is then formed by canceling a cluster boundary between the clusters associated with the minimum merge parameter and assigning a merged variance value to the merged cluster, wherein the merged variance value is representative of the variance values assigned to the clusters associated with the minimum merge parameter. The process is repeated in order to form a plurality of merged clusters, and the segmented speech signal is formed in accordance with the plurality of merged clusters.
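
The merge loop described in the abstract can be sketched as a bottom-up merging of adjacent clusters. The abstract gives no formulas for the variance values or the merge parameter, so this example substitutes a well-known stand-in: the increase in within-cluster squared error (Ward's criterion), which for two single-frame clusters reduces to half the squared spectral difference between the frames. The function names and the `target` stopping rule are also assumptions.

```python
from typing import List, Sequence, Tuple


def merge_cost(n1: int, m1: Sequence[float], n2: int, m2: Sequence[float]) -> float:
    """Stand-in merge parameter: growth in squared error if two clusters
    with sizes n1, n2 and mean spectra m1, m2 were merged."""
    dist_sq = sum((a - b) ** 2 for a, b in zip(m1, m2))
    return n1 * n2 / (n1 + n2) * dist_sq


def segment(frames: Sequence[Sequence[float]], target: int) -> List[Tuple[int, int]]:
    """Merge adjacent clusters until `target` remain.

    Returns (start, end_exclusive) frame ranges, one per segment.
    """
    if not 1 <= target <= len(frames):
        raise ValueError("target must be between 1 and the number of frames")
    # Start with a boundary between every pair of adjacent frames:
    # each cluster is [start, end_exclusive, mean_spectrum].
    clusters = [[i, i + 1, list(f)] for i, f in enumerate(frames)]
    while len(clusters) > target:
        # One merge parameter per pair of adjacent clusters.
        costs = [merge_cost(a[1] - a[0], a[2], b[1] - b[0], b[2])
                 for a, b in zip(clusters, clusters[1:])]
        k = costs.index(min(costs))            # minimum merge parameter
        a, b = clusters[k], clusters[k + 1]
        na, nb = a[1] - a[0], b[1] - b[0]
        mean = [(na * x + nb * y) / (na + nb) for x, y in zip(a[2], b[2])]
        clusters[k:k + 2] = [[a[0], b[1], mean]]   # cancel the boundary
    return [(c[0], c[1]) for c in clusters]


# Toy "spectra": one value per frame, with an abrupt change after frame 2.
frames = [[0.0], [0.1], [0.2], [5.0], [5.1], [5.2]]
segments = segment(frames, 2)
```

On the toy input the loop removes the four cheap boundaries first and keeps the one at the abrupt spectral change, giving the segments `(0, 3)` and `(3, 6)`. Only adjacent clusters are ever candidates for merging, which preserves the time order of the frames.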
