Patent search ap:("Sanjeev Mehrotra") AND inv:"Sanjeev Mehrotra" Page 1

1.

Granted invention patent
Bitstream syntax for multi-process audio decoding (in force)

Publication (announcement) number: US08645146B2

Publication (announcement) date: 2014-02-04

Application number: US13595939

Application date: 2012-08-27

Applicant: Kazuhito Koishida, Sanjeev Mehrotra, Chao He, Wei-Ge Chen

Inventor: Kazuhito Koishida, Sanjeev Mehrotra, Chao He, Wei-Ge Chen

IPC: G10L19/00

CPC classification number: G10L19/167, G10L19/002, G10L19/008, G10L19/022, G10L19/03, G10L19/038, G10L19/04, G10L19/24

Abstract: An audio decoder provides a combination of decoding components including components implementing base band decoding, spectral peak decoding, frequency extension decoding and channel extension decoding techniques. The audio decoder decodes a compressed bitstream structured by a bitstream syntax scheme to permit the various decoding components to extract the appropriate parameters for their respective decoding technique.

2.

Granted invention patent
Efficient coding of digital media spectral data using wide-sense perceptual similarity (in force)

Publication (announcement) number: US08645127B2

Publication (announcement) date: 2014-02-04

Application number: US12324689

Application date: 2008-11-26

Applicant: Sanjeev Mehrotra, Wei-Ge Chen

Inventor: Sanjeev Mehrotra, Wei-Ge Chen

IPC: G10L11/04

CPC classification number: G10L19/0208, G10L19/0204, G10L19/035

Abstract: Traditional audio encoders may conserve coding bit-rate by encoding fewer than all spectral coefficients, which can produce a blurry low-pass sound in the reconstruction. An audio encoder using wide-sense perceptual similarity improves the quality by encoding a perceptually similar version of the omitted spectral coefficients, represented as a scaled version of already coded spectrum. The omitted spectral coefficients are divided into a number of sub-bands. The sub-bands are encoded as two parameters: a scale factor, which may represent the energy in the band; and a shape parameter, which may represent a shape of the band. The shape parameter may be in the form of a motion vector pointing to a portion of the already coded spectrum, an index to a spectral shape in a fixed code-book, or a random noise vector. The encoding thus efficiently represents a scaled version of a similarly shaped portion of spectrum to be copied at decoding.
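
As a rough illustration of the scale-plus-shape idea in this abstract, the sketch below codes one omitted sub-band as a scale factor and an offset (the "motion vector") into the already coded spectrum. The function names, the RMS energy measure, and the correlation search are assumptions made for illustration, not details taken from the patent.

```python
import math


def encode_subband(baseband, subband):
    """Return (scale, offset) for one omitted sub-band.

    scale  -- RMS energy of the sub-band (assumed energy measure)
    offset -- start index in the already coded baseband whose shape
              best matches the sub-band (the "motion vector")
    """
    n = len(subband)
    scale = math.sqrt(sum(x * x for x in subband) / n)
    best_offset, best_corr = 0, -math.inf
    for offset in range(len(baseband) - n + 1):
        cand = baseband[offset:offset + n]
        norm = math.sqrt(sum(c * c for c in cand))
        if norm == 0.0:
            continue  # an all-zero candidate carries no shape
        # Dividing by the candidate norm compares shape, not loudness.
        corr = sum(c * s for c, s in zip(cand, subband)) / norm
        if corr > best_corr:
            best_offset, best_corr = offset, corr
    return scale, best_offset


def decode_subband(baseband, scale, offset, n):
    """Copy the referenced baseband portion and rescale it to `scale`."""
    cand = baseband[offset:offset + n]
    rms = math.sqrt(sum(c * c for c in cand) / n)
    gain = scale / rms if rms > 0 else 0.0
    return [gain * c for c in cand]
```

A decoder only needs the two parameters per sub-band. The fixed code-book and random-noise shape options mentioned in the abstract are not covered by this sketch.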

3.

Invention application
ADAPTIVE BANDWIDTH ESTIMATION (in force)

Publication (announcement) number: US20130114421A1

Publication (announcement) date: 2013-05-09

Application number: US13288968

Application date: 2011-11-04

Applicant: Tin Qian, Jin Li, Tanner M. Hodgeson, Sanjeev Mehrotra, Jiannan Zheng, Timothy M. Moore

Inventor: Tin Qian, Jin Li, Tanner M. Hodgeson, Sanjeev Mehrotra, Jiannan Zheng, Timothy M. Moore

IPC: H04L12/26

CPC classification number: H04L43/0829, H04L43/0852, H04L43/16

Abstract: It can be determined whether relative one way delay for data packets in a data stream exceeds a delay threshold. If so, then a delay congestion signal indicating that the relative one way delay exceeds the delay threshold can be generated. The delay congestion signal can be used in calculating an adaptive bandwidth estimate for the data stream. A packet loss rate congestion signal may also be used in calculating the bandwidth estimate. It can be determined whether a data stream of data packets is in a contention state. If the data stream is in the contention state, then an adaptive bandwidth estimate can be calculated for the data stream using a first bandwidth estimation technique. If the data stream is not in the contention state, then the bandwidth estimate for the data stream can be calculated using a second bandwidth estimation technique.
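
The flow described in this abstract might be sketched as follows. The abstract does not say what the two estimation techniques are, so the loss-only rule used in the contention state, the multiplicative decrease, the additive increase, and every threshold value here are hypothetical placeholders.

```python
def delay_congestion_signal(one_way_delays_ms, delay_threshold_ms):
    """True if the latest relative one-way delay exceeds the threshold.

    Relative delay is taken against the smallest delay observed so far,
    which avoids needing synchronized sender and receiver clocks.
    """
    base = min(one_way_delays_ms)
    return (one_way_delays_ms[-1] - base) > delay_threshold_ms


def update_estimate(estimate_kbps, one_way_delays_ms, loss_rate, in_contention,
                    delay_threshold_ms=50.0, loss_threshold=0.02):
    """Return a new bandwidth estimate in kbps (all constants are assumed)."""
    delay_signal = delay_congestion_signal(one_way_delays_ms, delay_threshold_ms)
    loss_signal = loss_rate > loss_threshold
    if in_contention:
        # Hypothetical "first technique": react to packet loss only.
        congested = loss_signal
    else:
        # Hypothetical "second technique": react to delay or loss.
        congested = delay_signal or loss_signal
    # Back off multiplicatively when congested, otherwise probe upward.
    return estimate_kbps * 0.75 if congested else estimate_kbps + 50.0
```

For example, `update_estimate(1000.0, [20, 22, 90], 0.0, in_contention=False)` backs off because the relative delay of 70 ms exceeds the assumed 50 ms threshold.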

4.

Granted invention patent
Encoding streaming media as a high bit rate layer, a low bit rate layer, and one or more intermediate bit rate layers (in force)

Publication (announcement) number: US08325800B2

Publication (announcement) date: 2012-12-04

Application number: US12116878

Application date: 2008-05-07

Applicant: Thomas W. Holcomb, Sanjeev Mehrotra, Serge Smirnov, Bharath Siravara

Inventor: Thomas W. Holcomb, Sanjeev Mehrotra, Serge Smirnov, Bharath Siravara

IPC: H04N7/12, H04N11/02, H04N11/04

CPC classification number: H04N19/34, H04N19/114, H04N19/115, H04N19/124, H04N19/14, H04N19/147, H04N19/164, H04N19/166, H04N19/179, H04N19/187

Abstract: A method of encoding an input video stream comprising a video component and an audio component is disclosed. The input video stream is split into a plurality of segments, each comprising a plurality of frames. Each of the segments is encoded as a low bit rate layer, a high bit rate layer, and one or more intermediate bit rate layers. The bit rate of the low bit rate layer is selected such that a network streaming the segment will always be able to stream the segment encoded as the low bit rate layer. The bit rate of the high bit rate layer is selected such that the segment is able to be decoded and played back at or above a quality threshold. The bit rates of the intermediate bit rate layers are produced by applying a bit rate factor to another bit rate.
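
One way to read the last sentence of this abstract is as a ladder in which each intermediate rate is the previous rate times a fixed factor. The sketch below assumes that reading. The function name and the choice to build upward from the low layer are illustrative, and the abstract does not say how the low and high rates themselves are chosen.

```python
def bitrate_ladder(low_kbps, high_kbps, factor):
    """Return [low, intermediates..., high] bit rates for one segment.

    Each intermediate rate is the previous rate multiplied by `factor`
    (an assumed interpretation of "applying a bit rate factor").
    """
    if factor <= 1 or low_kbps <= 0 or high_kbps <= low_kbps:
        raise ValueError("need factor > 1 and 0 < low_kbps < high_kbps")
    rates = [low_kbps]
    # Keep adding intermediate layers while they stay below the high layer.
    while rates[-1] * factor < high_kbps:
        rates.append(rates[-1] * factor)
    rates.append(high_kbps)
    return rates
```

With a low layer of 200 kbps, a high layer of 3000 kbps, and a factor of 2, this yields 200, 400, 800, 1600, and 3000 kbps.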

5.

Granted invention patent
Low complexity decoder for complex transform coding of multi-channel sound (in force)

Publication (announcement) number: US08046214B2

Publication (announcement) date: 2011-10-25

Application number: US11767457

Application date: 2007-06-22

Applicant: Sanjeev Mehrotra, Wei-Ge Chen

Inventor: Sanjeev Mehrotra, Wei-Ge Chen

IPC: G10L19/00

CPC classification number: G10L19/008

Abstract: A multi-channel audio decoder provides a reduced complexity processing to reconstruct multi-channel audio from an encoded bitstream in which the multi-channel audio is represented as a coded subset of the channels along with a complex channel correlation matrix parameterization. The decoder translates the complex channel correlation matrix parameterization to a real transform that satisfies the magnitude of the complex channel correlation matrix. The multi-channel audio is derived from the coded subset of channels via channel extension processing using a real value effect signal and real number scaling.

6.

Granted invention patent
Adaptive vector Huffman coding and decoding based on a sum of values of audio data symbols (in force)

Publication (announcement) number: US07822601B2

Publication (announcement) date: 2010-10-26

Application number: US12122553

Application date: 2008-05-16

Applicant: Sanjeev Mehrotra, Wei-Ge Chen

Inventor: Sanjeev Mehrotra, Wei-Ge Chen

IPC: G10L19/00, H03M7/38, H03M7/40

CPC classification number: G10L19/032, H03M7/40, H03M7/4006, H03M7/4093, H03M7/46

Abstract: An audio encoder performs entropy encoding of audio data. For example, an audio encoder determines a Huffman code from a Huffman code table to use for encoding a vector of audio data symbols, where the determining is based on a sum of values of the audio data symbols. An audio decoder performs corresponding entropy decoding.
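
A toy sketch of choosing a code based on the sum of the symbol values is given below. It assumes, without confirmation from the abstract, that vectors whose sum is within a limit get a dedicated table entry while all others get an escape code. The table contents, the limit, and the escape code are made up.

```python
def choose_code(vector, table, max_sum, escape_code="111"):
    """Pick a Huffman code for a vector of non-negative symbol values.

    If the sum of the values is at most `max_sum` and the vector has a
    table entry, return that entry. Otherwise return `escape_code`,
    signalling that the vector must be coded some other way.
    """
    key = tuple(vector)
    if sum(vector) <= max_sum and key in table:
        return table[key]
    return escape_code


# Hypothetical prefix-free table for 2-dimensional vectors with sum <= 2.
TOY_TABLE = {(0, 0): "0", (0, 1): "100", (1, 0): "101", (1, 1): "110"}
```

Here `choose_code([0, 1], TOY_TABLE, 2)` finds a table entry, while `choose_code([3, 2], TOY_TABLE, 2)` falls back to the escape code because the sum is too large.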

7.

Granted invention patent
Coding of sparse digital media spectral data (in force)

Publication (announcement) number: US07774205B2

Publication (announcement) date: 2010-08-10

Application number: US11764108

Application date: 2007-06-15

Applicant: Kazuhito Koishida, Sanjeev Mehrotra, Wei-Ge Chen

Inventor: Kazuhito Koishida, Sanjeev Mehrotra, Wei-Ge Chen

IPC: G10L21/04

CPC classification number: G10L19/02, G10L19/0212, G10L19/032, G10L19/18

Abstract: An audio encoder/decoder provides efficient compression of spectral transform coefficient data characterized by sparse spectral peaks. The audio encoder/decoder applies a temporal prediction of the frequency position of spectral peaks. The spectral peaks in the transform coefficients that are predicted from those in a preceding transform coding block are encoded as a shift in frequency position from the previous transform coding block and two non-zero coefficient levels. The prediction may avoid coding very large zero-level transform coefficient runs as compared to conventional run length coding. For spectral peaks not predicted from those in a preceding transform coding block, the spectral peaks are encoded as a value trio of a length of a run of zero-level spectral transform coefficients, and two non-zero coefficient levels.
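
The non-predicted case in this abstract, a trio of a zero-run length and two coefficient levels, can be sketched as below. The function names are hypothetical, the temporal prediction of peak positions is not modeled, and, unlike the abstract's wording, this simplified version lets the second level of a trio be zero.

```python
def encode_peaks(coeffs):
    """Encode quantized coefficients as (zero_run, level0, level1) trios.

    Trailing zeros are not coded; the decoder pads to the block length.
    """
    trios, run, i = [], 0, 0
    while i < len(coeffs):
        if coeffs[i] == 0:
            run += 1
            i += 1
        else:
            # Take the non-zero coefficient and its right-hand neighbour.
            nxt = coeffs[i + 1] if i + 1 < len(coeffs) else 0
            trios.append((run, coeffs[i], nxt))
            run = 0
            i += 2
    return trios


def decode_peaks(trios, length):
    """Rebuild a coefficient block of `length` entries from the trios."""
    out = []
    for run, level0, level1 in trios:
        out.extend([0] * run + [level0, level1])
    out = out[:length]  # drop the padding level of a peak at the block edge
    return out + [0] * (length - len(out))
```

Sparse blocks with a few isolated peaks collapse to a handful of trios, and `decode_peaks(encode_peaks(block), len(block))` returns the original block.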

8.

Invention application
ENTROPY CODING BY ADAPTING CODING BETWEEN LEVEL AND RUN LENGTH/LEVEL MODES (in force)

Publication (announcement) number: US20080262855A1

Publication (announcement) date: 2008-10-23

Application number: US12122553

Application date: 2008-05-16

Applicant: Sanjeev Mehrotra, Wei-Ge Chen

Inventor: Sanjeev Mehrotra, Wei-Ge Chen

IPC: G10L19/00

CPC classification number: G10L19/032, H03M7/40, H03M7/4006, H03M7/4093, H03M7/46

Abstract: An audio encoder performs adaptive entropy encoding of audio data. For example, an audio encoder switches between variable dimension vector Huffman coding of direct levels of quantized audio data and run-level coding of run lengths and levels of quantized audio data. The encoder can use, for example, context-based arithmetic coding for coding run lengths and levels. The encoder can determine when to switch between coding modes by counting consecutive coefficients having a predominant value (e.g., zero). An audio decoder performs corresponding adaptive entropy decoding.
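
The switching rule mentioned in this abstract, counting consecutive coefficients that have the predominant value, might look like the sketch below. The function name, the threshold parameter, and the decision to place the switch at the start of the qualifying run are assumptions.

```python
def find_switch_point(coeffs, run_threshold, predominant=0):
    """Return the index where coding would switch to run-level mode.

    Coefficients before the returned index would be coded as direct
    levels. The switch is placed at the start of the first run of
    `run_threshold` consecutive `predominant` values. If there is no
    such run, len(coeffs) is returned and run-level mode is never used.
    """
    count = 0
    for i, value in enumerate(coeffs):
        count = count + 1 if value == predominant else 0
        if count >= run_threshold:
            return i - run_threshold + 1
    return len(coeffs)
```

For `[9, 4, 0, 3, 0, 0, 0, 1, 0, 0]` and a threshold of 3, the switch point is index 4, where the first run of three zeros begins.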

9.

Invention application
Selectively using multiple entropy models in adaptive coding and decoding (in force)

Publication (announcement) number: US20070016418A1

Publication (announcement) date: 2007-01-18

Application number: US11183266

Application date: 2005-07-15

Applicant: Sanjeev Mehrotra, Wei-Ge Chen

Inventor: Sanjeev Mehrotra, Wei-Ge Chen

IPC: G10L15/00

CPC classification number: G10L19/0017, H03M7/40, H04N19/13, H04N19/46

Abstract: Techniques and tools for selectively using multiple entropy models in adaptive coding and decoding are described herein. For example, for multiple symbols, an audio encoder selects an entropy model from a first model set that includes multiple entropy models. Each of the multiple entropy models includes a model switch point for switching to a second model set that includes one or more entropy models. The encoder processes the multiple symbols using the selected entropy model and outputs results. Techniques and tools for generating entropy models are also described.

10.

Invention application
Modification of codewords in dictionary used for efficient coding of digital media spectral data (in force)

Publication (announcement) number: US20070016414A1

Publication (announcement) date: 2007-01-18

Application number: US11183084

Application date: 2005-07-15

Applicant: Sanjeev Mehrotra, Wei-Ge Chen, Kazuhito Koishida

Inventor: Sanjeev Mehrotra, Wei-Ge Chen, Kazuhito Koishida

IPC: G10L19/00

CPC classification number: G10L19/038, G10L19/24

Abstract: Coding of spectral data by representing certain portions of the spectral data as a scaled version of a code-vector, where the code-vector is chosen from either a fixed predetermined codebook or a codebook taken from a baseband. Various optional features are described for modifying the code-vectors in the codebook according to some rules which allow the code-vector to better represent the data they are modeling. The code-vector modification comprises a linear or non-linear transform of one or more code-vectors, such as, by exponentiation, negation, reversing, or combining elements from plural code-vectors.
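
To make the code-vector modifications in this abstract concrete, the sketch below generates negated, reversed, and exponentiated variants of one code-vector and keeps whichever variant, after scaling, is closest to the target data. The exponent value, the non-negative least-squares gain, and all names are assumptions. Combining elements from plural code-vectors is not shown.

```python
import math


def modifications(code_vector, exponent=0.5):
    """Return candidate variants of a code-vector, keyed by name."""
    def signed_pow(x):
        # Apply the exponent to the magnitude and keep the sign.
        return math.copysign(abs(x) ** exponent, x)

    return {
        "identity": list(code_vector),
        "negate": [-x for x in code_vector],
        "reverse": list(reversed(code_vector)),
        "exponent": [signed_pow(x) for x in code_vector],
    }


def best_modification(code_vector, target):
    """Return (name, scale, error) of the variant that best fits `target`."""
    best = None
    for name, variant in modifications(code_vector).items():
        energy = sum(x * x for x in variant)
        if energy == 0.0:
            continue
        # Least-squares gain, clamped at zero so that negation is a
        # genuinely different candidate (assumed constraint).
        scale = max(0.0, sum(v * t for v, t in zip(variant, target)) / energy)
        error = sum((t - scale * v) ** 2 for v, t in zip(variant, target))
        if best is None or error < best[2]:
            best = (name, scale, error)
    return best
```

An encoder following this pattern would transmit the code-vector index, the chosen modification, and the scale, rather than the spectral data itself.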
