Efficient coding of digital media spectral data using wide-sense perceptual similarity
    2.
    Type: granted patent
    Legal status: in force

    Publication number: US08645127B2

    Publication date: 2014-02-04

    Application number: US12324689

    Filing date: 2008-11-26

    CPC classification number: G10L19/0208 G10L19/0204 G10L19/035

    Abstract: Traditional audio encoders may conserve coding bit-rate by encoding fewer than all spectral coefficients, which can produce a blurry low-pass sound in the reconstruction. An audio encoder using wide-sense perceptual similarity improves the quality by encoding a perceptually similar version of the omitted spectral coefficients, represented as a scaled version of the already coded spectrum. The omitted spectral coefficients are divided into a number of sub-bands. Each sub-band is encoded as two parameters: a scale factor, which may represent the energy in the band; and a shape parameter, which may represent the shape of the band. The shape parameter may be in the form of a motion vector pointing to a portion of the already coded spectrum, an index to a spectral shape in a fixed code-book, or a random noise vector. The encoding thus efficiently represents a scaled version of a similarly shaped portion of spectrum to be copied at decoding.
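
The scale-factor and motion-vector idea in this abstract can be illustrated with a minimal sketch. It is not the patented method: the function names, the RMS scale factor, and the normalised-correlation similarity measure are all assumptions chosen for illustration, and the fixed code-book and noise-vector options are left out.

```python
import math

def rms(values):
    """Root-mean-square level of a list of coefficients."""
    return math.sqrt(sum(x * x for x in values) / len(values))

def encode_band(band, baseband):
    """Return (scale, offset) for one omitted sub-band.

    scale  : RMS energy of the band (the scale factor)
    offset : start index of the best-matching baseband segment
             (plays the role of the "motion vector")
    """
    n = len(band)
    band_rms = max(rms(band), 1e-12)
    best_offset, best_score = 0, float("-inf")
    for offset in range(len(baseband) - n + 1):
        segment = baseband[offset:offset + n]
        seg_rms = rms(segment)
        if seg_rms == 0.0:
            continue  # a silent segment carries no shape
        # Assumed similarity measure: normalised correlation of the two shapes.
        score = sum(a * b for a, b in zip(band, segment)) / (n * band_rms * seg_rms)
        if score > best_score:
            best_score, best_offset = score, offset
    return rms(band), best_offset

def decode_band(scale, offset, n, baseband):
    """Copy the referenced baseband segment and rescale it to the coded energy."""
    segment = baseband[offset:offset + n]
    seg_rms = rms(segment)
    if seg_rms == 0.0:
        return [0.0] * n
    return [scale * x / seg_rms for x in segment]

# Hypothetical data: the omitted band is a half-amplitude copy of baseband[3:6].
baseband = [1.0, 0.0, 0.0, 2.0, 4.0, 2.0, 0.0, 1.0]
band = [1.0, 2.0, 1.0]
scale, offset = encode_band(band, baseband)
rebuilt = decode_band(scale, offset, len(band), baseband)
```

Only two numbers per sub-band are transmitted here, which is where the bit-rate saving comes from; the decoder already holds the baseband it copies from.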

    ADAPTIVE BANDWIDTH ESTIMATION
    3.
    Type: patent application
    Legal status: in force

    Publication number: US20130114421A1

    Publication date: 2013-05-09

    Application number: US13288968

    Filing date: 2011-11-04

    CPC classification number: H04L43/0829 H04L43/0852 H04L43/16

    Abstract: It can be determined whether relative one-way delay for data packets in a data stream exceeds a delay threshold. If so, then a delay congestion signal indicating that the relative one-way delay exceeds the delay threshold can be generated. The delay congestion signal can be used in calculating an adaptive bandwidth estimate for the data stream. A packet loss rate congestion signal may also be used in calculating the bandwidth estimate. It can be determined whether a data stream of data packets is in a contention state. If the data stream is in the contention state, then an adaptive bandwidth estimate can be calculated for the data stream using a first bandwidth estimation technique. If the data stream is not in the contention state, then the bandwidth estimate for the data stream can be calculated using a second bandwidth estimation technique.
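
The decision flow in this abstract can be sketched as below. The abstract does not say how relative one-way delay is measured or what the two estimation techniques are, so everything concrete here is an assumption: the definition of relative delay (latest delay minus the minimum observed), the thresholds, and the two update rules (loss-only in contention, delay-or-loss otherwise) are hypothetical placeholders.

```python
def relative_one_way_delay(delays_ms):
    """Latest one-way delay minus the smallest delay seen so far (assumed definition)."""
    return delays_ms[-1] - min(delays_ms)

def update_estimate(estimate_kbps, delays_ms, loss_rate, in_contention,
                    delay_threshold_ms=50.0, loss_threshold=0.02,
                    backoff=0.85, probe_step_kbps=50.0):
    """Return a new bandwidth estimate from the two congestion signals."""
    # Delay congestion signal: relative one-way delay exceeds the threshold.
    delay_signal = relative_one_way_delay(delays_ms) > delay_threshold_ms
    # Packet loss rate congestion signal.
    loss_signal = loss_rate > loss_threshold

    if in_contention:
        # "First technique" (assumed): react to loss only.
        congested = loss_signal
    else:
        # "Second technique" (assumed): react to either signal.
        congested = delay_signal or loss_signal

    if congested:
        return estimate_kbps * backoff        # multiplicative decrease
    return estimate_kbps + probe_step_kbps    # additive increase

# Hypothetical samples: delay rises by 70 ms over the baseline, no loss.
quiet = update_estimate(1000.0, [20.0, 22.0, 90.0], 0.0, in_contention=False)
shared = update_estimate(1000.0, [20.0, 22.0, 90.0], 0.0, in_contention=True)
```

With these assumed rules the same delay spike lowers the estimate for a stream that is not in contention and leaves a stream in contention free to keep probing upward.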

    Encoding streaming media as a high bit rate layer, a low bit rate layer, and one or more intermediate bit rate layers
    4.
    Type: granted patent
    Legal status: in force

    Publication number: US08325800B2

    Publication date: 2012-12-04

    Application number: US12116878

    Filing date: 2008-05-07

    Abstract: A method of encoding an input video stream comprising a video component and an audio component is disclosed. The input video stream is split into a plurality of segments, each comprising a plurality of frames. Each of the segments is encoded as a low bit rate layer, a high bit rate layer, and one or more intermediate bit rate layers. The bit rate of the low bit rate layer is selected such that a network streaming the segment will always be able to stream the segment encoded as the low bit rate layer. The bit rate of the high bit rate layer is selected such that the segment is able to be decoded and played back at or above a quality threshold. The bit rates of the intermediate bit rate layers are produced by applying a bit rate factor to another bit rate.
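
The last sentence of this abstract, deriving intermediate bit rates by applying a factor to another bit rate, can be sketched as follows. The function name, the choice to multiply upward from the low layer, and the example figures are assumptions; the abstract does not state which bit rate the factor is applied to or how the low and high rates are chosen.

```python
def layer_bit_rates(low_kbps, high_kbps, factor):
    """Return bit rates for the low, intermediate, and high layers.

    Each intermediate rate is the previous rate times `factor`
    (assumed direction: upward from the low layer).
    """
    if low_kbps <= 0 or high_kbps < low_kbps or factor <= 1:
        raise ValueError("need 0 < low <= high and factor > 1")
    rates = [low_kbps]
    # Keep multiplying until the next step would reach or pass the high layer.
    while rates[-1] * factor < high_kbps:
        rates.append(rates[-1] * factor)
    rates.append(high_kbps)
    return rates

# Hypothetical ladder: 200 kbps floor, 1600 kbps ceiling, factor of 2.
ladder = layer_bit_rates(200, 1600, 2.0)
```

If the factor is large relative to the gap between the low and high rates, this sketch yields no intermediate layer at all, so a real encoder following the abstract would need a smaller factor or a different rule in that case.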

    Low complexity decoder for complex transform coding of multi-channel sound
    5.
    Type: granted patent
    Legal status: in force

    Publication number: US08046214B2

    Publication date: 2011-10-25

    Application number: US11767457

    Filing date: 2007-06-22

    CPC classification number: G10L19/008

    Abstract: A multi-channel audio decoder provides reduced-complexity processing to reconstruct multi-channel audio from an encoded bitstream in which the multi-channel audio is represented as a coded subset of the channels along with a complex channel correlation matrix parameterization. The decoder translates the complex channel correlation matrix parameterization to a real transform that satisfies the magnitude of the complex channel correlation matrix. The multi-channel audio is derived from the coded subset of channels via channel extension processing using a real-valued effect signal and real-number scaling.

    Coding of sparse digital media spectral data
    7.
    Type: granted patent
    Legal status: in force

    Publication number: US07774205B2

    Publication date: 2010-08-10

    Application number: US11764108

    Filing date: 2007-06-15

    CPC classification number: G10L19/02 G10L19/0212 G10L19/032 G10L19/18

    Abstract: An audio encoder/decoder provides efficient compression of spectral transform coefficient data characterized by sparse spectral peaks. The audio encoder/decoder applies a temporal prediction of the frequency position of spectral peaks. The spectral peaks in the transform coefficients that are predicted from those in a preceding transform coding block are encoded as a shift in frequency position from the previous transform coding block and two non-zero coefficient levels. The prediction may avoid coding very large zero-level transform coefficient runs as compared to conventional run length coding. For spectral peaks not predicted from those in a preceding transform coding block, the spectral peaks are encoded as a value trio of a length of a run of zero-level spectral transform coefficients, and two non-zero coefficient levels.
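
The two coding modes in this abstract can be sketched as a round trip. This is a simplified illustration under stated assumptions: each peak is taken to cover exactly two adjacent coefficients, a peak counts as "predicted" when it lies within a hypothetical `max_shift` of a peak in the previous block, and the symbols are plain tuples rather than entropy-coded bits.

```python
def encode_block(peaks, prev_peaks, max_shift=2):
    """Encode sorted peaks [(pos, level_a, level_b), ...] as symbols.

    Predicted peak : ("pred", index of previous peak, shift, level_a, level_b)
    Other peak     : ("run", zero-run length, level_a, level_b)
    """
    symbols, cursor = [], 0
    for pos, a, b in peaks:
        match = next(((i, p) for i, (p, _, _) in enumerate(prev_peaks)
                      if abs(pos - p) <= max_shift), None)
        if match is not None:
            idx, prev_pos = match
            symbols.append(("pred", idx, pos - prev_pos, a, b))
        else:
            symbols.append(("run", pos - cursor, a, b))
        cursor = pos + 2  # each peak is assumed to span two coefficients
    return symbols

def decode_block(symbols, prev_peaks):
    """Rebuild the peak list from the symbols and the previous block's peaks."""
    peaks, cursor = [], 0
    for sym in symbols:
        if sym[0] == "pred":
            _, idx, shift, a, b = sym
            pos = prev_peaks[idx][0] + shift
        else:
            _, run, a, b = sym
            pos = cursor + run
        peaks.append((pos, a, b))
        cursor = pos + 2
    return peaks

# Hypothetical blocks: two peaks drift by one bin, one peak is new.
prev_block = [(100, 5, -3), (900, 2, 2)]
this_block = [(101, 6, -2), (400, 1, 1), (899, 2, 3)]
symbols = encode_block(this_block, prev_block)
restored = decode_block(symbols, prev_block)
```

In the example the peak at bin 899 is sent as a shift of -1 instead of a zero run of several hundred coefficients, which is the saving the abstract attributes to temporal prediction.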

    Selectively using multiple entropy models in adaptive coding and decoding
    9.
    Type: patent application
    Legal status: in force

    Publication number: US20070016418A1

    Publication date: 2007-01-18

    Application number: US11183266

    Filing date: 2005-07-15

    CPC classification number: G10L19/0017 H03M7/40 H04N19/13 H04N19/46

    Abstract: Techniques and tools for selectively using multiple entropy models in adaptive coding and decoding are described herein. For example, for multiple symbols, an audio encoder selects an entropy model from a first model set that includes multiple entropy models. Each of the multiple entropy models includes a model switch point for switching to a second model set that includes one or more entropy models. The encoder processes the multiple symbols using the selected entropy model and outputs results. Techniques and tools for generating entropy models are also described.
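
Model selection with a switch point can be illustrated with a small cost calculation. This is a sketch under assumptions, not the described implementation: each model is represented as a hypothetical table of code lengths in bits, the second model set is reduced to a single fallback table, and the encoder is assumed to pick the model with the lowest total bit cost.

```python
SWITCH = "switch"  # hypothetical name for the model switch point symbol

def cost_bits(symbols, model, fallback):
    """Total bits to code `symbols` with `model`.

    A symbol missing from `model` is coded as the switch point
    followed by its code in the fallback model.
    """
    total = 0
    for s in symbols:
        if s in model:
            total += model[s]
        else:
            total += model[SWITCH] + fallback[s]
    return total

def select_model(symbols, first_set, fallback):
    """Return (index, cost) of the cheapest model in the first model set."""
    costs = [cost_bits(symbols, m, fallback) for m in first_set]
    best = min(range(len(first_set)), key=costs.__getitem__)
    return best, costs[best]

# Hypothetical code-length tables (bits per symbol).
first_set = [
    {0: 1, 1: 2, SWITCH: 2},        # favours runs of 0
    {0: 2, 1: 2, 2: 2, SWITCH: 2},  # flatter distribution
]
fallback = {0: 2, 1: 2, 2: 2, 3: 2}

choice_a = select_model([0, 0, 0, 1], first_set, fallback)
choice_b = select_model([0, 0, 1, 2, 2, 3], first_set, fallback)
```

The switch point lets a compact first-set model stay small: rare symbols cost extra bits through the fallback instead of occupying entries in every model.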

    Modification of codewords in dictionary used for efficient coding of digital media spectral data
    10.
    Type: patent application
    Legal status: in force

    Publication number: US20070016414A1

    Publication date: 2007-01-18

    Application number: US11183084

    Filing date: 2005-07-15

    CPC classification number: G10L19/038 G10L19/24

    Abstract: Spectral data is coded by representing certain portions of the spectral data as a scaled version of a code-vector, where the code-vector is chosen from either a fixed predetermined codebook or a codebook taken from a baseband. Various optional features are described for modifying the code-vectors in the codebook according to rules that allow the code-vectors to better represent the data they are modeling. The code-vector modification comprises a linear or non-linear transform of one or more code-vectors, such as by exponentiation, negation, reversing, or combining elements from plural code-vectors.
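
Three of the modifications named in this abstract (exponentiation, negation, reversing) can be sketched as a search over modified code-vectors. The function names, the candidate exponents, the sign-preserving form of exponentiation, and the non-negative least-squares scale are assumptions for illustration; combining elements from plural code-vectors is not covered.

```python
import math

def modify(vec, exponent=1.0, negate=False, reverse=False):
    """Apply sign-preserving exponentiation, then optional negation and reversal."""
    out = [math.copysign(abs(x) ** exponent, x) for x in vec]
    if negate:
        out = [-x for x in out]
    if reverse:
        out = out[::-1]
    return out

def best_modified(target, codebook, exponents=(0.5, 1.0, 2.0)):
    """Return (error, index, exponent, negate, reverse, scale) of the best fit."""
    best = None
    for idx, vec in enumerate(codebook):
        for exponent in exponents:
            for negate in (False, True):
                for reverse in (False, True):
                    cand = modify(vec, exponent, negate, reverse)
                    denom = sum(c * c for c in cand)
                    if denom == 0.0:
                        continue
                    # Non-negative scale, so that negation is a real choice.
                    scale = max(0.0, sum(t * c for t, c in zip(target, cand)) / denom)
                    err = sum((t - scale * c) ** 2 for t, c in zip(target, cand))
                    if best is None or err < best[0]:
                        best = (err, idx, exponent, negate, reverse, scale)
    return best

# Hypothetical data: the target is code-vector 0 reversed, negated, and doubled.
codebook = [[1.0, 2.0, 3.0], [1.0, 0.0, 0.0]]
target = [-6.0, -4.0, -2.0]
fit = best_modified(target, codebook)
```

Each modification costs only a few extra bits to signal, yet lets a small codebook cover shapes it does not literally contain.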
