Method for representing multi-channel audio signals
    41.
    发明申请
    Method for representing multi-channel audio signals 有权
    表示多声道音频信号的方法

    公开(公告)号:US20070258607A1

    公开(公告)日:2007-11-08

    申请号:US11549963

    申请日:2006-10-16

    IPC分类号: H04R5/02

    摘要: A multi-channel input signal having at least three original channels is represented by a parameter representation of the multi-channel signal. A first balance parameter, a first coherence parameter, or a first inter-channel time difference between a first channel pair and a second balance parameter, or a second coherence parameter, or a second inter-channel time difference parameter between a second channel pair are calculated. This set of parameters is the parameter representation of the original signals. The first channel pair has two channels, which are different from two channels of a second channel pair. Furthermore, each channel of the two channel pairs is one of the original channels, or a weighted combination of the original channels, and the first channel pair and the second channel pair include information on the three original channels. For multi-channel reconstruction purposes, the parameters are used in addition to down-mixing information to generate a selectable number of output channels in a scalable fashion.

    摘要翻译: 具有至少三个原始信道的多声道输入信号由多声道信号的参数表示来表示。 在第二通道对之间的第一通道对和第二平衡参数之间的第一平衡参数,第一相干参数或第一通道间时间差或第二相干参数或第二通道间差参数是 计算。 这组参数是原始信号的参数表示。 第一通道对具有两个通道,它们与第二通道对的两个通道不同。 此外,两个信道对的每个信道是原始信道之一或原始信道的加权组合,并且第一信道对和第二信道对包括关于三个原始信道的信息。 对于多通道重建的目的,除了降混信息之外还使用这些参数,以便以可缩放的方式生成可选数量的输出通道。

    Adaptive residual audio coding
    42.
    发明申请
    Adaptive residual audio coding 有权
    自适应残差音频编码

    公开(公告)号:US20060233379A1

    公开(公告)日:2006-10-19

    申请号:US11247555

    申请日:2005-10-11

    IPC分类号: H04R5/00

    CPC分类号: G10L19/008

    摘要: An audio signal having at least two channels can be efficiently down-mixed into a downmix signal and a residual signal, when the down-mixing rule used depends on a spatial parameter that is derived from the audio signal and that is post-processed by a limiter to apply a certain limit to the derived spatial parameter with the aim of avoiding instabilities during the up-mixing or down-mixing process. By having a down-mixing rule that dynamically depends on parameters describing an interrelation between the audio channels, one can assure that the energy within the down-mixed residual signal is as minimal as possible, which is advantageous in the view of coding efficiency. By post processing the spatial parameter with a limiter prior to using it in the down-mixing, one can avoid instabilities in the down- or up-mixing, which otherwise could result in a disturbance of the spatial perception of the encoded or decoded audio signal.

    摘要翻译: 具有至少两个通道的音频信号可以被有效地下混合成降混信号和残留信号,当所使用的下混合规则取决于从音频信号导出的空间参数,并且被后处理时 限制器对推导的空间参数应用一定的限制,目的是避免在上混合或下混合过程中的不稳定性。 通过具有动态地取决于描述音频通道之间的相互关系的参数的下混合规则,可以确保下混合残差信号内的能量尽可能地最小,这在编码效率方面是有利的。 通过在下混合中使用限幅器之前对空间参数进行后处理,可以避免下混合或上混合中的不稳定性,否则可能导致编码或解码的音频信号的空间感知的干扰 。

    MDCT-based complex prediction stereo coding
    45.
    发明授权
    MDCT-based complex prediction stereo coding 有权
    基于MDCT的复合预测立体声编码

    公开(公告)号:US09378745B2

    公开(公告)日:2016-06-28

    申请号:US13638901

    申请日:2011-04-06

    IPC分类号: G10L19/06 H04S3/00 G10L19/008

    摘要: The invention provides methods and devices for stereo encoding and decoding using complex prediction in the frequency domain. In one embodiment, a decoding method, for obtaining an output stereo signal from an input stereo signal encoded by complex prediction coding and comprising first frequency-domain representations of two input channels, comprises the upmixing steps of: (i) computing a second frequency-domain representation of a first input channel; and (ii) computing an output channel on the basis of the first and second frequency-domain representations of the first input channel, the first frequency-domain representation of the second input channel and a complex prediction coefficient. The method comprises applying independent bandwidth limits for the input channels.

    摘要翻译: 本发明提供使用频域中的复数预测的立体声编码和解码的方法和装置。 在一个实施例中,一种用于从由复数预测编码编码的输入立体声信号中获得输出立体声信号并包括两个输入声道的第一频域表示的解码方法包括上混合步骤:(i)计算第二频率 - 第一输入通道的域表示; 以及(ii)基于第一输入通道的第一和第二频域表示,第二输入通道的第一频域表示和复数预测系数来计算输出通道。 该方法包括对输入通道应用独立的带宽限制。

    Apparatus and method for generating a high frequency audio signal using adaptive oversampling
    46.
    发明授权
    Apparatus and method for generating a high frequency audio signal using adaptive oversampling 有权
    使用自适应过采样来生成高频音频信号的装置和方法

    公开(公告)号:US09159337B2

    公开(公告)日:2015-10-13

    申请号:US13503248

    申请日:2010-05-25

    CPC分类号: G10L21/038 G10L19/025

    摘要: An apparatus for generating a high frequency audio signal that includes an analyzer for analyzing an input signal to determine a transient information adaptively. Additionally a spectral converter is provided for converting the input signal into an input spectral representation. A spectral processor processes the input spectral representation to generate a processed spectral representation including values for higher frequencies than the input spectral representation. A time converter is configured for converting the processed spectral representation to a time representation, wherein the spectral converter or the time converter are controllable to perform a frequency domain oversampling for the first portion of the input signal having the transient information associated and to not perform the frequency domain oversampling for the second portion of the input signal not having the associated transient information.

    摘要翻译: 一种用于产生高频音频信号的装置,包括用于分析输入信号的分析器,以自适应地确定瞬时信息。 另外,提供了用于将输入信号转换为输入频谱表示的频谱转换器。 频谱处理器处理输入频谱表示以产生包括比输入频谱表示更高频率的值的经处理的频谱表示。 时间转换器被配置为将经处理的频谱表示转换为时间表示,其中频谱转换器或时间转换器是可控制的,以对具有相关联的瞬态信息的输入信号的第一部分执行频域过采样,并且不执行 输入信号的第二部分不具有相关联的瞬态信息的频域过采样。

    MDCT-based complex prediction stereo coding
    47.
    发明授权
    MDCT-based complex prediction stereo coding 有权
    基于MDCT的复合预测立体声编码

    公开(公告)号:US09159326B2

    公开(公告)日:2015-10-13

    申请号:US13638900

    申请日:2011-04-06

    IPC分类号: H04R5/00 G10L19/008

    摘要: The invention provides methods and devices for stereo encoding and decoding using complex prediction in the frequency domain. In one embodiment, a decoding method, for obtaining an output stereo signal from an input stereo signal encoded by complex prediction coding and comprising first frequency-domain representations of two input channels, comprises the upmixing steps of: (i) computing a second frequency-domain representation of a first input channel; and (ii) computing an output channel on the basis of the first and second frequency-domain representations of the first input channel, the first frequency domain representation of the second input channel and a complex prediction coefficient. The method comprises performing frequency-domain modifications selectively before or after upmixing.

    摘要翻译: 本发明提供使用频域中的复数预测的立体声编码和解码的方法和装置。 在一个实施例中,一种用于从由复数预测编码编码的输入立体声信号中获得输出立体声信号并包括两个输入声道的第一频域表示的解码方法包括上混合步骤:(i)计算第二频率 - 第一输入通道的域表示; 以及(ii)基于第一输入通道的第一和第二频域表示,第二输入通道的第一频域表示和复数预测系数来计算输出通道。 该方法包括在混合之前或之后选择性地执行频域修改。

    Audio signal decoder, audio signal encoder, methods and computer program using a sampling rate dependent time-warp contour encoding
    48.
    发明授权
    Audio signal decoder, audio signal encoder, methods and computer program using a sampling rate dependent time-warp contour encoding 有权
    音频信号解码器,音频信号编码器,方法和计算机程序使用采样率相关的时间 - 扭曲轮廓编码

    公开(公告)号:US09129597B2

    公开(公告)日:2015-09-08

    申请号:US13604869

    申请日:2012-09-06

    摘要: An audio signal decoder configured to provide a decoded audio signal representation on the basis of an encoded audio signal representation including a sampling frequency information, an encoded time warp information and an encoded spectrum representation includes a time warp calculator and a warp decoder. The time warp calculator is configured to adapt a mapping rule for mapping codewords of the encoded time warp information onto decoded time warp values describing the decoded time warp information in dependence on the sampling frequency information. The warp decoder is configured to provide the decoded audio signal representation on the basis of the encoded spectrum representation and in dependence on the decoded time warp information.

    摘要翻译: 音频信号解码器被配置为基于包括采样频率信息,编码时间扭曲信息和编码频谱表示的编码音频信号表示来提供解码音频信号表示,包括时间扭曲计算器和扭曲解码器。 时间扭曲计算器被配置成适应用于将编码的时间扭曲信息的码字映射到根据采样频率信息描述解码的时间扭曲信息的解码的时间扭曲值的映射规则。 翘曲解码器被配置为基于编码的频谱表示并且依赖于解码的时间扭曲信息来提供解码的音频信号表示。

    Oversampling in a combined transposer filter bank
    49.
    发明授权
    Oversampling in a combined transposer filter bank 有权
    组合式转换器滤波器组中的过采样

    公开(公告)号:US08886346B2

    公开(公告)日:2014-11-11

    申请号:US13499893

    申请日:2010-05-25

    IPC分类号: G06F17/00 G10L21/038

    摘要: The present invention relates to coding of audio signals, and in particular to high frequency reconstruction methods including a frequency domain harmonic transposer. A system and method for generating a high frequency component of a signal from a low frequency component of the signal is described. The system comprises an analysis filter bank (501) comprising an analysis transformation unit (601) having a frequency resolution of Δf; and an analysis window (611) having a duration of DA; the analysis filter bank (501) being configured to provide a set of analysis subband signals from the low frequency component of the signal; a nonlinear processing unit (502, 650) configured to determine a set of synthesis subband signals based on a portion of the set of analysis subband signals, wherein the portion of the set of analysis subband signals is phase shifted by a transposition order T; and a synthesis filter bank (504) comprising a synthesis transformation unit (602) having a frequency resolution of QΔf; and a synthesis window (612) having a duration of Ds; the synthesis filter bank (504) being configured to generate the high frequency component of the signal from the set of synthesis subband signals; wherein Q is a frequency resolution factor with Q≧1 and smaller than the transposition order T; and wherein the value of the product of the frequency resolution Δf and the duration DA of the analysis filter bank is selected based on the frequency resolution factor Q.

    摘要翻译: 本发明涉及音频信号的编码,特别涉及包括频域谐波转移器的高频重构方法。 描述了用于从信号的低频分量产生信号的高频分量的系统和方法。 该系统包括分析滤波器组(501),其包括频率分辨率为&Dgr; f的分析变换单元(601) 和具有DA持续时间的分析窗口(611); 所述分析滤波器组(501)被配置为从所述信号的低频分量提供一组分析子带信号; 非线性处理单元(502,650),被配置为基于所述一组分析子带信号来确定一组合成子带信号,其中所述一组分析子带信号的所述部分被相移一个换位阶数T; f)具有频率分辨率为Q&Dgr; f的合成变换单元(602)的合成滤波器组(504) 和具有持续时间Ds的合成窗口(612); 所述合成滤波器组(504)被配置为从所述一组合成子带信号中产生所述信号的高频分量; 其中Q是具有Q≥1且小于置换次数T的频率分辨率因子; 并且其中基于频率分辨率因子Q选择频率分辨率&Dgr; f的乘积值和分析滤波器组的持续时间DA。

    Methods for improved performance of prediction based multi-channel reconstruction
    50.
    发明授权
    Methods for improved performance of prediction based multi-channel reconstruction 有权
    改进基于预测的多通道重建性能的方法

    公开(公告)号:US08515083B2

    公开(公告)日:2013-08-20

    申请号:US11290370

    申请日:2005-11-29

    IPC分类号: H04R5/00 G06F17/00 G10L19/00

    摘要: For a multi-channel reconstruction of audio signals based on at least one base channel, an energy measure is used for compensating energy losses due to an predictive upmix. The energy measure can be applied in the encoder or the decoder. Furthermore, a decorrelated signal is added to output channels generated by an energy-loss introducing upmix procedure. The energy of the decorrelated signal is smaller than or equal to an energy error introduced by the predictive upmix. Thus, problems occurring for prediction based up-mix methods such as up-mixing signals that are coded with High Frequency Reconstruction techniques are solved, so that the correct correlation between the up-mixed channels is obtained or the up-mix is adapted to arbitrary down-mixes.

    摘要翻译: 对于基于至少一个基本信道的音频信号的多声道重建,能量测量被用于由于预测上混补偿能量损失。 能量测量可以应用于编码器或解码器。 此外,将去相关信号添加到通过能量损失引入上混程序产生的输出通道。 解相关信号的能量小于或等于由预测上混引入的能量误差。 因此,解决了用于基于预测的上混合方法(例如用高频重建技术编码的上混合信号)所出现的问题,从而获得上混合信道之间的正确相关性,或者将混合信号适配为任意的 下混