METHOD AND SYSTEM FOR CODING METADATA IN AUDIO STREAMS AND FOR EFFICIENT BITRATE ALLOCATION TO AUDIO STREAMS CODING

    Publication No.: US20220319524A1

    Publication date: 2022-10-06

    Application No.: US17596567

    Filing date: 2020-07-07

    Inventor: Vaclav EKSLER

    Abstract: A system and method code an object-based audio signal comprising audio objects in response to audio streams with associated metadata. In the system and method, a metadata processor codes the metadata and generates information about bit-budgets for the coding of the metadata of the audio objects. An encoder codes the audio streams while a bit-budget allocator is responsive to the information about the bit-budgets for the coding of the metadata of the audio objects from the metadata processor to allocate bitrates for the coding of the audio streams by the encoder.
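
The hand-off between the metadata processor and the bit-budget allocator can be sketched as follows. This is only an illustration under stated assumptions, not the method claimed in the application: the function name `allocate_bitrates`, the 20 ms frame length, and the equal split of the remaining bits are all hypothetical choices that the abstract does not specify.

```python
from typing import List

FRAME_MS = 20  # assumed frame length; the abstract does not state one


def allocate_bitrates(total_bitrate_bps: int, metadata_bits: List[int]) -> List[int]:
    """Split what is left of a frame's bit budget among the audio streams.

    metadata_bits[i] is the number of bits a (hypothetical) metadata
    processor reports having spent on audio object i in the current frame.
    Returns one core-coder bitrate per stream, in bits per second.
    """
    if not metadata_bits:
        raise ValueError("at least one audio stream is required")
    frames_per_second = 1000 // FRAME_MS
    total_bits = total_bitrate_bps // frames_per_second
    remaining = total_bits - sum(metadata_bits)
    if remaining < 0:
        raise ValueError("metadata alone exceeds the frame bit budget")
    base, extra = divmod(remaining, len(metadata_bits))
    # Equal split (an assumption); the first `extra` streams get one spare bit.
    per_stream_bits = [base + (1 if i < extra else 0)
                       for i in range(len(metadata_bits))]
    return [bits * frames_per_second for bits in per_stream_bits]
```

With a 48 kbit/s total and 60 metadata bits in a frame, the 900 remaining bits are shared by the three streams. A real allocator would more likely weight the split by signal content than divide it equally.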

    METHODS AND DEVICES FOR DETECTING AN ATTACK IN A SOUND SIGNAL TO BE CODED AND FOR CODING THE DETECTED ATTACK

    Publication No.: US20220180884A1

    Publication date: 2022-06-09

    Application No.: US17602071

    Filing date: 2020-05-01

    Inventor: Vaclav Eksler

    IPC classification: G10L19/22 G10L25/21 G10L25/93

    Abstract: A method and device for detecting an attack in a sound signal to be coded, wherein the sound signal is processed in successive frames each including a number of sub-frames. The device comprises a first-stage attack detector for detecting the attack in a last sub-frame of a current frame, and a second-stage attack detector for detecting the attack in one of the sub-frames of the current frame, including the sub-frames preceding the last sub-frame. No attack is detected when the current frame is not an active frame previously classified to be coded using a generic coding mode. A method and device for coding an attack in a sound signal are also provided. The coding device comprises the above-mentioned attack detecting device and an encoder of the sub-frame comprising the detected attack using a transition coding mode using a glottal-shape codebook populated with glottal impulse shapes.
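
The two-stage structure can be illustrated with a short sketch. The abstract does not say how each stage decides that an attack is present, so the energy-ratio test, the `ATTACK_RATIO` threshold, and the function names below are assumptions made only for illustration. Only the gating on active generic-mode frames and the order of the two stages come from the abstract.

```python
from typing import List, Optional, Sequence

ATTACK_RATIO = 8.0  # assumed energy-jump threshold; not given in the abstract


def energy(samples: Sequence[float]) -> float:
    """Sum of squared samples of one sub-frame."""
    return sum(s * s for s in samples)


def detect_attack(subframes: List[Sequence[float]],
                  is_active: bool,
                  is_generic: bool) -> Optional[int]:
    """Return the index of the sub-frame holding an attack, or None."""
    # Gate: only active frames classified for the generic coding mode qualify.
    if not (is_active and is_generic) or len(subframes) < 2:
        return None
    energies = [energy(sf) for sf in subframes]
    eps = 1e-9  # avoids a zero reference level in silent sub-frames

    # Stage 1 (assumed criterion): the last sub-frame is compared with the
    # mean energy of the sub-frames that precede it.
    last = len(energies) - 1
    previous_mean = sum(energies[:last]) / last
    if energies[last] > ATTACK_RATIO * (previous_mean + eps):
        return last

    # Stage 2 (assumed criterion): each sub-frame is compared with its
    # predecessor. Sub-frame 0 is skipped because this sketch keeps no
    # memory of the previous frame.
    for i in range(1, len(energies)):
        if energies[i] > ATTACK_RATIO * (energies[i - 1] + eps):
            return i
    return None
```

A coder built along the lines of the abstract would then code the returned sub-frame in the transition coding mode; that step is not modelled here.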

    Device and method for quantizing the gains of the adaptive and fixed contributions of the excitation in a CELP codec

    Publication No.: US10115408B2

    Publication date: 2018-10-30

    Application No.: US15610268

    Filing date: 2017-05-31

    Abstract: A device and method for quantizing a gain of a fixed contribution of an excitation in a frame, including sub-frames, of a coded sound signal, wherein the gain of the fixed excitation contribution is estimated in a sub-frame using a parameter representative of a classification of the frame. The gain of the fixed excitation contribution is then quantized in the sub-frame using the estimated gain. The device and method are used in jointly quantizing gains of adaptive and fixed contributions of an excitation in a frame of a coded sound signal. For retrieving a quantized gain of a fixed contribution of an excitation in a sub-frame of a frame, the gain of the fixed excitation contribution is estimated using a parameter representative of a classification of the frame, a gain codebook supplies a correction factor in response to a received gain codebook index, and a multiplier multiplies the estimated gain by the correction factor to provide a quantized gain of the fixed excitation contribution.
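
The decoder-side relation in the abstract is that the quantized fixed-contribution gain equals the estimated gain multiplied by a correction factor read from the gain codebook. The sketch below illustrates that relation for the fixed gain alone. The per-class estimates, the codebook values, and all function names are hypothetical, and the joint quantization with the adaptive gain is left out.

```python
# Hypothetical values for illustration only; none come from the patent.
ESTIMATED_GAIN_BY_CLASS = {"unvoiced": 0.5, "voiced": 1.0, "generic": 2.0}
CORRECTION_CODEBOOK = (0.5, 0.75, 1.0, 1.25, 1.5, 2.0)


def estimate_fixed_gain(frame_class: str) -> float:
    """Estimate the fixed-contribution gain from the frame classification."""
    return ESTIMATED_GAIN_BY_CLASS[frame_class]


def quantize_fixed_gain(target_gain: float, frame_class: str) -> int:
    """Encoder side: pick the index whose correction factor, applied to
    the estimated gain, lands closest to the target gain."""
    estimated = estimate_fixed_gain(frame_class)
    errors = [abs(target_gain - estimated * factor)
              for factor in CORRECTION_CODEBOOK]
    return errors.index(min(errors))


def dequantize_fixed_gain(index: int, frame_class: str) -> float:
    """Decoder side: estimated gain multiplied by the correction factor."""
    return estimate_fixed_gain(frame_class) * CORRECTION_CODEBOOK[index]
```

Because both sides derive the same estimate from the frame classification, only the codebook index needs to be transmitted.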

    Device and Method for Quantizing the Gains of the Adaptive and Fixed Contributions of the Excitation in a CELP Codec

    Publication No.: US20170186439A1

    Publication date: 2017-06-29

    Application No.: US15461945

    Filing date: 2017-03-17

    Abstract: A device and method for quantizing a gain of a fixed contribution of an excitation in a frame, including sub-frames, of a coded sound signal, wherein the gain of the fixed excitation contribution is estimated in a sub-frame using a parameter representative of a classification of the frame. The gain of the fixed excitation contribution is then quantized in the sub-frame using the estimated gain. The device and method are used in jointly quantizing gains of adaptive and fixed contributions of an excitation in a frame of a coded sound signal. For retrieving a quantized gain of a fixed contribution of an excitation in a sub-frame of a frame, the gain of the fixed excitation contribution is estimated using a parameter representative of a classification of the frame, a gain codebook supplies a correction factor in response to a received gain codebook index, and a multiplier multiplies the estimated gain by the correction factor to provide a quantized gain of the fixed excitation contribution.

    Device and Method for Reducing Quantization Noise in a Time-Domain Decoder

    Document type: Patent application

    Legal status: In force

    Publication No.: US20160300582A1

    Publication date: 2016-10-13

    Application No.: US15187464

    Filing date: 2016-06-20

    Abstract: The present disclosure relates to a device and method for reducing quantization noise in a sound signal contained in a time-domain excitation decoded by a time-domain decoder. A future frame time-domain excitation is evaluated based on the decoded time-domain excitation. A concatenated time-domain excitation is produced from the decoded time-domain excitation and the time-domain excitation of the future frame and is converted into a frequency-domain excitation. A weighting mask is produced for retrieving spectral information lost in the quantization noise. The frequency-domain excitation is modified to increase spectral dynamics by application of the weighting mask. The modified frequency-domain excitation is converted into a modified time-domain excitation. The latter conversion is delay-less. In an embodiment, the weighting mask may be produced using time averaging or frequency averaging or a combination of time and frequency averaging of the frequency-domain excitation. The method and device can be used for improving music content rendering of linear-prediction (LP) based codecs.

    Device and method for reducing quantization noise in a time-domain decoder

    Document type: Granted patent

    Legal status: In force

    Publication No.: US09384755B2

    Publication date: 2016-07-05

    Application No.: US14196585

    Filing date: 2014-03-04

    Abstract: The present disclosure relates to a device and method for reducing quantization noise in a signal contained in a time-domain excitation decoded by a time-domain decoder. The decoded time-domain excitation is converted into a frequency-domain excitation. A weighting mask is produced for retrieving spectral information lost in the quantization noise. The frequency-domain excitation is modified to increase spectral dynamics by application of the weighting mask. The modified frequency-domain excitation is converted into a modified time-domain excitation. The method and device can be used for improving music content rendering of linear-prediction (LP) based codecs. Optionally, a synthesis of the decoded time-domain excitation may be classified into one of a first set of excitation categories and a second set of excitation categories, the second set including INACTIVE or UNVOICED categories, the first set including an OTHER category.
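
The processing chain in the abstract (time-domain excitation to frequency domain, weighting mask, back to time domain) can be sketched as follows. The abstract does not define the mask, so the power-law mask below, which boosts bins above the mean magnitude and attenuates bins below it, is an assumption, as are the function names and the use of a plain DFT. The optional classification step is not modelled.

```python
import cmath
from typing import List


def dft(x: List[float]) -> List[complex]:
    """Naive discrete Fourier transform; adequate for short frames."""
    n_samples = len(x)
    return [sum(x[n] * cmath.exp(-2j * cmath.pi * k * n / n_samples)
                for n in range(n_samples))
            for k in range(n_samples)]


def idft(spectrum: List[complex]) -> List[float]:
    """Inverse of dft(); returns the real part of each sample."""
    n_bins = len(spectrum)
    return [(sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * n / n_bins)
                 for k in range(n_bins)) / n_bins).real
            for n in range(n_bins)]


def weighting_mask(spectrum: List[complex], power: float = 0.5) -> List[float]:
    """Assumed mask: weight > 1 above the mean magnitude, < 1 below it."""
    magnitudes = [abs(c) for c in spectrum]
    mean = sum(magnitudes) / len(magnitudes) or 1.0  # guard all-zero input
    return [(m / mean) ** power for m in magnitudes]


def reduce_quantization_noise(excitation: List[float]) -> List[float]:
    """Convert to frequency domain, apply the mask, convert back."""
    spectrum = dft(excitation)
    mask = weighting_mask(spectrum)
    modified = [c * w for c, w in zip(spectrum, mask)]
    return idft(modified)
```

The mask depends only on bin magnitudes, which are symmetric for a real input, so the modified spectrum stays conjugate-symmetric and the output excitation is real. Raising the ratio between strong and weak bins is one simple way to "increase spectral dynamics"; the patent's own mask may be built quite differently.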

    Multi-resolution switched audio encoding/decoding scheme

    Document type: Granted patent

    Legal status: In force

    Publication No.: US09043215B2

    Publication date: 2015-05-26

    Application No.: US13707192

    Filing date: 2012-12-06

    Abstract: An audio encoder for encoding an audio signal has a first coding branch, the first coding branch comprising a first converter for converting a signal from a time domain into a frequency domain. Furthermore, the audio encoder has a second coding branch comprising a second time/frequency converter. Additionally, a signal analyzer for analyzing the audio signal is provided. The signal analyzer, on the one hand, determines whether an audio portion is effective in the encoder output signal as a first encoded signal from the first encoding branch or as a second encoded signal from a second encoding branch. On the other hand, the signal analyzer determines a time/frequency resolution to be applied by the converters when generating the encoded signals. An output interface includes, in addition to the first encoded signal and the second encoded signal, resolution information identifying the resolution used by the first time/frequency converter and by the second time/frequency converter.

    SWITCHING BETWEEN STEREO CODING MODES IN A MULTICHANNEL SOUND CODEC

    Publication No.: US20230051420A1

    Publication date: 2023-02-16

    Application No.: US17758115

    Filing date: 2021-02-01

    Inventor: Vaclav EKSLER

    IPC classification: G10L19/008 G10L21/04

    Abstract: A method and device for encoding a stereo sound signal comprise stereo encoders using stereo modes operating in time domain (TD), in frequency domain (FD) or in modified discrete cosine transform (MDCT) domain. A controller controls switching between the TD, FD and MDCT stereo modes. Upon switching from one stereo mode to the other, the switching controller may (a) recalculate at least one length of down-processed/mixed signal in a current frame of the stereo sound signal, (b) reconstruct a down-processed/mixed signal and also other signals related to the other stereo mode in the current frame, (c) adapt data structures and/or memories for coding the stereo sound signal in the current frame using the other stereo mode, and/or (d) alter a TD stereo channel down-mixing to maintain a correct phase of left and right channels of the stereo sound signal. A corresponding stereo sound signal decoding method and device are described.