Method for predicting high frequency band signal, encoding device, and decoding device

    Publication number: US09704500B2

    Publication date: 2017-07-11

    Application number: US14808145

    Application date: 2015-07-24

    CPC classification number: G10L19/20 G10L19/04 G10L21/02 G10L21/038

    Abstract: A method includes obtaining a signal type of an audio signal and a low frequency band signal of the audio signal, where the audio signal includes the low frequency band signal and a high frequency band signal; obtaining a frequency envelope of the high frequency band signal according to the signal type; predicting an excitation signal of the high frequency band signal according to the low frequency band signal; and restoring the high frequency band signal according to the frequency envelope of the high frequency band signal and the excitation signal of the high frequency band signal. By using the technical solutions of the embodiments of the present invention, an error existing between a high frequency band signal obtained by prediction and an actual high frequency band signal can be effectively reduced, and an accuracy rate of the predicted high frequency band signal can be increased.
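The restoration step in this abstract (shaping a predicted excitation with a frequency envelope) can be illustrated with a minimal sketch. Everything specific here is an assumption, not taken from the patent: the function name `restore_high_band`, the choice to predict the excitation by copying low-band coefficients, and the per-band RMS normalization. The signal-type-dependent derivation of the envelope is not modeled; the envelope is simply passed in.

```python
def restore_high_band(low_band, envelope, band_size):
    """Shape an excitation copied from the low band with a per-band envelope.

    low_band  : low frequency band coefficients (hypothetical input)
    envelope  : target RMS level for each high-band sub-band (hypothetical)
    band_size : number of coefficients per high-band sub-band (hypothetical)
    """
    n_high = len(envelope) * band_size
    # Assumed excitation prediction: repeat the low-band coefficients.
    excitation = [low_band[i % len(low_band)] for i in range(n_high)]

    restored = []
    for b, target in enumerate(envelope):
        band = excitation[b * band_size:(b + 1) * band_size]
        rms = (sum(x * x for x in band) / band_size) ** 0.5
        # Normalize the band, then scale it to the envelope value.
        scale = target / rms if rms > 0 else 0.0
        restored.extend(scale * x for x in band)
    return restored
```

For example, `restore_high_band([1, -1, 2, -2], [3, 1], 2)` scales the first band up to RMS 3 and the second down to RMS 1.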

    Speech/audio signal processing method and apparatus

    Publication number: US09691396B2

    Publication date: 2017-06-27

    Application number: US14470559

    Application date: 2014-08-27

    Inventor: Zexin Liu, Lei Miao

    CPC classification number: G10L19/00 G10L19/0204 G10L19/083

    Abstract: The present invention discloses a speech/audio signal processing method and apparatus. In an embodiment, the speech/audio signal processing method includes: when a speech/audio signal switches bandwidth, obtaining an initial high frequency signal corresponding to a current frame of speech/audio signal; obtaining a time-domain global gain parameter of the initial high frequency signal; performing weighting processing on an energy ratio and the time-domain global gain parameter, and using an obtained weighted value as a predicted global gain parameter, where the energy ratio is a ratio between energy of a historical frame of high frequency time-domain signal and energy of a current frame of initial high frequency signal; correcting the initial high frequency signal by using the predicted global gain parameter, to obtain a corrected high frequency time-domain signal; and synthesizing a current frame of narrow frequency time-domain signal and the corrected high frequency time-domain signal and outputting the synthesized signal.
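The weighting and correction steps described in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the function names, the single weight `alpha`, and the linear weighting form are hypothetical, since the abstract does not give the weighting rule. The energy ratio follows the abstract's definition (historical-frame high frequency energy over current-frame initial high frequency energy).

```python
def predicted_global_gain(prev_hf, init_hf, global_gain, alpha=0.5):
    """Weight the energy ratio against the time-domain global gain.

    prev_hf     : historical frame of the high frequency time-domain signal
    init_hf     : current frame of the initial high frequency signal
    global_gain : time-domain global gain parameter of init_hf
    alpha       : assumed weight on the energy ratio (not from the patent)
    """
    e_prev = sum(x * x for x in prev_hf)
    e_cur = sum(x * x for x in init_hf)
    # Fall back to a neutral ratio when the current frame is silent.
    ratio = e_prev / e_cur if e_cur > 0 else 1.0
    return alpha * ratio + (1.0 - alpha) * global_gain


def correct_high_band(init_hf, gain):
    """Apply the predicted global gain to the initial high frequency signal."""
    return [gain * x for x in init_hf]
```

With `prev_hf = [2, 2]`, `init_hf = [1, 1]`, and a global gain of 2, the energy ratio is 4 and the equally weighted predicted gain is 3.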

    METHOD AND APPARATUS FOR PROCESSING LOST FRAME

    Publication number: US20170103764A1

    Publication date: 2017-04-13

    Application number: US15385881

    Application date: 2016-12-21

    Abstract: Embodiments of the present application provide a method and an apparatus for recovering a lost frame in a received audio signal. The method for recovering a lost frame includes: determining an initial high-frequency band signal of a current lost frame; determining a gain of the current lost frame; determining gain adjustment information of the current lost frame; adjusting the gain of the current lost frame according to the gain adjustment information, to obtain an adjusted gain of the current lost frame; and adjusting the initial high-band signal according to the adjusted gain, to obtain a high-frequency band signal of the current lost frame. The method and the apparatus for recovering a lost frame provided in the embodiments of the present application can be used in an audio signal decoding process for low-loss recovery of a lost frame of the audio signal, resulting in improved performance of an audio signal decoder.

    Method and Apparatus for Allocating Bits of Audio Signal
    224.
    Invention application
    Legal status: active

    Publication number: US20170069329A1

    Publication date: 2017-03-09

    Application number: US15354641

    Application date: 2016-11-17

    CPC classification number: G10L19/002 G10L19/0204 G10L19/032 G10L19/035

    Abstract: A method and an apparatus for allocating bits of an audio signal. The method includes dividing a frequency band of an audio signal into multiple sub-bands, and quantizing a sub-band normalization factor of each sub-band; classifying the multiple sub-bands into multiple groups, and acquiring a sum of intra-group sub-band normalization factors of each group; performing initial inter-group bit allocation to determine the initial number of bits of each group; performing secondary inter-group bit allocation to allocate coding bits of the audio signal to at least one group; and allocating the bits of the audio signal to sub-bands in the group. The present disclosure can, by means of grouping, ensure relatively stable allocation in a previous frame and a next frame and reduce an impact of global allocation on local discontinuity in a case of low and medium bit rates.
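The two-level allocation in this abstract (first between groups, then to sub-bands inside each group) can be sketched as below. The proportional rule, the fixed `group_size`, and both function names are assumptions made for illustration; the abstract does not state how the initial or secondary inter-group allocation is computed, and the secondary step is not modeled. All normalization factors are assumed to be positive.

```python
def proportional_split(total, weights):
    """Split an integer bit budget in proportion to positive weights,
    handing leftover bits to the largest fractional remainders."""
    weight_sum = sum(weights)
    exact = [total * w / weight_sum for w in weights]
    shares = [int(e) for e in exact]
    leftover = total - sum(shares)
    by_remainder = sorted(range(len(weights)),
                          key=lambda i: exact[i] - shares[i],
                          reverse=True)
    for i in by_remainder[:leftover]:
        shares[i] += 1
    return shares


def allocate_bits(norm_factors, group_size, total_bits):
    """Allocate bits to groups by their summed normalization factors,
    then to the sub-bands inside each group."""
    groups = [norm_factors[i:i + group_size]
              for i in range(0, len(norm_factors), group_size)]
    group_sums = [sum(g) for g in groups]
    group_bits = proportional_split(total_bits, group_sums)
    return [proportional_split(bits, g) for bits, g in zip(group_bits, groups)]
```

Because the group totals are fixed before any sub-band sees a bit, a change in one sub-band only redistributes bits inside its own group, which is the stabilizing effect the abstract attributes to grouping.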

    SPEECH/AUDIO BITSTREAM DECODING METHOD AND APPARATUS
    225.
    Invention application
    Legal status: under examination (published)

    Publication number: US20160372122A1

    Publication date: 2016-12-22

    Application number: US15256018

    Application date: 2016-09-02

    Abstract: The present invention discloses a speech/audio bitstream decoding method including: acquiring a speech/audio decoding parameter of a current speech/audio frame, where the foregoing current speech/audio frame is a redundant decoded frame or a speech/audio frame previous to the foregoing current speech/audio frame is a redundant decoded frame; performing post processing on the acquired speech/audio decoding parameter according to speech/audio parameters of X speech/audio frames, where the foregoing X speech/audio frames include M speech/audio frames previous to the foregoing current speech/audio frame and/or N speech/audio frames next to the foregoing current speech/audio frame; and recovering a speech/audio signal by using the post-processed speech/audio decoding parameter of the foregoing current speech/audio frame. The technical solutions of the present invention help improve quality of an output speech/audio signal.

    Method and apparatus for allocating bit in audio signal
    226.
    Invention grant
    Legal status: active

    Publication number: US09424850B2

    Publication date: 2016-08-23

    Application number: US14595672

    Application date: 2015-01-13

    CPC classification number: G10L19/002 G10L19/0204

    Abstract: A method and an apparatus for allocating bits in an audio signal. The method includes dividing a frequency band of an audio signal into a plurality of subbands, quantizing a subband normalization factor of each subband; and an energy attribute of an audio signal of the corresponding group; allocating coding bits to at least one group, where a sum of coding bits allocated to the at least one group is the number of coding bits of the audio signal; and allocating the coding bits allocated to the at least one group to each subband in each group of the at least one group. In a case of a low or medium bit rate, the embodiments of the present invention can, by means of grouping, ensure relatively stable allocation of previous and subsequent frames and reduce impact of global allocation on partial discontinuity.

    METHOD AND APPARATUS FOR PREDICTING HIGH BAND EXCITATION SIGNAL
    227.
    Invention application
    Legal status: active

    Publication number: US20160210979A1

    Publication date: 2016-07-21

    Application number: US15080950

    Application date: 2016-03-25

    Inventor: Zexin Liu, Lei Miao

    Abstract: A method and an apparatus for predicting a high band excitation signal are disclosed. The method includes: acquiring, according to a received low band bitstream, a set of spectral frequency parameters that are arranged in an order of frequencies; calculating a spectral frequency parameter difference between every two spectral frequency parameters that have a same position interval; acquiring a minimum spectral frequency parameter difference from the calculated spectral frequency parameter differences; determining, according to a frequency bin that corresponds to the minimum spectral frequency parameter difference, a start frequency bin for predicting a high band excitation signal from a low band; and predicting the high band excitation signal from the low band according to the start frequency bin. By implementing embodiments of the present invention, a high band excitation signal can be better predicted, thereby improving performance of the high band excitation signal.
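The search for the start frequency bin can be sketched in a few lines. The details below are assumptions for illustration only: the parameters are taken to be in hertz, the bin is derived from the lower parameter of the closest pair using a hypothetical `bin_width_hz`, and the high band excitation is predicted by copying low band excitation from the start bin with wrap-around. The abstract itself only says that the start bin corresponds to the minimum difference.

```python
def start_bin_from_lsf(lsf_hz, bin_width_hz, interval=1):
    """Find the start bin from the closest pair of spectral frequency
    parameters that are `interval` positions apart.

    lsf_hz       : spectral frequency parameters in ascending order (Hz)
    bin_width_hz : width of one frequency bin in Hz (hypothetical)
    interval     : position interval between compared parameters
    """
    diffs = [lsf_hz[i + interval] - lsf_hz[i]
             for i in range(len(lsf_hz) - interval)]
    k = min(range(len(diffs)), key=diffs.__getitem__)
    # Assumed mapping: use the lower parameter of the closest pair.
    return int(lsf_hz[k] // bin_width_hz)


def predict_high_band_excitation(low_exc, start_bin, n_high):
    """Copy low band excitation from start_bin upward, wrapping as needed."""
    segment = low_exc[start_bin:]
    return [segment[i % len(segment)] for i in range(n_high)]
```

For parameters `[300, 700, 800, 1500]` and 100 Hz bins, the smallest gap is between 700 and 800, so the sketch starts copying at bin 7.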

    Method for Predicting Bandwidth Extension Frequency Band Signal, and Decoding Device
    228.
    Invention application
    Legal status: active

    Publication number: US20150332688A1

    Publication date: 2015-11-19

    Application number: US14806896

    Application date: 2015-07-23

    CPC classification number: G10L19/12 G10L19/02 G10L19/08 G10L21/038

    Abstract: A method for predicting a bandwidth extension frequency band signal includes demultiplexing a received bitstream to obtain a frequency domain signal; determining whether a highest frequency bin, to which a bit is allocated, of the frequency domain signal is less than a preset start frequency bin of a bandwidth extension frequency band; predicting an excitation signal of the bandwidth extension frequency band according to the determination; and predicting the bandwidth extension frequency band signal according to the predicted excitation signal of the bandwidth extension frequency band and a frequency envelope of the bandwidth extension frequency band.

    Vector Joint Encoding/Decoding Method and Vector Joint Encoder/Decoder
    229.
    Invention application
    Legal status: under examination (published)

    Publication number: US20150127328A1

    Publication date: 2015-05-07

    Application number: US14547677

    Application date: 2014-11-19

    Abstract: A vector joint encoding/decoding method and a vector joint encoder/decoder are provided. More than two vectors are jointly encoded, and an encoding index of at least one vector is split and then combined between different vectors, so that encoding idle spaces of different vectors can be recombined, which helps save encoding bits. Because an encoding index of a vector is split and the shorter split indexes are then recombined, the bit-width requirements on operating parts in encoding/decoding calculation are also reduced.
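One possible reading of the split-and-combine idea is sketched below; it is not claimed to be the patented scheme. Each index is split into a fixed number of low bits and a remaining high part, and the high parts of all vectors are merged into one mixed-radix number so that unused code space in each high part is shared. The function names, the `low_bits` parameter, and the mixed-radix combination are all assumptions.

```python
def split_index(index, low_bits):
    """Split an index into (high part, low_bits least significant bits)."""
    return index >> low_bits, index & ((1 << low_bits) - 1)


def joint_encode(indices, sizes, low_bits):
    """Combine the high parts of several indices into one number.

    indices : encoding index of each vector, 0 <= indices[k] < sizes[k]
    sizes   : number of possible index values for each vector
    """
    combined, lows = 0, []
    for index, size in zip(indices, sizes):
        high, low = split_index(index, low_bits)
        high_range = ((size - 1) >> low_bits) + 1  # count of high-part values
        combined = combined * high_range + high
        lows.append(low)
    return combined, lows


def joint_decode(combined, lows, sizes, low_bits):
    """Invert joint_encode by peeling high parts off in reverse order."""
    indices = []
    for size, low in zip(reversed(sizes), reversed(lows)):
        high_range = ((size - 1) >> low_bits) + 1
        combined, high = divmod(combined, high_range)
        indices.append((high << low_bits) | low)
    return indices[::-1]
```

As an example of the saving, two vectors with 80 index values each need 7 bits apiece (14 in total) when coded separately. With `low_bits=4`, each high part takes only 5 values, so the combined high part has 25 values and fits in 5 bits, for 5 + 4 + 4 = 13 bits. The combined number is also narrower than a full product of the two indices, which eases bit-width requirements.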

    METHOD AND APPARATUS FOR GENERATING AND RESTORING DOWNMIXED SIGNAL
    230.
    Invention application
    Legal status: active

    Publication number: US20140211947A1

    Publication date: 2014-07-31

    Application number: US14227695

    Application date: 2014-03-27

    CPC classification number: H04S3/02 G10L19/008 G10L19/02

    Abstract: An embodiment of the present invention provides a method for generating a downmixed signal, including: performing a time-frequency transform on a received left sound channel signal and a received right sound channel signal to obtain a frequency domain signal, and dividing the frequency domain signal into several frequency bands; calculating a sound channel energy ratio and a sound channel phase difference of each frequency band; calculating a phase difference between the downmixed signal and a first sound channel signal in each frequency band according to the sound channel energy ratio and the sound channel phase difference; and calculating a frequency domain downmixed signal according to the left sound channel signal, the right sound channel signal, and the phase difference between the downmixed signal and the first sound channel signal in each frequency band. This method effectively improves quality of stereo encoding and decoding.
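The per-band quantities named in this abstract can be computed as in the sketch below. The energy ratio and the inter-channel phase difference follow standard definitions. The rule for the phase difference between the downmix and the left channel, `theta = phase_diff / (1 + energy_ratio)`, and the magnitude rule for the downmix are assumptions chosen for illustration, as the abstract does not state them. The right channel energy is assumed to be nonzero.

```python
import cmath


def band_parameters(left, right):
    """Return (energy ratio, phase difference) for one frequency band.

    left, right : complex frequency-domain coefficients of the band
    """
    e_left = sum(abs(x) ** 2 for x in left)
    e_right = sum(abs(x) ** 2 for x in right)
    cross = sum(l * r.conjugate() for l, r in zip(left, right))
    energy_ratio = e_left / e_right   # assumes e_right > 0
    phase_diff = cmath.phase(cross)   # phase of left relative to right
    return energy_ratio, phase_diff


def downmix_band(left, right):
    """Build a downmix whose phase lies between the two channels.

    Assumed rule: the stronger the left channel, the closer the downmix
    phase stays to the left channel phase.
    """
    energy_ratio, phase_diff = band_parameters(left, right)
    theta = phase_diff / (1.0 + energy_ratio)
    return [0.5 * (abs(l) + abs(r)) * cmath.exp(1j * (cmath.phase(l) - theta))
            for l, r in zip(left, right)]
```

For a band with `left = [1+0j]` and `right = [1j]`, the channels have equal energy and are a quarter turn apart, so this sketch places the downmix phase halfway between them.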
