Voice signal encoding and decoding method, device, and codec system

    公开(公告)号:US09672830B2

    公开(公告)日:2017-06-06

    申请号:US13632905

    申请日:2012-10-01

    Inventor: Fengyan Qi Lei Miao

    CPC classification number: G10L19/0017 G10L19/24 H04B7/0647

    Abstract: A voice signal encoding and decoding method, device, and codec system are provided. The coding method includes: encoding an input voice signal to obtain a broadband code stream, where the broadband code stream includes a core layer bit stream and an extension enhancement layer bit stream (101); compressing the core layer bit stream to obtain a compressed code stream (102); and packing the compressed code stream and the extension enhancement layer bit stream to obtain a packed code stream (103). The core layer bit stream is compressed, and the compressed code stream and the extension enhancement layer bit stream are packed, thereby reducing transmission bandwidth occupied by the input voice signal. Since the broadband voice encoding is performed on the input voice signal, a broadband voice code stream is transmitted by using narrowband transmission bandwidth, thereby improving the cost performance of voice signal transmission.

    METHOD AND APPARATUS FOR PROCESSING TEMPORAL ENVELOPE OF AUDIO SIGNAL, AND ENCODER

    公开(公告)号:US20170098451A1

    公开(公告)日:2017-04-06

    申请号:US15372130

    申请日:2016-12-07

    Inventor: Zexin Liu Lei Miao

    Abstract: A method and an apparatus for processing a temporal envelope of an audio signal, and an encoder are disclosed. When multiple temporal envelopes are solved, continuity of signal energy can be well maintained, and in addition, complexity of calculating a temporal envelope is reduced. The method includes: obtaining a high-band signal of the current frame audio signal according to the received current frame audio signal; dividing the high-band signal of the current frame signal into M subframes according to a predetermined temporal envelope quantity M, where M is an integer, M is greater than or equal to 2; calculating a temporal envelope of each of the subframes; performing windowing on the first subframe of the M subframes and the last subframe of the M subframes by using an asymmetric window function; and performing windowing on a subframe except the first subframe and the last subframe of the M subframes.

    Method and apparatus for allocating bits of audio signal
    286.
    发明授权
    Method and apparatus for allocating bits of audio signal 有权
    用于分配音频信号位的方法和装置

    公开(公告)号:US09530420B2

    公开(公告)日:2016-12-27

    申请号:US14675031

    申请日:2015-03-31

    CPC classification number: G10L19/002 G10L19/0204 G10L19/032 G10L19/035

    Abstract: A method and an apparatus for allocating bits of an audio signal. The method includes dividing a frequency band of an audio signal into multiple sub-bands, and quantizing a sub-band normalization factor of each sub-band; classifying the multiple sub-bands into multiple groups, and acquiring a sum of intra-group sub-band normalization factors of each group; performing initial inter-group bit allocation to determine the initial number of bits of each group; performing secondary inter-group bit allocation to allocate coding bits of the audio signal to at least one group; and allocating the bits of the audio signal to sub-bands in the group. The present invention can, by means of grouping, ensure relatively stable allocation in a previous frame and a next frame and reduce an impact of global allocation on local discontinuity in a case of low and medium bit rates.

    Abstract translation: 一种用于分配音频信号的位的方法和装置。 该方法包括将音频信号的频带划分成多个子带,并量化每个子带的子带归一化因子; 将多个子带分为多个组,并获取每组的组内子带归一化因子之和; 执行初始组间比特分配以确定每组的初始比特数; 执行次要组间比特分配以将音频信号的编码比特分配给至少一个组; 并将音频信号的比特分配给组中的子带。 本发明可以通过分组确保在先前帧和下一帧中的相对稳定的分配,并且在低和中比特率的情况下减少全局分配对局部不连续性的影响。

    Method and apparatus for generating and restoring downmixed signal
    287.
    发明授权
    Method and apparatus for generating and restoring downmixed signal 有权
    用于产生和恢复下混合信号的方法和装置

    公开(公告)号:US09516447B2

    公开(公告)日:2016-12-06

    申请号:US14227695

    申请日:2014-03-27

    CPC classification number: H04S3/02 G10L19/008 G10L19/02

    Abstract: An embodiment of the present invention provides a method for generating a downmixed signal, including: performing a time-frequency transform on a received left sound channel signal and a received right sound channel signal to obtain a frequency domain signal, and dividing the frequency domain signal into several frequency bands; calculating a sound channel energy ratio and a sound channel phase difference of each frequency band; calculating a phase difference between the downmixed signal and a first sound channel signal in each frequency band according to the sound channel energy ratio and the sound channel phase difference; and calculating a frequency domain downmixed signal according to the left sound channel signal, the right sound channel signal, and the phase difference between the downmixed signal and the first sound channel signal in each frequency band. This method effectively improves quality of stereo encoding and decoding.

    Abstract translation: 本发明的实施例提供了一种用于产生下混合信号的方法,包括:对接收到的左声道信号和接收到的右声道信号执行时频变换以获得频域信号,并且将频域信号 进入几个频带; 计算每个频带的声道能量比和声道相位差; 根据声道能量比和声道相位差计算每个频带中的下混合信号与第一声道信号之间的相位差; 以及根据左声道信号,右声道信号以及每个频带中的下混合信号和第一声道信号之间的相位差来计算频域缩混信号。 该方法有效提高立体声编码和解码质量。

    Audio signal coding method and apparatus
    288.
    发明授权
    Audio signal coding method and apparatus 有权
    音频信号编码方法及装置

    公开(公告)号:US09514762B2

    公开(公告)日:2016-12-06

    申请号:US15011824

    申请日:2016-02-01

    Inventor: Lei Miao Zexin Liu

    Abstract: The present invention relates to an audio signal coding method and apparatus. The method includes: categorizing audio signals into high-frequency audio signals and low-frequency audio signals; coding the low-frequency audio signals by using a corresponding low-frequency coding manner according to characteristics of low-frequency audio signals; and selecting a bandwidth extension mode to code the high-frequency audio signals according to the low-frequency coding manner and/or characteristics of the audio signals.

    Abstract translation: 本发明涉及音频信号编码方法和装置。 该方法包括:将音频信号分类为高频音频信号和低频音频信号; 根据低频音频信号的特性,通过使用相应的低频编码方式对低频音频信号进行编码; 以及根据低频编码方式和/或音频信号的特性来选择带宽扩展模式来对高频音频信号进行编码。

    Method and Apparatus for Decoding Speech/Audio Bitstream
    289.
    发明申请
    Method and Apparatus for Decoding Speech/Audio Bitstream 有权
    用于解码语音/音频比特流的方法和装置

    公开(公告)号:US20160343382A1

    公开(公告)日:2016-11-24

    申请号:US15197364

    申请日:2016-06-29

    Abstract: A method and an apparatus for decoding a speech/audio bitstream are disclosed, where the method for decoding a speech/audio bitstream includes determining whether a current frame is a normal decoding frame or a redundancy decoding frame, obtaining a decoded parameter of the current frame by means of parsing when the current frame is a normal decoding frame or a redundancy decoding frame, performing post-processing on the decoded parameter of the current frame to obtain a post-processed decoded parameter of the current frame, and using the post-processed decoded parameter of the current frame to reconstruct a speech/audio signal.

    Abstract translation: 公开了一种用于解码语音/音频比特流的方法和装置,其中用于解码语音/音频比特流的方法包括确定当前帧是正常解码帧还是冗余解码帧,获得当前帧的解码参数 通过当前帧是正常解码帧或冗余解码帧进行解析,对当前帧的解码参数执行后处理以获得当前帧的后处理解码参数,并且使用后处理 解码当前帧的参数以重构语音/音频信号。

    Method for predicting bandwidth extension frequency band signal, and decoding device
    290.
    发明授权
    Method for predicting bandwidth extension frequency band signal, and decoding device 有权
    预测带宽扩展频带信号的方法,以及解码装置

    公开(公告)号:US09361904B2

    公开(公告)日:2016-06-07

    申请号:US14806896

    申请日:2015-07-23

    CPC classification number: G10L19/12 G10L19/02 G10L19/08 G10L21/038

    Abstract: A method for predicting a bandwidth extension frequency band signal includes demultiplexing a received bitstream to obtain a frequency domain signal; determining whether a highest frequency bin, to which a bit is allocated, of the frequency domain signal is less than a preset start frequency bin of a bandwidth extension frequency band; predicting an excitation signal of the bandwidth extension frequency band according to the determination; and predicting the bandwidth extension frequency band signal according to the predicted excitation signal of the bandwidth extension frequency band and a frequency envelope of the bandwidth extension frequency band.

    Abstract translation: 一种用于预测带宽扩展频带信号的方法包括解复用接收的比特流以获得频域信号; 确定频域信号中哪一位被分配的最高频率仓是否小于带宽扩展频带的预设起始频率仓; 根据确定预测带宽扩展频带的激励信号; 以及根据带宽扩展频带的预测激励信号和带宽扩展频带的频率包络来预测带宽扩展频带信号。

Patent Agency Ranking