Scalable speech coding/decoding apparatus, method, and medium having mixed structure
    1.
    发明申请
    Scalable speech coding/decoding apparatus, method, and medium having mixed structure 有权
    可扩展语音编码/解码装置,方法和具有混合结构的介质

    公开(公告)号:US20070033023A1

    公开(公告)日:2007-02-08

    申请号:US11490139

    申请日:2006-07-21

    IPC分类号: G10L19/02

    摘要: Provided are a scalable wide-band speech coding/decoding apparatus, method, and medium. An input wide-band speech input signal is first divided into a low-band signal and a high-band signal. The divided low-band signal is then coded using a code excited linear prediction (CELP) method. The divided high-band signal is coded using a harmonic method. A signal representing a difference between a synthetic signal obtained from the low-band and the high band, and a signal input to the low-band and the high-band is then coded using a modified discrete cosine transform (MDCT) method. The coded signal is then multiplexed. The multiplexed signal is then output. Accordingly, high quality speech can be achieved for all layers.

    摘要翻译: 提供了一种可扩展的宽带语音编码/解码装置,方法和媒体。 输入宽带语音输入信号首先被分成低频带信号和高频带信号。 然后使用码激励线性预测(CELP)方法对分频的低频带信号进行编码。 分频高频信号采用谐波法编码。 然后,使用修正的离散余弦变换(MDCT)方法对表示从低频带和高频带获得的合成信号之间的差异以及输入到低频带和高频带的信号进行编码的信号。 然后对编码信号进行多路复用。 然后输出复用的信号。 因此,可以实现对所有层的高质量语音。

    Method, apparatus, and medium for classifying speech signal and method, apparatus, and medium for encoding speech signal using the same
    2.
    发明授权
    Method, apparatus, and medium for classifying speech signal and method, apparatus, and medium for encoding speech signal using the same 有权
    用于对语音信号进行分类的方法,装置和介质以及使用其编码语音信号的方法,装置和介质

    公开(公告)号:US08175869B2

    公开(公告)日:2012-05-08

    申请号:US11480449

    申请日:2006-07-05

    IPC分类号: G10L19/00

    CPC分类号: G10L19/22 G10L19/022

    摘要: A method, apparatus, and medium for classifying a speech signal and a method, apparatus, and medium for encoding the speech signal using the same are provided. The method for classifying a speech signal includes calculating classification parameters from an input signal having block units, calculating a plurality of classification criteria from the classification parameters, and classifying the level of the input signal using the plurality of classification criteria. The classification parameters include at least one of an energy parameter of the input signal, a cross-correlation parameter between a specific block of a present frame and the input signal, and an integrated cross-correlation parameter obtained by accumulating the cross-correlation parameter.

    摘要翻译: 提供了一种用于对语音信号进行分类的方法,装置和媒体,以及使用该语音信号编码语音信号的方法,装置和媒体。 用于分类语音信号的方法包括从具有块单位的输入信号计算分类参数,从分类参数计算多个分类标准,以及使用多个分类标准对输入信号的等级进行分类。 分类参数包括输入信号的能量参数,当前帧的特定块与输入信号之间的互相关参数,以及通过累加互相关参数而获得的积分互相关参数中的至少一个。

    Audio coding and decoding apparatuses and methods, and recording media storing the methods
    3.
    发明申请
    Audio coding and decoding apparatuses and methods, and recording media storing the methods 失效
    音频编码和解码装置和方法,以及存储方法的记录介质

    公开(公告)号:US20060217975A1

    公开(公告)日:2006-09-28

    申请号:US11337487

    申请日:2006-01-24

    IPC分类号: G10L21/02

    CPC分类号: G10L19/0208 G10L19/093

    摘要: Audio coding and decoding apparatuses and methods which support fine granularity scalability (FGS) using harmonic information of a high-band audio signal or wideband error audio signal when performing wideband audio coding and decoding, and recording mediums on which the methods are stored. The audio coding method includes detecting harmonics of a high-band audio signal or wideband error audio signal of an input audio signal; determining an order of the detected harmonics; and coding the detected harmonics based on the determined order.

    摘要翻译: 在执行宽带音频编码和解码时使用高频带音频信号或宽带误差音频信号的谐波信息支持精细粒度可伸缩性(FGS)的音频编码和解码装置和方法,以及存储方法的记录介质。 音频编码方法包括检测输入音频信号的高频带音频信号或宽带误差音频信号的谐波; 确定检测到的谐波的顺序; 并根据确定的顺序对检测到的谐波进行编码。

    Scalable speech coding/decoding apparatus, method, and medium having mixed structure
    4.
    发明授权
    Scalable speech coding/decoding apparatus, method, and medium having mixed structure 有权
    可扩展语音编码/解码装置,方法和具有混合结构的介质

    公开(公告)号:US08271267B2

    公开(公告)日:2012-09-18

    申请号:US11490139

    申请日:2006-07-21

    IPC分类号: G10L21/00

    摘要: Provided are a scalable wide-band speech coding/decoding apparatus, method, and medium. An input wide-band speech input signal is first divided into a low-band signal and a high-band signal. The divided low-band signal is then coded using a code excited linear prediction (CELP) method. The divided high-band signal is coded using a harmonic method. A signal representing a difference between a synthetic signal obtained from the low-band and the high band, and a signal input to the low-band and the high-band is then coded using a modified discrete cosine transform (MDCT) method. The coded signal is then multiplexed. The multiplexed signal is then output. Accordingly, high quality speech can be achieved for all layers.

    摘要翻译: 提供了一种可扩展的宽带语音编码/解码装置,方法和媒体。 输入宽带语音输入信号首先被分成低频带信号和高频带信号。 然后使用码激励线性预测(CELP)方法对分频的低频带信号进行编码。 分频高频信号采用谐波法编码。 然后,使用修正的离散余弦变换(MDCT)方法对表示从低频带和高频带获得的合成信号之间的差异以及输入到低频带和高频带的信号进行编码的信号。 然后对编码信号进行多路复用。 然后输出复用的信号。 因此,可以实现对所有层的高质量语音。

    Band based audio coding and decoding apparatuses, methods, and recording media for scalability
    5.
    发明授权
    Band based audio coding and decoding apparatuses, methods, and recording media for scalability 失效
    基于频带的音频编码和解码装置,方法和可扩展性的记录介质

    公开(公告)号:US08015017B2

    公开(公告)日:2011-09-06

    申请号:US11337487

    申请日:2006-01-24

    IPC分类号: G10L19/00

    CPC分类号: G10L19/0208 G10L19/093

    摘要: Audio coding and decoding apparatuses and methods which support fine granularity scalability (FGS) using harmonic information of a high-band audio signal or wideband error audio signal when performing wideband audio coding and decoding, and recording mediums on which the methods are stored. The audio coding method includes detecting harmonics of a high-band audio signal or wideband error audio signal of an input audio signal; determining an order of the detected harmonics; and coding the detected harmonics based on the determined order.

    摘要翻译: 在执行宽带音频编码和解码时使用高频带音频信号或宽带误差音频信号的谐波信息支持精细粒度可伸缩性(FGS)的音频编码和解码装置和方法,以及存储方法的记录介质。 音频编码方法包括检测输入音频信号的高频带音频信号或宽带误差音频信号的谐波; 确定检测到的谐波的顺序; 并根据确定的顺序对检测到的谐波进行编码。

    Scalable audio encoding and/or decoding method and apparatus
    6.
    发明申请
    Scalable audio encoding and/or decoding method and apparatus 审中-公开
    可扩展音频编码和/或解码方法和装置

    公开(公告)号:US20070040709A1

    公开(公告)日:2007-02-22

    申请号:US11485468

    申请日:2006-07-13

    IPC分类号: H03M7/00

    CPC分类号: G10L19/0208

    摘要: A method and apparatus to scalably encode and/or decode an audio signal includes encoding a specific band signal included in an input signal, encoding a frequency envelope of an excited signal in which the encoded specific band signal is removed from the input signal, encoding a residual signal in which the encoded frequency envelope is removed from the excited signal, and forming a bit-stream by scalably packing the encoded specific band signal, frequency envelop, and residual signal.

    摘要翻译: 一种用于对音频信号进行可缩放编码和/或解码的方法和装置包括编码包括在输入信号中的特定频带信号,对其中去除编码的特定频带信号的激励信号的频率包络进行编码, 残留信号,其中编码的频率包络从激励信号中去除,以及通过可扩展地打包编码的特定频带信号,频率包络和残余信号来形成比特流。

    Method and apparatus to quantize/dequantize frequency amplitude data and method and apparatus to audio encode/decode using the method and apparatus to quantize/dequantize frequency amplitude data
    7.
    发明授权
    Method and apparatus to quantize/dequantize frequency amplitude data and method and apparatus to audio encode/decode using the method and apparatus to quantize/dequantize frequency amplitude data 有权
    使用该方法和装置来量化/去量化频率振幅数据的方法和装置以及方法和装置进行音频编码/解码以量化/去量化频率振幅数据

    公开(公告)号:US07805314B2

    公开(公告)日:2010-09-28

    申请号:US11471635

    申请日:2006-06-21

    IPC分类号: G10L19/00 G10L19/02

    摘要: A method and apparatus to quantize/dequantize frequency amplitude data and a method and apparatus to audio encode/decode using the method and apparatus to quantize/dequantize the frequency amplitude data. The method includes calculating and quantizing power of frequency amplitudes for each of a plurality of bands constituting an audio frame, normalizing frequency amplitude data for each of the bands using the quantized power, and quantizing a first one of even-numbered or odd-numbered data among the normalized frequency amplitude data. The method may further include interpolating frequency amplitude data that corresponds to a second one of the even-numbered or odd-numbered frequency amplitude that is not quantized from among the normalized frequency amplitude data using the quantized first one of the even-numbered or odd-numbered data, and quantizing an interpolation error corresponding to a difference between the second frequency amplitude data that is not quantized and the interpolated frequency amplitude data.

    摘要翻译: 一种用于量化/去量化频率幅度数据的方法和装置以及使用该方法和装置对频率振幅数据进行量化/去量化的音频编码/解码的方法和装置。 该方法包括:计算和量化构成音频帧的多个频带中的每个频带的频率幅度的功率,使用量化功率归一化每个频带的频率振幅数据,以及量化偶数或奇数数据中的第一个 在归一化的频率振幅数据中。 该方法可以进一步包括使用偶数或奇数编号的量化的第一个量化的归一化频率幅度数据中对应于未被量化的偶数或奇数频率振幅中的第二频率振幅数据, 量化与未量化的第二频率振幅数据和内插频率振幅数据之间的差对应的内插误差。

    Method, apparatus, and medium for classifying speech signal and method, apparatus, and medium for encoding speech signal using the same
    8.
    发明申请
    Method, apparatus, and medium for classifying speech signal and method, apparatus, and medium for encoding speech signal using the same 有权
    用于对语音信号进行分类的方法,装置和介质以及使用其编码语音信号的方法,装置和介质

    公开(公告)号:US20070038440A1

    公开(公告)日:2007-02-15

    申请号:US11480449

    申请日:2006-07-05

    IPC分类号: G10L19/00

    CPC分类号: G10L19/22 G10L19/022

    摘要: A method, apparatus, and medium for classifying a speech signal and a method, apparatus, and medium for encoding the speech signal using the same are provided. The method for classifying a speech signal includes calculating classification parameters from an input signal having block units, calculating a plurality of classification criteria from the classification parameters, and classifying the level of the input signal using the plurality of classification criteria. The classification parameters include at least one of an energy parameter of the input signal, a cross-correlation parameter between a specific block of a present frame and the input signal, and an integrated cross-correlation parameter obtained by accumulating the cross-correlation parameter.

    摘要翻译: 提供了一种用于对语音信号进行分类的方法,装置和媒体,以及使用该语音信号编码语音信号的方法,装置和媒体。 用于分类语音信号的方法包括从具有块单位的输入信号计算分类参数,从分类参数计算多个分类标准,以及使用多个分类标准对输入信号的等级进行分类。 分类参数包括输入信号的能量参数,当前帧的特定块与输入信号之间的互相关参数,以及通过累加互相关参数而获得的积分互相关参数中的至少一个。

    Method and apparatus to quantize/dequantize frequency amplitude data and method and apparatus to audio encode/decode using the method and apparatus to quantize/dequantize frequency amplitude data
    9.
    发明申请
    Method and apparatus to quantize/dequantize frequency amplitude data and method and apparatus to audio encode/decode using the method and apparatus to quantize/dequantize frequency amplitude data 有权
    使用该方法和装置来量化/去量化频率振幅数据的方法和装置以及方法和装置进行音频编码/解码以量化/去量化频率振幅数据

    公开(公告)号:US20070016417A1

    公开(公告)日:2007-01-18

    申请号:US11471635

    申请日:2006-06-21

    IPC分类号: G10L19/00

    摘要: A method and apparatus to quantize/dequantize frequency amplitude data and a method and apparatus to audio encode/decode using the method and apparatus to quantize/dequantize the frequency amplitude data. The method includes calculating and quantizing power of frequency amplitudes for each of a plurality of bands constituting an audio frame, normalizing frequency amplitude data for each of the bands using the quantized power, and quantizing a first one of even-numbered or odd-numbered data among the normalized frequency amplitude data. The method may further include interpolating frequency amplitude data that corresponds to a second one of the even-numbered or odd-numbered frequency amplitude that is not quantized from among the normalized frequency amplitude data using the quantized first one of the even-numbered or odd-numbered data, and quantizing an interpolation error corresponding to a difference between the second frequency amplitude data that is not quantized and the interpolated frequency amplitude data.

    摘要翻译: 一种用于量化/去量化频率幅度数据的方法和装置以及使用该方法和装置对频率振幅数据进行量化/去量化的音频编码/解码的方法和装置。 该方法包括:计算和量化构成音频帧的多个频带中的每个频带的频率幅度的功率,使用量化功率归一化每个频带的频率振幅数据,以及量化偶数或奇数数据中的第一个 在归一化的频率振幅数据中。 该方法可以进一步包括使用偶数或奇数编号的量化的第一个量化的归一化频率幅度数据中对应于未被量化的偶数或奇数频率振幅中的第二频率振幅数据, 量化与未量化的第二频率振幅数据和内插频率振幅数据之间的差对应的内插误差。

    Audio coding and decoding apparatuses and methods, and recording mediums storing the methods
    10.
    发明申请
    Audio coding and decoding apparatuses and methods, and recording mediums storing the methods 审中-公开
    音频编码和解码装置和方法以及存储方法的记录介质

    公开(公告)号:US20060206316A1

    公开(公告)日:2006-09-14

    申请号:US11333342

    申请日:2006-01-18

    IPC分类号: G10L11/04

    摘要: Audio coding and decoding apparatuses and methods that can optimize the quality of an audio signal including harmonics, and recording mediums storing the methods. An audio coding apparatus includes: a first harmonic coding module performing first harmonic coding on an input audio signal using a pitch lag of the input audio signal and producing a quantized linear prediction coding coefficient; a first detector detecting a first difference audio signal from a difference between an audio signal output from the first harmonic coding module and the input audio signal; a second harmonic coding module performing harmonic coding on the first difference audio signal using the quantized linear prediction coding coefficient and a previous harmonic coding result; a second detector detecting a second difference audio signal obtained from a difference between an audio signal output from the second harmonic coding module and the first difference audio signal; and a code excited linear prediction (CELP) module CELP coding the second difference audio signal using the quantized linear prediction coding coefficient obtained from the first harmonic coding module.

    摘要翻译: 可以优化包括谐波的音频信号的质量的音频编码和解码装置和方法,以及存储方法的记录介质。 音频编码装置包括:第一谐波编码模块,使用输入音频信号的音调滞后对输入音频信号执行一次谐波编码,并产生量化的线性预测编码系数; 第一检测器,从第一谐波编码模块输出的音频信号与输入音频信号之间的差检测第一差分音频信号; 使用量化线性预测编码系数和先前的谐波编码结果对第一差分音频信号执行谐波编码的二次谐波编码模块; 第二检测器,检测从二次谐波编码模块输出的音频信号与第一差音频信号之间的差获得的第二差分音频信号; 以及使用从第一谐波编码模块获得的量化线性预测编码系数对第二差音频信号进行编码的码激励线性预测(CELP)模块CELP。