Method and apparatus for encoding and decoding audio/speech signal
    11.
    发明授权
    Method and apparatus for encoding and decoding audio/speech signal 有权
    用于对音频/语音信号进行编码和解码的方法和装置

    公开(公告)号:US09418666B2

    公开(公告)日:2016-08-16

    申请号:US14132224

    申请日:2013-12-18

    CPC classification number: G10L19/00 G10L19/025

    Abstract: Provided is a method of encoding an audio/speech signal, the method including determining a variable length of a frame, that is, a processing unit of an input signal in accordance with a position of an attack in the input signal; transforming each frame of the input signal to a frequency domain and dividing the frame into a plurality of sub frequency bands; and, if a signal of a sub frequency band is determined to be encoded in the frequency domain, encoding the signal of the sub frequency band in the frequency domain, and if the signal of the sub frequency band is determined to be encoded in a time domain, inverse transforming the signal of the sub frequency band to the time domain and encoding the inverse transformed signal in the time domain. According to the present invention, the audio/speech signal may be efficiently encoded by controlling time resolution and frequency resolution.

    Abstract translation: 提供了一种对音频/语音信号进行编码的方法,该方法包括根据输入信号中的攻击位置来确定帧的可变长度,即输入信号的处理单元; 将所述输入信号的每个帧变换为频域并将所述帧划分为多个子频带; 并且如果确定在频域中编码子频带的信号,则对频域中的子频带的信号进行编码,并且如果确定子频带的信号被编码在一个时间 域,将子频带的信号逆变换到时域,并对时域中的逆变换信号进行编码。 根据本发明,可以通过控制时间分辨率和频率分辨率来有效地编码音频/语音信号。

    METHOD OF GENERATING MULTI-CHANNEL AUDIO SIGNAL AND APPARATUS FOR CARRYING OUT SAME
    17.
    发明申请
    METHOD OF GENERATING MULTI-CHANNEL AUDIO SIGNAL AND APPARATUS FOR CARRYING OUT SAME 有权
    产生多声道音频信号的方法和实现该方式的装置

    公开(公告)号:US20150117650A1

    公开(公告)日:2015-04-30

    申请号:US14515622

    申请日:2014-10-16

    CPC classification number: H04S7/302 H04S5/00 H04S7/30 H04S2400/01 H04S2400/11

    Abstract: A method of generating a multi-channel audio signal includes: representing locations of a plurality of speakers as a plurality of polygons whose vertices are located at locations of corresponding speakers; acquiring a location of an object sound; calculating distances between the plurality of polygons and the location of the object sound; selecting one of the plurality of polygons on the basis of the calculated distances; and generating a multi-channel audio signal that corresponds to speakers corresponding to the selected polygon by mapping the object sound to the speakers corresponding to the selected polygon.

    Abstract translation: 一种产生多声道音频信号的方法包括:将多个扬声器的位置表示为其顶点位于相应扬声器的位置处的多个多边形; 获取对象声音的位置; 计算多个多边形之间的距离和物体声音的位置; 基于计算出的距离来选择多个多边形中的一个; 以及通过将对象声音映射到与所选择的多边形相对应的扬声器来产生对应于与所选择的多边形相对应的扬声器的多声道音频信号。

    METHOD, MEDIUM, AND SYSTEM SCALABLY ENCODING/DECODING AUDIO/SPEECH
    18.
    发明申请
    METHOD, MEDIUM, AND SYSTEM SCALABLY ENCODING/DECODING AUDIO/SPEECH 有权
    方法,媒体和系统规范编码/解码音频/语音

    公开(公告)号:US20130030820A1

    公开(公告)日:2013-01-31

    申请号:US13645834

    申请日:2012-10-05

    CPC classification number: G10L19/24

    Abstract: A method, medium, and system scalably encoding/decoding audio/speech. The method includes splitting an input signal into a low frequency band signal that is lower than a predetermined frequency and a high frequency band signal that is higher than the predetermined frequency, scalably encoding the split low frequency band signal into a core layer and one or more extension layers and then decoding the encoded core layer and the encoded extension layers, generating an error signal by using the split low frequency band signal and a decoded signal of the encoded core layer and the encoded extension layers, and encoding the error signal and the high frequency band signal into a signal-to-noise ratio (SNR) enhancement layer and a bandwidth extension layer.

    Abstract translation: 一种方法,媒体和系统可扩展地编码/解码音频/语音。 该方法包括将输入信号分成低于预定频率的低频带信号和高于预定频率的高频带信号,将分割的低频带信号可扩展地编码为核心层和一个或多个 扩展层,然后对编码的核心层和编码的扩展层进行解码,通过使用分割的低频带信号和编码的核心层和经编码的扩展层的解码信号产生误差信号,并对误差信号和高 频带信号转换成信噪比(SNR)增强层和带宽扩展层。

Patent Agency Ranking