Perceptual, scalable audio compression
    1.
    发明授权
    Perceptual, scalable audio compression 有权
    感性,可扩展音频压缩

    公开(公告)号:US07835904B2

    公开(公告)日:2010-11-16

    申请号:US11367886

    申请日:2006-03-03

    IPC分类号: G10L19/02 H04B1/66

    CPC分类号: G10L19/24 G10L19/04

    摘要: The perceptual scalable audio coding/decoding technique lies in the use of a psychoacoustic mask to guide residue coding in enhancement layer coders. At the encoder, a psychoacoustic mask is calculated for the enhancement layer coders or is simply extracted from the coded base layer bitstream. One can also decode the coded base layer bitstream into the audio waveform, and calculate the psychoacoustic mask from the decoded base layer waveform. Furthermore, a predictive technology can be used to refine the psychoacoustic mask derived from the base layer bitstream to form a more accurate psychoacoustic mask of the enhancement layer. In addition, one can calculate the enhancement layer psychoacoustic mask from the original audio, and send the difference between the enhancement layer psychoacoustic mask and the base layer psychoacoustic mask as side information to the decoder. This psychoacoustic mask may then be used for the perceptual coding and decoding of the residue.

    摘要翻译: 感知可伸缩音频编码/解码技术在于使用心理声学掩模来指导增强层编码器中的残留编码。 在编码器处,为增强层编码器计算心理声学掩模,或简单地从编码的基本层比特流中提取。 还可以将编码的基本层比特流解码为音频波形,并从解码的基本层波形计算心理声学掩模。 此外,可以使用预测技术来改善从基本层比特流导出的心理声学掩模,以形成增强层的更准确的心理声学掩模。 另外,可以从原始音频计算增强层心理声学掩模,并将增强层心理声学掩模和基础层心理声学掩模之间的差异作为辅助信息发送给解码器。 然后可以将该心理声学掩模用于残留物的感知编码和解码。