Coding vectors decomposed from higher-order ambisonics audio signals

    公开(公告)号:US09852737B2

    公开(公告)日:2017-12-26

    申请号:US14712836

    申请日:2015-05-14

    CPC classification number: G10L19/038 G10L19/008 G10L2019/0001

    Abstract: In general, techniques are described for coding of vectors decomposed from higher order ambisonic coefficients. A device comprising a processor and a memory may perform the techniques. The processor may be configured to obtain from a bitstream data indicative of a plurality of weight values that represent a vector that is included in a decomposed version of the plurality of HOA coefficients. Each of the weight values may correspond to a respective one of a plurality of weights in a weighted sum of code vectors that represents the vector and that includes a set of code vectors. The processor may further be configured to reconstruct the vector based on the weight values and the code vectors. The memory may be configured to store the reconstructed vector.

    Scalable downmix design for object-based surround codec with cluster analysis by synthesis
    136.
    发明授权
    Scalable downmix design for object-based surround codec with cluster analysis by synthesis 有权
    基于对象的环绕编解码器的可缩放缩混设计,通过综合进行聚类分析

    公开(公告)号:US09516446B2

    公开(公告)日:2016-12-06

    申请号:US13945811

    申请日:2013-07-18

    Abstract: In general, techniques are described for grouping audio objects into clusters. In some examples, a device for audio signal processing comprises a cluster analysis module configured to, based on a plurality of audio objects, produce a first grouping of the plurality of audio objects into L clusters, wherein the first grouping is based on spatial information from at least N among the plurality of audio objects and L is less than N. The device also includes an error calculator configured to calculate an error of the first grouping relative to the plurality of audio objects, wherein the error calculator is further configured to, based on the calculated error, produce a plurality L of audio streams according to a second grouping of the plurality of audio objects into L clusters that is different from the first grouping.

    Abstract translation: 一般来说,描述了将音频对象分组成簇的技术。 在一些示例中,用于音频信号处理的设备包括:聚类分析模块,被配置为基于多个音频对象产生所述多个音频对象的第一分组为L个群集,其中所述第一分组基于来自 多个音频对象中的至少N个,L小于N.该设备还包括错误计算器,其被配置为计算相对于多个音频对象的第一分组的错误,其中,误差计算器还被配置为基于 在计算出的误差上,根据与第一分组不同的多个音频对象的第二分组,产生多个音频流。

    Loudspeaker position compensation with 3D-audio hierarchical coding
    139.
    发明授权
    Loudspeaker position compensation with 3D-audio hierarchical coding 有权
    扬声器位置补偿与3D音频分层编码

    公开(公告)号:US09473870B2

    公开(公告)日:2016-10-18

    申请号:US13942657

    申请日:2013-07-15

    Inventor: Dipanjan Sen

    CPC classification number: H04S3/006 H04S3/002 H04S7/30 H04S2400/03 H04S2420/11

    Abstract: In general, techniques are described for compensating for loudspeaker positions using hierarchical three-dimensional (3D) audio coding. An apparatus comprising or more processors may perform the techniques. The processors may be configured to perform a first transform that is based on a spherical wave model on a first set of audio channel information for a first geometry of speakers to generate a first hierarchical set of elements that describes a sound field. The processors may further be configured to perform a second transform in a frequency domain on the first hierarchical set of elements to generate a second set of audio channel information for a second geometry of speakers.

    Abstract translation: 通常,描述了使用分层三维(3D)音频编码来补偿扬声器位置的技术。 包括或多个处理器的装置可以执行这些技术。 处理器可以被配置为执行基于用于扬声器的第一几何形状的第一组音频信道信息上的球面波模型的第一变换,以生成描述声场的元素的第一分层集合。 处理器还可以被配置为在第一分层元件组上的频域中执行第二变换,以生成用于扬声器的第二几何形状的第二组音频通道信息。

    Coding of spherical harmonic coefficients
    140.
    发明授权
    Coding of spherical harmonic coefficients 有权
    球谐函数的编码

    公开(公告)号:US09466302B2

    公开(公告)日:2016-10-11

    申请号:US14479752

    申请日:2014-09-08

    CPC classification number: G10L19/008

    Abstract: In general, techniques are described for coding of spherical harmonic coefficients representative of a three dimensional soundfield. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may be configured to store a plurality of spherical harmonic coefficients. The one or more processors may be configured to perform an energy analysis with respect to the plurality of spherical harmonic coefficients to determine a reduced version of the plurality of spherical harmonic coefficients.

    Abstract translation: 通常,描述了代表三维声场的球谐函数的编码技术。 包括存储器和一个或多个处理器的设备可以被配置为执行这些技术。 存储器可以被配置为存储多个球谐函数。 一个或多个处理器可以被配置为执行关于多个球谐函数的能量分析以确定多个球谐函数的简化版本。

Patent Agency Ranking