SPATIAL AUDIO PARAMETER ENCODING AND ASSOCIATED DECODING

    公开(公告)号:US20230335141A1

    公开(公告)日:2023-10-19

    申请号:US17788155

    申请日:2020-12-11

    CPC classification number: G10L19/008 G10L19/032

    Abstract: An apparatus comprising means configured to: obtain at least one parameter value (106) associated with at least two time-frequency parts of at least one audio signal (104); obtain at least one similarity value based on the at least one parameter value (106) associated with the at least two time-frequency parts of at least one audio signal (104); determine at least one group of time-frequency parts from the at least two time-frequency parts of at least one audio signal (104), the at least one group of time-frequency parts based on the at least one similarity value; and generate for the at least one group of time-frequency parts at least one associated group parameter (204), the at least one group parameter (204) based on the at least one parameter value (106) associated with the time-frequency parts.

    SPATIAL AUDIO PARAMETER ENCODING AND ASSOCIATED DECODING

    公开(公告)号:US20230197087A1

    公开(公告)日:2023-06-22

    申请号:US17998866

    申请日:2021-04-15

    CPC classification number: G10L19/008 G10L19/032 G10L25/21 G10L2019/0004

    Abstract: An apparatus comprising means configured to: obtain at least one direction parameter value for a time-frequency part of at least one audio signal (301); obtain at least one energy ratio for the time-frequency part (301), wherein each energy ratio is associated with a respective direction parameter value; generate respective at least one modified energy ratio from the at least one energy ratio for the time-frequency part (304); determine a quantization spatial resolution for encoding the at least one obtained direction parameter value based on the at least one modified energy ratio (305); and encode the obtained direction parameter values based on the quantization spatial resolution (306).

    THE MERGING OF SPATIAL AUDIO PARAMETERS
    3.
    发明公开

    公开(公告)号:US20230197086A1

    公开(公告)日:2023-06-22

    申请号:US17786088

    申请日:2020-11-13

    CPC classification number: G10L19/008 H04S7/302 H04S2420/03

    Abstract: There is inter alia disclosed an apparatus for spatial audio encoding comprising: means for determining at least two of a type of spatial audio parameter for one or more audio signals, wherein a first of the type of spatial audio parameter is associated with a first group of samples in a domain of the one or more audio signals and a second of the type of spatial audio parameter is associated with a second group of samples in the domain of the one or more audio signals; and means for merging the first of the type of spatial audio parameter and the second of the type of spatial audio parameter into a merged spatial audio parameter.

    METHOD AND APPARATUS FOR LATTICE VECTOR QUANTIZATION OF AN AUDIO SIGNAL
    6.
    发明申请
    METHOD AND APPARATUS FOR LATTICE VECTOR QUANTIZATION OF AN AUDIO SIGNAL 审中-公开
    方法和装置用于音频信号的矢量矢量量化

    公开(公告)号:US20160019900A1

    公开(公告)日:2016-01-21

    申请号:US14763497

    申请日:2013-02-01

    CPC classification number: G10L19/008 G10L19/038

    Abstract: An apparatus comprising: a vector generator configured to generate a first vector of parameters defining at least one audio signal; a vector extender configured to extend the first vector of parameters to a second vector, where the first vector is length n and the second vector is length n, where m is greater than n; a vector transformer configured to transform the second vector, a lattice quantizer configured to lattice quantize the transformed second vector; and a reverse transformer configured to reverse transform the lattice quantized transformed second vector, such that the first n components of a reverse transformed lattice quantized transformed second vector are a lattice quantization of the first vector.

    Abstract translation: 一种装置,包括:矢量发生器,被配置为生成定义至少一个音频信号的参数的第一矢量; 向量扩展器,被配置为将第一向量参数扩展到第二向量,其中第一向量是长度n,第二向量是长度n,其中m大于n; 被配置为变换所述第二矢量的矢量变换器,被配置为对经变换的第二矢量进行点阵量化的晶格量化器; 以及逆变换器,被配置为反转变换所述格子量化变换的第二矢量,使得反向变换的晶格量化变换的第二矢量的前n个分量是所述第一矢量的晶格量化。

    QUANTIZING SPATIAL AUDIO PARAMETERS
    8.
    发明公开

    公开(公告)号:US20230335143A1

    公开(公告)日:2023-10-19

    申请号:US18044666

    申请日:2021-08-19

    CPC classification number: G10L19/008 H04S7/305

    Abstract: There is inter alia disclosed an apparatus for spatial audio encoding configured to convert two or more energy ratios associated with a time frequency tile of one or more audio signals to a further energy ratio parameter which is related to the two or more energy ratios; quantize the further energy ratio parameter using a first quantizer; determine a distribution factor of energy ratios dependent on a ratio of a first of the two or more energy ratios to the sum of the two or more energy ratios; select a further quantizer from a plurality of further quantizers using the quantized further energy ratio parameter; and quantize the distribution factor of energy ratios using the selected further quantizer.

    SPATIAL AUDIO PARAMETER ENCODING AND ASSOCIATED DECODING

    公开(公告)号:US20230047237A1

    公开(公告)日:2023-02-16

    申请号:US17791115

    申请日:2020-12-07

    Abstract: An apparatus comprising means configured to obtain direction parameter values (108) associated with at least two time-frequency parts (202) of at least one audio signal (102); and encode the obtained direction parameter values based on a codebook (206), wherein the codebook comprises two or more quantization levels arranged such that a first quantization level comprises a first set of quantization values, and a second or succeeding quantization level comprises a second or further set of quantization values and preceding quantization level quantization values.

    DETERMINATION OF SPATIAL AUDIO PARAMETER ENCODING AND ASSOCIATED DECODING

    公开(公告)号:US20220343928A1

    公开(公告)日:2022-10-27

    申请号:US17642288

    申请日:2020-09-09

    Abstract: An apparatus comprising means configured to: generate spatial audio signal directional metadata parameters for a block of time-frequencies; generate encoded spatial audio signal directional metadata parameters (108) for a block of time-frequencies based on a first quantization resolution (203); compare a number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution against a determined number of bits; output or store the encoded spatial audio signal directional metadata parameters for a block of time-frequencies (108) based on a first quantization resolution when the number of bits used for the encoded spatial audio signal directional parameters for the block of time-frequencies (108) based on the first quantization resolution is less than a determined number of bits (217); generate encoded spatial audio signal directional metadata parameters (108) for the block of time-frequencies based on a second quantization resolution when the number of bits used for the encoded spatial audio signal directional parameters for the block of time-frequencies (108) based on the first quantization resolution is more than the determined number of bits and a difference between the determined number of bits and the number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution is less than a determined number of bits is within a determined threshold (217); generate encoded spatial audio signal directional metadata parameters (108) for the block of time-frequencies based on a third quantization resolution when the number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution is more than the determined number of bits and the difference between the determined number of bits and the number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution is greater than the determined threshold, wherein the third quantization resolution is determined such that a number of bits used for the encoded spatial audio signal directional parameters for the block of time-frequencies based on the third quantization resolution is always equal to or less than the determined number of bits (217).

Patent Agency Ranking