SPATIAL AUDIO PARAMETER ENCODING AND ASSOCIATED DECODING

    公开(公告)号:US20230335141A1

    公开(公告)日:2023-10-19

    申请号:US17788155

    申请日:2020-12-11

    CPC classification number: G10L19/008 G10L19/032

    Abstract: An apparatus comprising means configured to: obtain at least one parameter value (106) associated with at least two time-frequency parts of at least one audio signal (104); obtain at least one similarity value based on the at least one parameter value (106) associated with the at least two time-frequency parts of at least one audio signal (104); determine at least one group of time-frequency parts from the at least two time-frequency parts of at least one audio signal (104), the at least one group of time-frequency parts based on the at least one similarity value; and generate for the at least one group of time-frequency parts at least one associated group parameter (204), the at least one group parameter (204) based on the at least one parameter value (106) associated with the time-frequency parts.

    THE MERGING OF SPATIAL AUDIO PARAMETERS
    12.
    发明公开

    公开(公告)号:US20230197086A1

    公开(公告)日:2023-06-22

    申请号:US17786088

    申请日:2020-11-13

    CPC classification number: G10L19/008 H04S7/302 H04S2420/03

    Abstract: There is inter alia disclosed an apparatus for spatial audio encoding comprising: means for determining at least two of a type of spatial audio parameter for one or more audio signals, wherein a first of the type of spatial audio parameter is associated with a first group of samples in a domain of the one or more audio signals and a second of the type of spatial audio parameter is associated with a second group of samples in the domain of the one or more audio signals; and means for merging the first of the type of spatial audio parameter and the second of the type of spatial audio parameter into a merged spatial audio parameter.

    SOUND SOURCE DISTANCE ESTIMATION
    13.
    发明申请

    公开(公告)号:US20200217919A1

    公开(公告)日:2020-07-09

    申请号:US16626242

    申请日:2018-06-13

    Abstract: An apparatus for generating at least one distance estimate to at least one sound source within a sound scene comprising the least one sound source, the apparatus configured to: receive at least two audio signals from a microphone array located within the sound scene; receive at least one further audio signal associated with the at least one sound source; determine at least one portion of the at least two audio signals from a microphone array corresponding to the at least one further audio signal associated with the at least one sound source; determine a distance estimate to the at least one sound source based on the at least one portion of the at least two audio signals from a microphone array corresponding to the at least one further audio signal associated with the at least one sound source.

    ENERGY-RATIO SIGNALLING AND SYNTHESIS
    14.
    发明申请

    公开(公告)号:US20200015028A1

    公开(公告)日:2020-01-09

    申请号:US16502838

    申请日:2019-07-03

    Abstract: An apparatus comprising at least one processor and at least one memory including a computer program code, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus at least to: receive at least one audio signal; obtain, associated with the at least one audio signal over at least one frequency band: at least one spatial audio energy ratio parameter; and at least one remainder energy ratio, wherein a sum of the at least one spatial audio energy ratio parameter and the at least one remainder energy ratio over the frequency band equal a determined value; and control a transmission/storage of the at least one spatial audio energy ratio, and the at least one remainder energy ratio.

    QUANTIZING SPATIAL AUDIO PARAMETERS
    16.
    发明公开

    公开(公告)号:US20230335143A1

    公开(公告)日:2023-10-19

    申请号:US18044666

    申请日:2021-08-19

    CPC classification number: G10L19/008 H04S7/305

    Abstract: There is inter alia disclosed an apparatus for spatial audio encoding configured to convert two or more energy ratios associated with a time frequency tile of one or more audio signals to a further energy ratio parameter which is related to the two or more energy ratios; quantize the further energy ratio parameter using a first quantizer; determine a distribution factor of energy ratios dependent on a ratio of a first of the two or more energy ratios to the sum of the two or more energy ratios; select a further quantizer from a plurality of further quantizers using the quantized further energy ratio parameter; and quantize the distribution factor of energy ratios using the selected further quantizer.

    SOUND SOURCE DISTANCE ESTIMATION
    17.
    发明公开

    公开(公告)号:US20230273290A1

    公开(公告)日:2023-08-31

    申请号:US18313613

    申请日:2023-05-08

    Abstract: An apparatus for generating at least one distance estimate to at least one sound source within a sound scene comprising the least one sound source, the apparatus configured to: receive at least two audio signals from a microphone array located within the sound scene; receive at least one further audio signal associated with the at least one sound source; determine at least one portion of the at least two audio signals from a microphone array corresponding to the at least one further audio signal associated with the at least one sound source; determine a distance estimate to the at least one sound source based on the at least one portion of the at least two audio signals from a microphone array corresponding to the at least one further audio signal associated with the at least one sound source.

    An Apparatus and Method for Processing Volumetric Audio

    公开(公告)号:US20210375258A1

    公开(公告)日:2021-12-02

    申请号:US16768968

    申请日:2018-11-29

    Abstract: A method including receiving an audio scene including at least one source captured using at least one near field microphone and at least one far field microphone. The method includes determining at least one room-impulse-response associated with the audio scene based on the at least one near field microphone and the at least one far field microphone, accessing a predetermined scene geometry corresponding to the audio scene, and identifying best match to the predetermined scene geometry in a scene geometry database. The method also includes performing RIR comparison based on the at least one RIR and at least one geometric RIR associated with the best matching geometry and rendering a volumetric audio scene based on a result of the RIR comparison.

Patent Agency Ranking