Apparatus, Methods and Computer Programs for Enabling Rendering of Spatial Audio

    Publication No.: US20250071497A1

    Publication date: 2025-02-27

    Application No.: US18723930

    Filing date: 2022-12-09

    Abstract: Examples of the disclosure enable spatial audio rendering in a format different from the one used for the spatial audio coding. In examples of the disclosure, spatial audio and first spatial metadata in a first format are obtained. The first spatial metadata enables rendering of the spatial audio in a first audio format. To enable rendering of the spatial audio in a different format, the first spatial metadata is converted to second spatial metadata corresponding to a second audio format. The spatial audio can then be rendered for the second format using the second spatial metadata.

    STEREO AUDIO SIGNAL PROCESSING METHOD, ENCODING DEVICE, AND STORAGE MEDIUM

    Publication No.: US20250024216A1

    Publication date: 2025-01-16

    Application No.: US18715679

    Filing date: 2021-12-03

    Inventor: Shuo GAO

    Abstract: A method for processing a stereo audio signal, performed by an encoding device, includes: determining an initial first threshold Thresh01 and an initial second threshold Thresh02 of a current frame of the stereo audio signal, where Thresh01∈(−1,0), and Thresh02∈(0,1); determining an offset value Delta; determining a first threshold Thresh1 and a second threshold Thresh2 corresponding to the current frame of the stereo audio signal according to a de-correlation manner for a previous frame of the stereo audio signal, the offset value Delta, the initial first threshold Thresh01 of the current frame, and the initial second threshold Thresh02 of the current frame; and performing de-correlation on the current frame according to the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame.
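
The abstract names the inputs to the threshold update (the previous frame's de-correlation manner, Delta, Thresh01 and Thresh02) but not the update rule itself. The Python sketch below shows one plausible reading, a hysteresis scheme that shifts both thresholds so the previous frame's choice is slightly favoured. The manner labels ("low", "mid", "high"), the direction of the shift, and the `choose_manner` decision on a correlation-like value in (-1, 1) are all assumptions made for illustration, not details taken from the patent.

```python
def frame_thresholds(thresh01, thresh02, delta, prev_manner):
    """Derive per-frame thresholds from the initial ones (hypothetical rule).

    thresh01 must lie in (-1, 0) and thresh02 in (0, 1), as in the abstract.
    prev_manner is an assumed label for the previous frame's choice.
    """
    if not (-1 < thresh01 < 0 and 0 < thresh02 < 1):
        raise ValueError("initial thresholds outside the ranges in the abstract")
    if prev_manner == "low":
        # Assumed: raise both thresholds so the "low" region grows.
        return thresh01 + delta, thresh02 + delta
    if prev_manner == "high":
        # Assumed: lower both thresholds so the "high" region grows.
        return thresh01 - delta, thresh02 - delta
    return thresh01, thresh02


def choose_manner(value, thresh1, thresh2):
    """Pick a manner for the current frame from an assumed correlation-like value."""
    if value < thresh1:
        return "low"
    if value > thresh2:
        return "high"
    return "mid"


# The same input value yields different decisions depending on the previous frame.
t1, t2 = frame_thresholds(-0.5, 0.5, 0.125, "low")
print(choose_manner(-0.4, t1, t2))
t1, t2 = frame_thresholds(-0.5, 0.5, 0.125, "mid")
print(choose_manner(-0.4, t1, t2))
```

Under these assumptions, a borderline frame keeps the previous frame's manner instead of toggling, which is the usual motivation for making thresholds depend on the previous decision.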

    BINAURAL DIALOGUE ENHANCEMENT
    Type: Invention application

    Publication No.: US20240406650A1

    Publication date: 2024-12-05

    Application No.: US18606040

    Filing date: 2024-03-15

    Abstract: Methods for dialogue enhancing audio content, comprising providing a first audio signal presentation of the audio components, providing a second audio signal presentation, receiving a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation, applying said set of dialogue estimation parameters to said first audio signal presentation, to form a dialogue presentation of the dialogue components; and combining the dialogue presentation with said second audio signal presentation to form a dialogue enhanced audio signal presentation for reproduction on the second audio reproduction system, wherein at least one of said first and second audio signal presentation is a binaural audio signal presentation.
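
The two steps in this abstract, estimating dialogue from the first presentation and adding it to the second, can be sketched in Python as follows. The sketch assumes that the dialogue estimation parameters form a plain mixing matrix applied to time-domain samples, and that the dialogue estimate already has the channel layout of the second presentation. The function names and the `gain` argument are hypothetical, and a real codec would more likely apply such parameters per time-frequency tile.

```python
def apply_matrix(params, channels):
    """Mix input channels: out[i][n] = sum over j of params[i][j] * channels[j][n]."""
    n_samples = len(channels[0])
    return [
        [sum(row[j] * channels[j][n] for j in range(len(channels)))
         for n in range(n_samples)]
        for row in params
    ]


def enhance_dialogue(first_presentation, second_presentation, params, gain=1.0):
    """Estimate dialogue from the first presentation and add it to the second.

    params: assumed mixing matrix with one row per channel of the second presentation.
    gain:   assumed linear dialogue boost.
    """
    dialogue = apply_matrix(params, first_presentation)
    return [
        [sample + gain * d for sample, d in zip(out_ch, dlg_ch)]
        for out_ch, dlg_ch in zip(second_presentation, dialogue)
    ]


first = [[1.0, 2.0], [3.0, 4.0]]       # e.g. a stereo presentation
second = [[0.5, 0.5], [0.25, 0.25]]    # e.g. a binaural presentation
params = [[0.5, 0.5], [0.5, 0.5]]      # hypothetical estimation parameters
print(enhance_dialogue(first, second, params, gain=2.0))
```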

    Representing spatial audio by means of an audio signal and associated metadata

    Publication No.: US12156012B2

    Publication date: 2024-11-26

    Application No.: US18465636

    Filing date: 2023-09-12

    Inventor: Stefan Bruhn

    Abstract: There is provided encoding and decoding methods for representing spatial audio that is a combination of directional sound and diffuse sound. An exemplary encoding method includes inter alia creating a single- or multi-channel downmix audio signal by downmixing input audio signals from a plurality of microphones in an audio capture unit capturing the spatial audio; determining first metadata parameters associated with the downmix audio signal, wherein the first metadata parameters are indicative of one or more of: a relative time delay value, a gain value, and a phase value associated with each input audio signal; and combining the created downmix audio signal and the first metadata parameters into a representation of the spatial audio.
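
As a rough illustration of the encoding side, the Python sketch below builds a mono downmix from several microphone signals and attaches a relative delay and gain for each input. The estimators are assumptions chosen for brevity: delay is found by a brute-force cross-correlation search over non-negative integer lags against the first microphone, gain is an RMS ratio, and the downmix is a plain average. The abstract prescribes none of these, and the phase parameter it mentions is omitted.

```python
import math


def estimate_delay(ref, sig, max_lag):
    """Return the lag in [0, max_lag] that maximises sum(ref[n] * sig[n + lag])."""
    best_lag, best_score = 0, float("-inf")
    for lag in range(max_lag + 1):
        score = sum(r * s for r, s in zip(ref, sig[lag:]))
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def rms(x):
    """Root-mean-square level of a sample list."""
    return math.sqrt(sum(v * v for v in x) / len(x))


def encode(mics, max_lag=4):
    """Combine a mono downmix with per-input delay and gain metadata."""
    ref = mics[0]
    ref_level = rms(ref)
    metadata = []
    for sig in mics:
        metadata.append({
            "delay_samples": estimate_delay(ref, sig, max_lag),
            "gain": rms(sig) / ref_level if ref_level > 0 else 0.0,
        })
    # Assumed downmix: sample-wise average of all inputs.
    downmix = [sum(sig[i] for sig in mics) / len(mics) for i in range(len(ref))]
    return {"downmix": downmix, "metadata": metadata}


mic_a = [0.0, 1.0, 0.0, 0.0, 0.0, 0.0]
mic_b = [0.0, 0.0, 0.0, 0.5, 0.0, 0.0]  # same pulse, two samples later, half level
print(encode([mic_a, mic_b]))
```

A decoder could use the stored delays and gains to re-create approximate per-microphone signals from the downmix, which is the kind of representation the abstract describes.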

    REPRESENTING SPATIAL AUDIO BY MEANS OF AN AUDIO SIGNAL AND ASSOCIATED METADATA

    Publication No.: US20240114307A1

    Publication date: 2024-04-04

    Application No.: US18465636

    Filing date: 2023-09-12

    Inventor: Stefan BRUHN

    CPC classification number: H04S3/02 H04S2400/03

    Abstract: There is provided encoding and decoding methods for representing spatial audio that is a combination of directional sound and diffuse sound. An exemplary encoding method includes inter alia creating a single- or multi-channel downmix audio signal by downmixing input audio signals from a plurality of microphones in an audio capture unit capturing the spatial audio; determining first metadata parameters associated with the downmix audio signal, wherein the first metadata parameters are indicative of one or more of: a relative time delay value, a gain value, and a phase value associated with each input audio signal; and combining the created downmix audio signal and the first metadata parameters into a representation of the spatial audio.
