-
Publication No.: US20250071497A1
Publication date: 2025-02-27
Application No.: US18723930
Filing date: 2022-12-09
Applicant: Nokia Technologies Oy
Inventor: Mikko-Ville LAITINEN , Juha Tapio VILKAMO
Abstract: Examples of the disclosure enable spatial audio rendering in a different format to the format that is used for the spatial audio coding. In examples of the disclosure spatial audio and first spatial metadata in a first format are obtained. The first spatial metadata enables rendering of spatial audio in a first audio format. In order to enable rendering of the spatial audio in a different format the spatial metadata is converted to second spatial metadata corresponding to a second audio format. The spatial audio can then be rendered for the second format using the second spatial metadata.
-
Publication No.: US20250024216A1
Publication date: 2025-01-16
Application No.: US18715679
Filing date: 2021-12-03
Applicant: BEIJING XIAOMI MOBILE SOFTWARE CO., LTD.
Inventor: Shuo GAO
IPC: H04S3/02
Abstract: A method for processing a stereo audio signal, performed by an encoding device, includes: determining an initial first threshold Thresh01 and an initial second threshold Thresh02 of a current frame of the stereo audio signal, where Thresh01∈(−1,0), and Thresh02∈(0,1); determining an offset value Delta; determining a first threshold Thresh1 and a second threshold Thresh2 corresponding to the current frame of the stereo audio signal according to a de-correlation manner for a previous frame of the stereo audio signal, the offset value Delta, the initial first threshold Thresh01 of the current frame, and the initial second threshold Thresh02 of the current frame; and performing de-correlation on the current frame according to the first threshold Thresh1 and the second threshold Thresh2 corresponding to the current frame.
-
Publication No.: US20240406650A1
Publication date: 2024-12-05
Application No.: US18606040
Filing date: 2024-03-15
Inventor: Leif Jonas Samuelsson , Dirk Jeroen Breebaart , David Matthew Cooper , Jeroen Koppens
Abstract: Methods for dialogue enhancing audio content, comprising providing a first audio signal presentation of the audio components, providing a second audio signal presentation, receiving a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation, applying said set of dialogue estimation parameters to said first audio signal presentation, to form a dialogue presentation of the dialogue components; and combining the dialogue presentation with said second audio signal presentation to form a dialogue enhanced audio signal presentation for reproduction on the second audio reproduction system, wherein at least one of said first and second audio signal presentation is a binaural audio signal presentation.
-
Publication No.: US12156012B2
Publication date: 2024-11-26
Application No.: US18465636
Filing date: 2023-09-12
Inventor: Stefan Bruhn
IPC: H04S3/02
Abstract: There is provided encoding and decoding methods for representing spatial audio that is a combination of directional sound and diffuse sound. An exemplary encoding method includes inter alia creating a single- or multi-channel downmix audio signal by downmixing input audio signals from a plurality of microphones in an audio capture unit capturing the spatial audio; determining first metadata parameters associated with the downmix audio signal, wherein the first metadata parameters are indicative of one or more of: a relative time delay value, a gain value, and a phase value associated with each input audio signal; and combining the created downmix audio signal and the first metadata parameters into a representation of the spatial audio.
-
Publication No.: US12087310B2
Publication date: 2024-09-10
Application No.: US17148638
Filing date: 2021-01-14
Inventor: Arne Borsum , Stephan Schreiner , Harald Fuchs , Michael Kratz , Bernhard Grill , Sebastian Scharrer
CPC classification number: G10L19/008 , G10L19/02 , G10L19/173 , H04S3/002 , H04S3/02 , H04S5/005 , H04S2400/03 , H04S2400/11 , H04S2420/03
Abstract: An apparatus for downmixing three or more audio input channels to obtain two or more audio output channels is provided. The apparatus includes a receiving interface for receiving the three or more audio input channels and for receiving side information. Moreover, the apparatus includes a downmixer for downmixing the three or more audio input channels depending on the side information to obtain the two or more audio output channels. The number of the audio output channels is smaller than the number of the audio input channels. The side information indicates a characteristic of at least one of the three or more audio input channels, or a characteristic of one or more sound waves recorded within the one or more audio input channels, or a characteristic of one or more sound sources which emitted one or more sound waves recorded within the one or more audio input channels.
-
Publication No.: US20240259744A1
Publication date: 2024-08-01
Application No.: US18591517
Filing date: 2024-02-29
Applicant: Nokia Technologies Oy
Inventor: Mikko-Ville LAITINEN , Lasse LAAKSONEN , Juha VILKAMO
IPC: H04S3/02 , G10L19/008
CPC classification number: H04S3/02 , G10L19/008 , H04S2400/01 , H04S2420/11
Abstract: An apparatus configured to: obtain at least one signal, wherein the at least one signal comprises one or more transport audio signals; obtain an indicator specifying a type of the one or more transport audio signals; and process the one or more transport audio signals based, at least partially, on the indicator to generate one or more processed transport audio signals that are of an at least partially different type than the type of the one or more transport audio signals.
-
Publication No.: US12010501B2
Publication date: 2024-06-11
Application No.: US17521762
Filing date: 2021-11-08
Applicant: DOLBY INTERNATIONAL AB
Inventor: Johannes Boehm , Florian Keiler
CPC classification number: H04S3/008 , G10L19/00 , H04S1/002 , H04S1/007 , H04S3/02 , H04S7/30 , H04S2400/01 , H04S2400/11 , H04S2420/11
Abstract: Decoding of Ambisonics representations for a stereo loudspeaker setup is known for first-order Ambisonics audio signals. But such first-order Ambisonics approaches have either high negative side lobes or poor localisation in the frontal region. The invention deals with the processing for stereo decoders for higher-order Ambisonics HOA.
-
Publication No.: US20240171925A1
Publication date: 2024-05-23
Application No.: US18430118
Filing date: 2024-02-01
Applicant: Voyetra Turtle Beach, Inc.
Inventor: Kevin Arthur Robertson , David Gallagher , Nicholas Richard Bourne
IPC: H04S3/02 , G09G5/00 , H04L67/00 , H04L67/08 , H04N21/436 , H04N21/439 , H04R1/10 , H04S3/00 , H04S7/00
CPC classification number: H04S3/02 , G09G5/006 , H04L67/00 , H04L67/08 , H04L67/34 , H04N21/43615 , H04N21/439 , H04R1/1041 , H04R2420/09 , H04S3/004 , H04S7/307
Abstract: In an audio system that includes at least one audio output element for outputting audio signals, one or more audio settings, from a plurality of audio settings supported in the audio system, may be determined based on a selected audio mode supported in the audio system and mapping data. The mapping data defines, for at least the selected audio mode, valid values for at least one audio setting. At least one user control element may be configured to enable a user input that includes a selection for the one or more audio settings, with the configuring including adjusting operation of the at least one user control element, and the adjusting including enabling for selection or setting, via the at least one user control element, values for the user input that match or correspond to only the valid values for the at least one audio setting.
-
Publication No.: US20240121567A1
Publication date: 2024-04-11
Application No.: US18525910
Filing date: 2023-12-01
Applicant: KONINKLIJKE PHILIPS N.V.
Inventor: ERIK G.P. SCHUIJERS
IPC: H04S5/00 , G10L19/008 , H04S3/02
CPC classification number: H04S5/00 , G10L19/008 , H04S3/02 , H04S2400/03 , H04S2420/03
Abstract: A parametric stereo upmix method for generating a left signal and a right signal from a mono downmix signal based on spatial parameters includes predicting a difference signal comprising a difference between the left signal and the right signal based on the mono downmix signal scaled with a prediction coefficient. The prediction coefficient is derived from the spatial parameters. The method further includes deriving the left signal and the right signal based on a sum and a difference of the mono downmix signal and said difference signal.
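Illustration: the upmix described in this abstract can be sketched roughly as follows. This is a minimal sketch, not the patented method. It assumes the mono downmix is m = (l + r) / 2 and the difference signal is d = (l - r) / 2, so that l = m + d and r = m - d. The abstract does not give the formula that derives the prediction coefficient from the spatial parameters, so the coefficient `alpha` is simply passed in. The function name `upmix_mono_to_stereo` and the per-sample list representation are hypothetical.

```python
from typing import List, Tuple


def upmix_mono_to_stereo(mono: List[float], alpha: float) -> Tuple[List[float], List[float]]:
    """Predict a difference signal from the mono downmix and form left/right.

    Assumptions (not stated in the abstract): mono = (left + right) / 2 and
    the difference signal is (left - right) / 2. `alpha` stands in for the
    prediction coefficient that the abstract derives from spatial parameters.
    """
    # Predicted difference signal: the mono downmix scaled by the coefficient.
    diff = [alpha * m for m in mono]
    # Left is the sum, right is the difference, of downmix and predicted difference.
    left = [m + d for m, d in zip(mono, diff)]
    right = [m - d for m, d in zip(mono, diff)]
    return left, right


if __name__ == "__main__":
    left, right = upmix_mono_to_stereo([1.0, 2.0], 0.5)
    print(left, right)  # [1.5, 3.0] [0.5, 1.0]
```

With `alpha` set to 0 the predicted difference vanishes and both outputs equal the mono downmix; a practical decoder would typically apply a coefficient per time/frequency tile rather than one value for the whole signal.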
-
Publication No.: US20240114307A1
Publication date: 2024-04-04
Application No.: US18465636
Filing date: 2023-09-12
Inventor: Stefan BRUHN
IPC: H04S3/02
CPC classification number: H04S3/02 , H04S2400/03
Abstract: There is provided encoding and decoding methods for representing spatial audio that is a combination of directional sound and diffuse sound. An exemplary encoding method includes inter alia creating a single- or multi-channel downmix audio signal by downmixing input audio signals from a plurality of microphones in an audio capture unit capturing the spatial audio; determining first metadata parameters associated with the downmix audio signal, wherein the first metadata parameters are indicative of one or more of: a relative time delay value, a gain value, and a phase value associated with each input audio signal; and combining the created downmix audio signal and the first metadata parameters into a representation of the spatial audio.