-
公开(公告)号:US11984965B2
公开(公告)日:2024-05-14
申请号:US17496566
申请日:2021-10-07
Inventor: Mohammad Taghizadeh , Walter Kellermann , Michael Günther , Andreas Brendel
IPC: H04R3/00 , H04B7/0426 , H04B7/08
CPC classification number: H04B7/086 , H04B7/043 , H04B7/0434
Abstract: The disclosure relates to an audio processing apparatus for localizing an audio source. The audio processing apparatus comprises a plurality of audio sensors, including a primary audio sensor and at least two secondary audio sensors, configured to detect an audio signal from a target audio source, wherein the primary audio sensor defines at least two pairs of audio sensors with the at least two secondary audio sensors; and processing circuitry configured to: determine for each pair of audio sensors a first set of likelihoods of spatial directions of the target audio source using a first localization scheme; determine a second set of likelihoods of spatial directions of the target audio source using a second localization scheme; and determine a third set of likelihoods of spatial directions of the target audio source on the basis of the first sets of likelihoods and the second set of likelihoods.
-
公开(公告)号:US11921198B2
公开(公告)日:2024-03-05
申请号:US17582837
申请日:2022-01-24
Inventor: Mohammad Taghizadeh , Michael Günther , Andreas Brendel , Walter Kellermann
CPC classification number: G01S15/46 , H04R1/406 , H04R3/005 , G01S2015/465
Abstract: An apparatus determines a spatial position of an audio source in multi moving audio sources scenarios. The apparatus receives audio signal versions as local sound waves. The apparatus determines first and second probabilities for a direction of arrival of the audio signal version based on the audio signal versions received within a first time interval; determines third and fourth probabilities for the direction of arrival of the audio signal version based on the audio signal versions received within a second time interval; determines a first probability difference between the first and third probabilities; determines a second probability difference between the second and fourth probabilities; combines the third probability and the first probability difference to obtain an updated third probability; combines the fourth probability with the second probability difference to obtain an updated fourth probability; and determines the spatial position based on the updated third and fourth probabilities.
-
公开(公告)号:US11632626B2
公开(公告)日:2023-04-18
申请号:US17019757
申请日:2020-09-14
Applicant: Huawei Technologies Co., Ltd.
Inventor: Mohammad Taghizadeh , Christof Faller , Alexis Favrot
Abstract: A method and a device encode N audio signals, from N microphones where N≥3. For each pair of the N audio signals an angle of incidence of direct sound is estimated. A-format direct sound signals are derived from the estimated angles of incidence by deriving from each estimated angle an A-format direct sound signal. Each A-format direct sound signal is a first-order virtual microphone signal, for example, a cardioids signal.
-
公开(公告)号:US11889292B2
公开(公告)日:2024-01-30
申请号:US17581527
申请日:2022-01-21
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Mohammad Taghizadeh , Gil Keren , Shuo Liu , Bjoern Schuller
IPC: H04S7/00 , G10L21/0208 , G10L25/18 , G10L25/30 , G10L21/10
CPC classification number: H04S7/307 , G10L21/0208 , G10L21/10 , G10L25/18 , G10L25/30 , H04S2400/03
Abstract: The disclosure relates to an audio processing apparatus, comprising: a plurality of audio sensors, each audio sensor configured to receive a respective plurality of audio frames of an audio signal from an audio source, wherein the respective plurality of audio frames defines an audio channel of the audio signal; and a processing circuitry configured to: determine a respective feature set having at least one feature for each audio frame of each of the plurality of audio frames, wherein the plurality of features define a three-dimensional feature array; process the three-dimensional feature array using a neural network, wherein the neural network comprises a self-attention layer configured to process a plurality of two-dimensional sub-arrays of the three-dimensional feature array; and generate an output signal on the basis of the plurality of processed two-dimensional sub-arrays. Moreover, the disclosure relates to a corresponding audio processing method.
-
公开(公告)号:US11838739B2
公开(公告)日:2023-12-05
申请号:US17495359
申请日:2021-10-06
Applicant: Huawei Technologies Co., Ltd.
Inventor: Mohammad Taghizadeh , Christof Faller , Alexis Favrot
IPC: H04S3/02 , G10L19/008 , H04R1/32 , H04R3/00
CPC classification number: H04S3/02 , G10L19/008 , H04R1/326 , H04R3/005 , H04S2400/11 , H04S2420/11
Abstract: A device and method, respectively, obtain a first order ambisonic (FOA) signal from signals of multiple microphones, e.g., at least four or five directive microphones. The device and method determine a look direction of each microphone, and calculate a decoding matrix based on the determined look directions. The decoding matrix is a matrix suitable for decoding a FOA signal into the signals of the microphones. Further, the device and method invert the decoding matrix to obtain an encoding matrix, and encode the signals of the microphones based on the encoding matrix to obtain the FOA signal.
-
公开(公告)号:US11567162B2
公开(公告)日:2023-01-31
申请号:US16664373
申请日:2019-10-25
Applicant: Huawei Technologies Co., Ltd.
Inventor: Kainan Chen , Jürgen Geiger , Mohammad Taghizadeh , Peter Grosche
Abstract: A device for estimating Direction of Arrival (DOA) of sound from Q≥1 sound sources is provided. The device is configured to obtain a phase difference matrix, which includes measured phase difference values, each of the measured phase difference values being a measured value of a phase difference between two microphone units for a frequency bin in a range of frequencies of the sound. The device is further configured to generate a replicated phase difference matrix by replicating the measured phase difference values to other potential sinusoidal periods, calculate a DOA value for each phase difference value in the replicated phase difference matrix, and determine, as Q DOA results, the Q most prominent peak values in a histogram generated based on the calculated DOA values.
-
-
-
-
-