-
公开(公告)号:US20220030371A1
公开(公告)日:2022-01-27
申请号:US17495359
申请日:2021-10-06
Applicant: Huawei Technologies Co., Ltd.
Inventor: Mohammad TAGHIZADEH , Christof FALLER , Alexis FAVROT
IPC: H04S3/02 , H04R3/00 , G10L19/008 , H04R1/32
Abstract: A device and method, respectively, obtain a first order ambisonic (FOA) signal from signals of multiple microphones, e.g., at least four or five directive microphones. The device and method determine a look direction of each microphone, and calculate a decoding matrix based on the determined look directions. The decoding matrix is a matrix suitable for decoding a FOA signal into the signals of the microphones. Further, the device and method invert the decoding matrix to obtain an encoding matrix, and encode the signals of the microphones based on the encoding matrix to obtain the FOA signal.
-
公开(公告)号:US20210067868A1
公开(公告)日:2021-03-04
申请号:US17019757
申请日:2020-09-14
Applicant: Huawei Technologies Co., Ltd.
Inventor: Mohammad TAGHIZADEH , Christof FALLER , Alexis FAVROT
IPC: H04R1/40 , H04R3/02 , G10L19/02 , G10L19/008
Abstract: A method and a device encode N audio signals, from N microphones where N≥3. For each pair of the N audio signals an angle of incidence of direct sound is estimated. A-format direct sound signals are derived from the estimated angles of incidence by deriving from each estimated angle an A-format direct sound signal. Each A-format direct sound signal is a first-order virtual microphone signal, for example, a cardioids signal.
-
公开(公告)号:US20200057132A1
公开(公告)日:2020-02-20
申请号:US16664373
申请日:2019-10-25
Applicant: Huawei Technologies Co., Ltd.
Inventor: Kainan CHEN , Jürgen GEIGER , Mohammad TAGHIZADEH , Peter GROSCHE
Abstract: A device for estimating Direction of Arrival (DOA) of sound from Q≥1 sound sources is provided. The device is configured to obtain a phase difference matrix, which includes measured phase difference values, each of the measured phase difference values being a measured value of a phase difference between two microphone units for a frequency bin in a range of frequencies of the sound. The device is further configured to generate a replicated phase difference matrix by replicating the measured phase difference values to other potential sinusoidal periods, calculate a DOA value for each phase difference value in the replicated phase difference matrix, and determine, as Q DOA results, the Q most prominent peak values in a histogram generated based on the calculated DOA values.
-
公开(公告)号:US20220163664A1
公开(公告)日:2022-05-26
申请号:US17582837
申请日:2022-01-24
Inventor: Mohammad TAGHIZADEH , Michael GÜNTHER , Andreas BRENDEL , Walter KELLERMANN
Abstract: An apparatus determines a spatial position of an audio source in multi moving audio sources scenarios. The apparatus receives audio signal versions as local sound waves. The apparatus determines first and second probabilities for a direction of arrival of the audio signal version based on the audio signal versions received within a first time interval; determines third and fourth probabilities for the direction of arrival of the audio signal version based on the audio signal versions received within a second time interval; determines a first probability difference between the first and third probabilities; determines a second probability difference between the second and fourth probabilities; combines the third probability and the first probability difference to obtain an updated third probability; combines the fourth probability with the second probability difference to obtain an updated fourth probability; and determines the spatial position based on the updated third and fourth probabilities.
-
公开(公告)号:US20220150661A1
公开(公告)日:2022-05-12
申请号:US17581527
申请日:2022-01-21
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Mohammad TAGHIZADEH , Gil KEREN , Shuo LIU , Bjoern SCHULLER
IPC: H04S7/00 , G10L21/0208 , G10L21/10 , G10L25/18 , G10L25/30
Abstract: The disclosure relates to an audio processing apparatus, comprising: a plurality of audio sensors, each audio sensor configured to receive a respective plurality of audio frames of an audio signal from an audio source, wherein the respective plurality of audio frames defines an audio channel of the audio signal; and a processing circuitry configured to: determine a respective feature set having at least one feature for each audio frame of each of the plurality of audio frames, wherein the plurality of features define a three-dimensional feature array; process the three-dimensional feature array using a neural network, wherein the neural network comprises a self-attention layer configured to process a plurality of two-dimensional sub-arrays of the three-dimensional feature array; and generate an output signal on the basis of the plurality of processed two-dimensional sub-arrays. Moreover, the disclosure relates to a corresponding audio processing method.
-
-
-
-