-
公开(公告)号:US20230024675A1
公开(公告)日:2023-01-26
申请号:US17953134
申请日:2022-09-26
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Mikko-Ville LAITINEN , Mikko TAMMI , Jussi VIROLAINEN , Jorma MÄKINEN
Abstract: According to an example embodiment, a method for processing a multi-channel input audio signal representing a sound field into a multi-channel output audio signal representing said sound field in accordance with a predefined loudspeaker layout is provided, the method comprising the following for at least one frequency band: obtaining spatial audio parameters that are descriptive of spatial characteristics of said sound field; estimating a signal energy of the sound field represented by the multi-channel input audio signal; estimating, based on said signal energy and the obtained spatial audio parameters, respective output signal energies for channels of the multi-channel output audio signal according to said predefined loudspeaker layout; determining a maximum output energy as the largest of the output signal energies across channels of said multi-channel output audio signal; and deriving, on basis of said maximum output energy, a gain value for adjusting sound reproduction gain in at least one of said channels of the multi-channel output audio signal.
-
公开(公告)号:US20190394606A1
公开(公告)日:2019-12-26
申请号:US16486176
申请日:2018-01-24
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Mikko TAMMI , Toni MÄKINEN , Jussi VIROLAINEN , Mikko HEIKKINEN
Abstract: Apparatus comprising one or more processors configured to: receive at least two microphone audio signals (101) for audio signal processing wherein the audio signal processing comprises at least a spatial audio signal processing (303) and beamforming processing (305); determine spatial information (304) based on the audio signal processing associated with the at least two microphone audio signals; determine focus information (308) for the beamforming processing associated with the at least two microphone audio signals; and apply a spatial filter (307) in order to synthesize at least one spatially processed audio signal (312) based on the at least one beamformed audio signal from the at least two microphone audio signals (101), the spatial information (304) and the focus information (308) in such a way that the spatial filter (307), the at least one beamformed audio signal (306), the spatial information (304) and the focus information (308) are configured to be used to spatially synthesize (307) the at least one spatially processed audio signal (312).
-
公开(公告)号:US20230179939A1
公开(公告)日:2023-06-08
申请号:US18161809
申请日:2023-01-30
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Lasse LAAKSONEN , Mikko TAMMI , Miikka VILERMO , Arto LEHTINIEMI
IPC: H04S7/00 , G10L19/008 , G10L19/022
CPC classification number: H04S7/30 , G10L19/008 , G10L19/022
Abstract: An apparatus for audio signal processing audio objects within at least one audio scene, the apparatus comprising at least one processor configured to: define for at least one time period at least one contextual grouping comprising at least two of a plurality of audio objects and at least one further audio object of the plurality of audio objects outside of the at least one contextual grouping, the plurality of audio objects within at least one audio scene; and define with respect to the at least one contextual grouping at least one first parameter and/or parameter rule type which is configured to be applied with respect to a common element associated with the at least two of the plurality of audio objects and wherein the at least one first parameter and/or parameter rule type is configured to be applied with respect to individual element associated with the at least one further audio object outside of the at least one contextual grouping, the at least one first parameter and/or parameter rule type being applied in audio rendering of both the at least two of the plurality of audio objects and the at least one further audio object.
-
公开(公告)号:US20210250717A1
公开(公告)日:2021-08-12
申请号:US16973600
申请日:2019-06-12
Applicant: Nokia Technologies Oy
Inventor: Mikko-Ville LAITINEN , Milkka VILERMO , Mikko TAMMI , Jussi VIROLAINEN , Juha VILKAMO
Abstract: An apparatus including circuitry configured for: receiving at least two audio signals; determining at least one lower frequency effect parameter based on the at least two audio signals; determining at least one transport audio signal based on the at least two audio signals; controlling a transmission/storage of the at least one transport audio signal and the at least one lower frequency effect information such that a rendering based on the at least one transport audio signal and the at least one lower frequency effect information enables a determination of at least one low frequency effect channel.
-
公开(公告)号:US20200275230A1
公开(公告)日:2020-08-27
申请号:US16753698
申请日:2018-09-24
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Lasse LAAKSONEN , Mikko TAMMI , Miikka VILERMO , Arto LEHTINIEMI
IPC: H04S7/00 , G10L19/008 , G10L19/022
Abstract: An apparatus for audio signal processing audio objects within at least one audio scene, the apparatus comprising at least one processor configured to:define for at least one time period at least one contextual grouping comprising at least two of a plurality of audio objects and at least one further audio object of the plurality of audio objects outside of the at least one contextual grouping, the plurality of audio objects within at least one audio scene; anddefine with respect to the at least one contextual grouping at least one first parameter and/or parameter rule type which is configured to be applied with respect to a common element associated with the at least two of the plurality of audio objects and wherein the at least one first parameter and/or parameter rule type is configured to be applied with respect to individual element associatedwith the at least one further audio object outside of the at least one contextual grouping, the at least one first parameter and/or parameter rule type being applied in audio rendering of both the at least two of the plurality of audio objects and the at least one further audio object.
-
公开(公告)号:US20210360362A1
公开(公告)日:2021-11-18
申请号:US16625597
申请日:2018-06-08
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Mikko-Ville LAITINEN , Mikko TAMMI , Jussi VIROLAINEN , Jorma MÄKINEN
Abstract: According to an example embodiment, a method for processing a multi-channel input audio signal representing a sound field into a multi-channel output audio signal representing said sound field in accordance with a predefined loudspeaker layout is provided, the method comprising the following for at least one frequency band: obtaining spatial audio parameters that are descriptive of spatial characteristics of said sound field; estimating a signal energy of the sound field represented by the multi-channel input audio signal; estimating, based on said signal energy and the obtained spatial audio parameters, respective output signal energies for channels of the multi-channel output audio signal according to said predefined loudspeaker layout; determining a maximum output energy as the largest of the output signal energies across channels of said multi-channel output audio signal; and deriving, on basis of said maximum output energy, a gain value for adjusting sound reproduction gain in at least one of said channels of the multi-channel output audio signal.
-
公开(公告)号:US20180213309A1
公开(公告)日:2018-07-26
申请号:US15742240
申请日:2016-07-05
Applicant: Nokia Technologies Oy
Inventor: Mikko-Ville Laitinen , Mikko TAMMI , Miikka VILERMO
CPC classification number: H04R1/005 , H04R1/406 , H04R3/005 , H04R5/027 , H04R2201/401 , H04S7/30 , H04S2400/15 , H04S2420/01
Abstract: Apparatus including: an audio capture application configured to determine separate microphones from a plurality of microphones and identify a sound source direction of at least one audio source within an audio scene by analysing respective two or more audio signals from the separate microphones, wherein the audio capture application is further configured to adaptively select, from the plurality of microphones, two or more respective audio signals based on the determined direction and furthermore configured to select, from the two or more respective audio signals, a reference audio signal also based on the determined direction; and a signal generator configured to generate a mid signal representing the at least one audio source based on a combination of the selected two or more respective audio signals and with reference to the reference audio signal.
-
公开(公告)号:US20160073198A1
公开(公告)日:2016-03-10
申请号:US14777825
申请日:2013-03-20
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Miikka VILERMO , Mikko TAMMI , Joonas NIKUNEN , Tuomas VIRTANEN
CPC classification number: H04R5/027 , G10L21/028 , H04N7/15 , H04R1/406 , H04R3/005 , H04R2201/401 , H04R2430/23
Abstract: An apparatus comprising: an input configured to receive at least two audio signals; a frequency domain transformer configured to transform the at least two audio signals into a frequency domain representation of the at least two signals; a spatial covariance processor configured to generate an observed spatial covariance matrix from the frequency domain representations of the at least two audio signals; a beamformer configured to generate a spatial covariance matrix model comprising at least one beamformer kernel; a matrix factorizer configured to generate a linear magnitude mode! of audio objects; to combine the spatial covariance matrix model and the linear magnitude model; and further configured to determine at least one combination parameter, such that the at least one parameter for the combination attempts to optimise the combination; and a separator configured to cluster the audio objects based on the at least one combination parameter to create separated audio sources.
Abstract translation: 一种装置,包括:被配置为接收至少两个音频信号的输入; 频域变换器,被配置为将所述至少两个音频信号变换为所述至少两个信号的频域表示; 空间协方差处理器,被配置为从所述至少两个音频信号的频域表示中产生观测空间协方差矩阵; 波束形成器,被配置为生成包括至少一个波束形成器内核的空间协方差矩阵模型; 配置成生成线性幅度模式的矩阵分解器! 的音频对象; 组合空间协方差矩阵模型和线性幅度模型; 并且还被配置为确定至少一个组合参数,使得所述组合的所述至少一个参数尝试优化所述组合; 以及分配器,被配置为基于所述至少一个组合参数来聚集所述音频对象以创建分离的音频源。
-
公开(公告)号:US20250147140A1
公开(公告)日:2025-05-08
申请号:US19016153
申请日:2025-01-10
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Miikka VILERMO , Mikko TAMMI , Toni MÄKINEN , Juha VILKAMO
IPC: G01S3/805 , G01S3/80 , G10L19/008 , G10L21/0216 , H04R3/00
Abstract: A method for audio focusing comprises: receiving a multi-channel audio signal that represents sounds in sound directions that correspond to respective positions in an image area of an image; receiving an indication of an audio focus direction that corresponds to a first position in the image area; selecting a primary sound direction from a plurality of different available candidate directions, wherein said different available candidate directions comprise said audio focus direction and one or more offset candidate directions and wherein each offset candidate direction corresponds to a respective candidate offset from said first position in the image area; and deriving, based on said multi-channel audio signal in dependence of the selected primary sound direction, an output audio signal where sounds in sound directions defined via the selected primary sound direction are emphasized in relation to sounds in sound directions other than those defined via the selected primary sound direction.
-
公开(公告)号:US20230254659A1
公开(公告)日:2023-08-10
申请号:US18301792
申请日:2023-04-17
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Juha VILKAMO , Miikka VILERMO , Mikko TAMMI , Jussi VIROLAINEN
IPC: H04S7/00
CPC classification number: H04S7/303 , H04S2400/15 , H04S2420/11
Abstract: A method, apparatus and computer program, the method comprising: receiving a plurality of input signals representing a sound space; using the received plurality of input signals to obtain spatial metadata corresponding to the sound space; using the received plurality of input signals to obtain a first spatial audio signal corresponding to the spatial metadata; and associating the first spatial audio signal with the spatial metadata to enable the spatial metadata to be used to process the first spatial audio signal to obtain a second spatial audio signal.
-
-
-
-
-
-
-
-
-