-
公开(公告)号:US20250140271A1
公开(公告)日:2025-05-01
申请号:US18687871
申请日:2021-08-30
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Anssi Sakari RÄMÖ , Mikko-Ville LAITINEN , Adriana VASILACHE , Lasse Juhani LAAKSONEN
IPC: G10L19/012 , G10L19/008 , G10L19/032
Abstract: There is inter alia disclosed an apparatus for spatial audio encoding configured to: determine an error of fit measure (408) between a plurality of spatial direction component values (402) from a plurality of audio frames and a curve fitted (405) to a data set comprising the plurality of spatial direction component values; compare the error of fit measure to a threshold value; and depending on the comparison, either use a method of non-prediction for generating at least one spatial direction component value for each remaining audio frame of the interval of audio frames, or use a method of prediction for generating the at least one spatial direction component (406) value for each remaining audio frame of the interval of audio frames.
-
公开(公告)号:US20250119700A1
公开(公告)日:2025-04-10
申请号:US18981955
申请日:2024-12-16
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Sampo VESA , Mikko-Ville LAITINEN , Jussi VIROLAINEN
Abstract: According to an example embodiment, a technique for processing an input audio signal comprising a multi-channel audio signal is provided, the technique comprising: deriving, based on the input audio signal, a first signal component comprising a multi-channel audio signal that represents a focus portion of a spatial audio image conveyed by the input audio signal and a second signal component comprising a multi-channel audio signal that represents a non-focus portion of the spatial audio image; processing the second signal component into a modified second signal component wherein the width of the spatial audio image is extended from that of the second signal component; and combining the first signal component and the modified second signal component into an output audio signal comprising a multi-channel audio signal that represents partially extended spatial audio image.
-
公开(公告)号:US20250095659A1
公开(公告)日:2025-03-20
申请号:US18728919
申请日:2022-01-18
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Mikko-Ville LAITINEN , Tapani PIHLAJAKUJA , Juha Tapio VILKAMO
IPC: G10L19/008 , G10L19/02 , H04S7/00
Abstract: An apparatus for spatial audio signal decoding and rendering associated with a plurality of speaker nodes placed within a three-dimensional space having virtual surface arrangement comprising a plurality of virtual surfaces. The apparatus determines an azimuth angle for each virtual surface of the virtual surface set and the arrange the virtual surfaces of the virtual surface set into an order based on azimuth angles to give an ordered virtual surface set. The apparatus then associates a virtual surface of the ordered virtual surface set to a search sector and starting from the associated virtual surface for the search sector, search the ordered virtual surface set to determine a virtual surface that encloses a target panning direction.
-
公开(公告)号:US20240029745A1
公开(公告)日:2024-01-25
申请号:US18245789
申请日:2021-08-25
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Tapani PIHLAJAKUJA , Mikko-Ville LAITINEN
IPC: G10L19/008 , G10L19/025 , G10L19/032
CPC classification number: G10L19/008 , G10L19/025 , G10L19/032
Abstract: An apparatus comprising means configured to: obtain at least one audio signal; obtain, for the at least one audio signal, spatial audio signal parameter values, the spatial audio signal parameters values distributed within a time-frequency domain (106); determine a merge metric to control a merging of the spatial audio signal parameter values over the time-frequency domain (201); and merge (203), based on the merge metric (202), the spatial audio signal parameter values to a smaller number of spatial audio signal parameter values overtime and/or frequency within the time-frequency domain.
-
公开(公告)号:US20230178085A1
公开(公告)日:2023-06-08
申请号:US17998992
申请日:2021-04-15
Applicant: Nokia Technologies Oy
Inventor: Tapani PIHLAJAKUJA , Mikko-Ville LAITINEN , Lasse Juhani LAAKSONEN , Adriana VASILACHE , Anssi RÄMÖ
IPC: G10L19/008 , G10L25/21 , H04S7/00 , G10L19/02 , G10L25/18
CPC classification number: G10L19/008 , G10L25/21 , H04S7/30 , G10L19/0204 , G10L25/18 , H04S2420/07 , H04S2420/03 , H04S2400/11 , H04S3/008
Abstract: There is inter alia disclosed an apparatus for spatial audio encoding comprising: means for analysing a plurality of spatial audio parameter sets associated with a frame of one or more audio signals, wherein the plurality of spatial audio parameter sets are associated with a plurality of subframes, a plurality of frequency sub bands and a plurality of sound source directions for the frame of the one or more audio signals; and means for determining from the analysis of the plurality of spatial audio parameter sets at least one spatial audio parameter set for subframes of the frame of the one or more audio signals.
-
公开(公告)号:US20230079683A1
公开(公告)日:2023-03-16
申请号:US17802261
申请日:2021-02-03
Applicant: Nokia Technologies Oy
Inventor: Juha Vilkamo , Mikko-Ville LAITINEN , Archontis POLITIS
Abstract: An apparatus circuitry including configured to: obtain two or more audio signal sets, wherein each audio signal set is associated with a position; obtain at least one parameter value for at least two of the audio signal sets; obtain the positions associated with at least the at least two of the audio signal sets; obtain a listener position; generate at least one audio signal based on at least one audio signal from at least one of the two or more audio signal sets based on the positions associated with the at least the at least two of the audio signal sets and the listener position; generate at least one modified parameter value based on the obtained at least one parameter value for the at least two of the audio signal sets, the positions associated with the at least two of the audio signal sets and the listener position; and process the at least one audio signal based on the at least one modified parameter value to generate a spatial audio output.
-
公开(公告)号:US20220369061A1
公开(公告)日:2022-11-17
申请号:US17767265
申请日:2020-09-29
Applicant: Nokia Technologies Oy
Inventor: Juha VILKAMO , Mikko-Ville LAITINEN
Abstract: An apparatus including circuitry configured to: obtain a spatial audio signal including at least one audio signal and spatial metadata associated with the at least one audio signal; obtain at least one data set related to binaural rendering; obtain at least one pre-defined data set related to binaural rendering; and generate a binaural audio signal based on a combination of at least part of the at least one data set and the at least one pre-defined data set, and the spatial audio signal.
-
公开(公告)号:US20220328056A1
公开(公告)日:2022-10-13
申请号:US17595947
申请日:2020-06-03
Applicant: Nokia Technologies Oy
Inventor: Juha VILKAMO , Koray OZCAN , Mikko-Ville LAITINEN
IPC: G10L21/0208 , H04S7/00
Abstract: An apparatus including circuitry configured to obtain a defocus direction; process a spatial audio signal that represents an audio scene to generate a processed spatial audio signal that represents a modified audio scene based on the defocus direction, so as to control relative deemphasis in, at least in part, a portion of the spatial audio signal in the defocus direction relative to at least in part other portions of the spatial audio signal; and output the processed spatial audio signal, wherein the modified audio scene based on the defocus direction enables the deemphasis in, at least in part, the portion of the spatial audio signal in the defocus direction relative to at least in part other portions of the spatial audio signal.
-
公开(公告)号:US20220303710A1
公开(公告)日:2022-09-22
申请号:US17596119
申请日:2020-06-03
Applicant: Nokia Technologies Oy
Inventor: Juha VILKAMO , Koray OZCAN , Mikko-Ville LAITINEN
IPC: H04S7/00 , G10L19/008 , G10L21/0364
Abstract: An apparatus for spatial audio reproduction including circuitry configured to: obtain at least one focus parameter configured to define a focus shape; process a spatial audio signal that represents an audio scene to generate a processed spatial audio signal that represents a modified audio scene, so as to control relative emphasis in, at least in part, a portion of the spatial audio signal in the focus shape relative to at least in part; other portions of the spatial audio signals outside the focus shape and output the processed spatial audio signal, wherein the modified audio scene enables the relative emphasis in, at least in part, the portion of the spatial audio signal in the focus shape relative to at least in part other portions of the spatial audio signals outside the focus shape.
-
公开(公告)号:US20220225053A1
公开(公告)日:2022-07-14
申请号:US17573033
申请日:2022-01-11
Applicant: Nokia Technologies Oy
Inventor: Juha VILKAMO , Mikko-Ville LAITINEN
Abstract: A method for spatial audio signal processing including: obtaining, from a first capture device, at least one first audio signal and at least one first direction parameter for at least one frequency band; obtaining, from a second capture device, at least one second audio signal and at least one second direction parameter for the at least one frequency band; obtaining a first position associated with the first capture device; obtaining a second position associated with the second capture device; determining a distance parameter for the at least one frequency band in relation to the first position based, at least partially, on the at least one first direction parameter and the at least one second direction parameter; and enabling an output and/or store of the at least one first audio signal, the at least one first direction parameter and the distance parameter.
-
-
-
-
-
-
-
-
-