-
Publication No.: US20210397965A1
Publication date: 2021-12-23
Application No.: US17354398
Filing date: 2021-06-22
Applicant: Nokia Technologies Oy
Inventor: Honglei ZHANG , Francesco CRICRI , Hamed REZAZADEGAN TAVAKOLI , Joachim WABNIG , Iraj SANIEE , Miska Matias HANNUKSELA , Emre AKSU
Abstract: An apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to: estimate an importance of parameters of a neural network based on a graph diffusion process over at least one layer of the neural network; determine the parameters of the neural network that are suitable for pruning or sparsification; remove neurons of the neural network to prune or sparsify the neural network; and provide at least one syntax element for signaling the pruned or sparsified neural network over a communication channel, wherein the at least one syntax element comprises at least one neural network representation syntax element.
-
Publication No.: US20180295463A1
Publication date: 2018-10-11
Application No.: US15767422
Filing date: 2016-10-07
Applicant: Nokia Technologies Oy
Inventor: Antti ERONEN , Jussi LEPPANEN , Arto LEHTINIEMI , Sujeet MATE , Francesco CRICRI
IPC: H04S7/00 , H04R1/40 , H04R3/00 , G10L19/008
Abstract: A spatial audio signal is received that is associated with a microphone array configured to provide spatial audio capture and additional audio signal(s) associated with an additional microphone, the additional audio signal having been delayed by a variable delay determined such that common components of the spatial audio signal and the additional audio signal(s) are time aligned. A relative position is received between a first position associated with the microphone array and a second position associated with the additional microphone. Source parameter(s) are received classifying an audio source associated with the common components and/or space parameter(s) identifying an environment within which the audio source is located. A processing effect ruleset is determined based on the source parameter(s) and/or the space parameter(s). Multiple output audio channel signals are generated by mixing and applying processing effect(s) to the spatial audio signal and the additional audio signal(s) based on the processing effect ruleset(s).
-
Publication No.: US20230343022A1
Publication date: 2023-10-26
Application No.: US17787761
Filing date: 2020-12-15
Applicant: Nokia Technologies Oy
Inventor: Jussi LEPPANEN , Miikka VILERMO , Francesco CRICRI , Arto LEHTINIEMI
CPC classification number: G06T15/205 , G02B27/01 , G06F3/013 , G06F3/017 , A63F13/5255
Abstract: An apparatus including circuitry configured for: in a first-person perspective mediated reality state, rendering mediated reality content as content distributed across a first area of a user's field of view, wherein a point of view of a user determines a point of view within a three-dimensional virtual space and determines at least part of the content distributed across the first area of a user's field of view as a virtual scene; responding to at least one user gesture to enter a spatially consolidated state; and in the spatially consolidated state, rendering the mediated reality content as content distributed across a second area of a user's field of view, wherein the second area is smaller than the first area and the point of view of the user does not determine the content distributed across the second area of a user's field of view.
-
Publication No.: US20200015021A1
Publication date: 2020-01-09
Application No.: US16464913
Filing date: 2017-11-20
Applicant: Nokia Technologies Oy
Inventor: Jussi LEPPANEN , Antti ERONEN , Francesco CRICRI , Arto LEHTINIEMI
Abstract: An apparatus for identifying which sound sources are associated with which microphone audio signals, the apparatus including a processor configured to: determine/receive a position/orientation of at least one sound source relative to a microphone array; receive at least one microphone audio signal, each microphone audio signal received from a microphone; receive an audio-focussed audio signal from the microphone array, wherein the audio-focussed audio signal is directed from the microphone array towards the one of the at least one sound source so as to enhance the audio-focussed audio signal; compare the audio-focussed audio signal against each microphone audio signal to identify a match between one of the at least one microphone audio signal and the audio-focussed audio signal; and associate the one of the at least one microphone with the at least one sound source, based on the identified match.
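Note: the comparison step in this abstract can be illustrated with a minimal sketch. It assumes the match is scored by zero-lag normalized correlation, which the abstract does not specify; the function names `normalized_correlation` and `best_matching_microphone` are hypothetical and not taken from the publication.

```python
import math


def normalized_correlation(a, b):
    """Zero-lag normalized correlation of two sample sequences (assumed metric)."""
    n = min(len(a), len(b))
    a, b = a[:n], b[:n]
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    # A silent signal has zero energy; treat it as uncorrelated.
    return dot / norm if norm > 0 else 0.0


def best_matching_microphone(focused, mic_signals):
    """Return (index, score) of the microphone signal most similar to `focused`."""
    if not mic_signals:
        raise ValueError("at least one microphone signal is required")
    scores = [abs(normalized_correlation(focused, m)) for m in mic_signals]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores[best]


# Hypothetical data: the second microphone carries a scaled copy of the focused signal.
focused = [0, 1, 0, -1, 0, 1, 0, -1]
mics = [
    [1, 1, 1, 1, 1, 1, 1, 1],
    [0, 2, 0, -2, 0, 2, 0, -2],
    [1, 0, -1, 0, 1, 0, -1, 0],
]
index, score = best_matching_microphone(focused, mics)
```

A practical system would likely compare over a range of lags or in a feature domain; the zero-lag form is used here only to keep the sketch short.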
-
Publication No.: US20190313174A1
Publication date: 2019-10-10
Application No.: US16464743
Filing date: 2017-11-20
Applicant: Nokia Technologies Oy
Inventor: Jussi LEPPANEN , Arto LEHTINIEMI , Antti ERONEN , Francesco CRICRI
Abstract: An apparatus for controlling a controllable position/orientation of at least one audio source within an audio scene, the audio scene including the at least one audio source and a capture device, the apparatus including a processor configured to: receive a physical position/orientation of the at least one audio source relative to a capture device capture orientation; receive an earlier physical position/orientation of the at least one audio source relative to the capture device capture orientation; receive at least one control parameter; and control a controllable position/orientation of the at least one audio source, the controllable position being between the physical position/orientation of the at least one audio source relative to the capture device capture orientation and the earlier physical position/orientation of the at least one audio source relative to the capture device capture orientation and based on the control parameter.
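Note: one way to read "a controllable position between the physical position and the earlier physical position, based on the control parameter" is as an interpolation. The sketch below assumes the position is a single azimuth angle in degrees and that the control parameter is a blend factor in [0, 1]; both are illustrative assumptions, and `controlled_azimuth` is a hypothetical name.

```python
def controlled_azimuth(current_deg, earlier_deg, control):
    """Blend from the earlier azimuth (control=0) to the current one (control=1)."""
    if not 0.0 <= control <= 1.0:
        raise ValueError("control must lie in [0, 1]")
    # Shortest signed angular difference, in [-180, 180), so the blend
    # takes the short way around the circle.
    diff = (current_deg - earlier_deg + 180.0) % 360.0 - 180.0
    return (earlier_deg + control * diff) % 360.0


# A source that moved from 350 degrees to 10 degrees, rendered halfway between.
halfway = controlled_azimuth(10.0, 350.0, 0.5)
```

With control set to 0 the source stays at its earlier position, and with control set to 1 it follows the physical position exactly.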
-
Publication No.: US20190058861A1
Publication date: 2019-02-21
Application No.: US16078746
Filing date: 2017-02-22
Applicant: Nokia Technologies Oy
Inventor: Francesco CRICRI , Arto LEHTINIEMI , Antti ERONEN
IPC: H04N13/117 , H04N5/262 , H04N13/366
Abstract: An apparatus comprising: at least one processor; and at least one memory including computer program code, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus to perform at least the following: in respect of virtual reality content comprising video imagery configured to provide a virtual reality space for viewing in virtual reality and based on a plurality of indicated highlight portions, each highlight portion comprising a spatial portion of the video imagery that forms the virtual reality space and being smaller in spatial extent than the spatial extent of the virtual reality space, and further based on a viewing direction in the virtual reality space of each of the plurality of highlight portions, provide for one or more of generation or display of virtual reality summary content comprising a plurality of clips, each clip comprising the video imagery associated with one of the highlight portions, the virtual reality summary content configured to provide for display of the clips in a time consecutive manner and to provide for display of consecutive clips with a modified spatial separation such that the angular separation between a clip viewing direction of at least one clip and a clip viewing direction of an immediately preceding clip is less than the angular separation between the viewing directions of the highlight portions associated with said at least one clip and said immediately preceding clip.
-
Publication No.: US20240121570A1
Publication date: 2024-04-11
Application No.: US18275238
Filing date: 2022-01-18
Applicant: Nokia Technologies Oy
Inventor: Antti Johannes ERONEN , Francesco CRICRI
Abstract: Example apparatus include circuitry for: obtaining audio content representing at least one audio space; enabling at least one digital signal processing operation to render the audio content such that the rendered audio content includes at least one target response for the at least one audio space, wherein the enabling of the at least one digital signal processing operation to render the audio content is controlled based on obtaining the at least one target response for the at least one audio space. When the obtained target response is known, the circuitry obtains at least one parameter for the at least one digital signal processing operation. When the obtained target response is unknown, the circuitry obtains at least one parameter for a neural network and determines at least one parameter for the at least one digital signal processing operation.
-
Publication No.: US20230164336A1
Publication date: 2023-05-25
Application No.: US17917153
Filing date: 2021-03-30
Applicant: Nokia Technologies Oy
Inventor: Francesco CRICRI , Nam LE , Hamed REZAZADEGAN TAVAKOLI , Honglei ZHANG , Miska Matias HANNUKSELA , Emre Baris AKSU
IPC: H04N19/42 , H04N19/192 , G06N3/0455 , G06N3/084
CPC classification number: H04N19/42 , H04N19/192 , G06N3/0455 , G06N3/084
Abstract: Example embodiments provide a system for training a data coding pipeline including a feature extractor neural network, an encoder neural network, and a decoder neural network configured to reconstruct input data based on encoded features. A plurality of losses corresponding to different tasks may be determined for the coding pipeline. Tasks may be performed based on an output of the coding pipeline. A weight update may be determined for at least a subset of the coding pipeline based on the plurality of losses. The weight update may be configured to reduce a number of iterations for fine-tuning the coding pipeline for one of the tasks. This enables faster adaptation of the coding pipeline for one of the tasks after deployment of the coding pipeline. Apparatuses, methods, and computer programs are disclosed.
-
Publication No.: US20210127140A1
Publication date: 2021-04-29
Application No.: US17043925
Filing date: 2019-03-29
Applicant: Nokia Technologies Oy
Inventor: Miska HANNUKSELA , Jani LAINEMA , Francesco CRICRI
IPC: H04N19/85 , H04N19/172 , H04N19/176 , H04N19/124 , G06N3/08
Abstract: A method comprising: obtaining a block of a picture or a picture in an encoder; determining if the block/picture is used for on-line learning; if affirmative, encoding the block/picture; reconstructing a coarse version of the block/picture or the respective prediction error block/picture; enhancing the coarse version using a neural net; fine-tuning the neural net with a training signal based on the coarse version; determining if the block/picture is enhanced using the neural net; and if affirmative, encoding the block/picture with enhancing using the neural net.
-
Publication No.: US20180310114A1
Publication date: 2018-10-25
Application No.: US15767458
Filing date: 2016-10-11
Applicant: Nokia Technologies Oy
Inventor: Antti ERONEN , Jussi LEPPANEN , Arto LEHTINIEMI , Matti HAMALAINEN , Sujeet MATE , Francesco CRICRI , Mikko-Ville LAITINEN , Mikko TAMMI , Ville-Veikko MATTILA
CPC classification number: H04S7/303 , H04R1/406 , H04R3/005 , H04R5/027 , H04R2430/23 , H04S2400/01 , H04S2400/11 , H04S2400/15 , H04S2420/01
Abstract: Apparatus including a processor configured to: receive a spatial audio signal associated with a microphone array configured to provide spatial audio capture and at least one additional audio signal associated with an additional microphone, the at least one additional microphone signal having been delayed by a variable delay determined such that the audio signals are time aligned; receive a relative position between a first position associated with the microphone array and a second position associated with the additional microphone; generate at least two output audio channel signals by processing and mixing the spatial audio signal and the at least one additional audio signal based on the relative position between the first position and the second position such that the at least two output audio channel signals present an augmented audio scene.
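Note: the abstract states that the additional microphone signal is delayed so that the audio signals are time aligned, without saying how the delay is found. The sketch below assumes a brute-force cross-correlation search over integer sample lags, a common choice but not one confirmed by the publication; `estimate_delay` and `shift` are hypothetical helper names.

```python
def estimate_delay(reference, other, max_lag):
    """Return the lag (in samples) at which `other` best matches `reference`.

    A positive result means `other` trails `reference` by that many samples.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i, x in enumerate(reference):
            j = i + lag
            if 0 <= j < len(other):
                score += x * other[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def shift(signal, lag):
    """Advance `signal` by `lag` samples (negative lag delays it), zero-padded."""
    n = len(signal)
    lag = max(-n, min(n, lag))
    if lag >= 0:
        return list(signal[lag:]) + [0.0] * lag
    return [0.0] * (-lag) + list(signal[:n + lag])


# Hypothetical data: `other` is `reference` delayed by two samples.
reference = [0, 0, 1, 2, 1, 0, 0, 0]
other = [0, 0, 0, 0, 1, 2, 1, 0]
lag = estimate_delay(reference, other, max_lag=3)
aligned = shift(other, lag)
```

Because the delay in the abstract is variable, a real implementation would presumably re-estimate the lag over time, for example per frame, rather than once for the whole signal.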