-
Publication No.: US20170337720A1
Publication date: 2017-11-23
Application No.: US15591540
Filing date: 2017-05-10
Applicant: Nokia Technologies Oy
Inventor: Francesco CRICRI , Jukka Pentti Paivio SAARINEN
IPC: G06T11/60 , G06F3/01 , G06K9/00 , G06F3/0484
CPC classification number: G06T11/60 , G06F3/011 , G06F3/012 , G06F3/013 , G06F3/04842 , G06K9/00711 , G06K2009/00738 , G06T2210/22
Abstract: A method comprises providing video data representing at least part of virtual space to a user for viewing, identifying a current viewed sector of the virtual space based on user position, determining a sub-portion of said viewing sector, identifying an event occurring in a non-viewed sector of the virtual space, and displaying content indicative of the event in the sub-portion of said current viewing sector. The displaying step may comprise displaying a graphical notification of the event in the sub-portion, or in alternative embodiments, displaying video data showing the event in the sub-portion.
-
Publication No.: US20220164652A1
Publication date: 2022-05-26
Application No.: US17431012
Filing date: 2020-01-29
Applicant: Nokia Technologies Oy
Inventor: Caglar AYTEKIN , Francesco CRICRI
IPC: G06N3/08
Abstract: There is provided an apparatus comprising means for training a neural network, wherein the training comprises applying a loss function configured to increase sparsity of a weight tensor of the neural network and to cause a plurality of non-zero elements of the weight tensor to be substantially equal to each other; and means for entropy coding the weight tensor to obtain a compressed neural network.
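The link between the training objective and entropy coding in the abstract above can be illustrated with a small calculation. The sketch below is not the patented loss function; it only compares the empirical entropy of two made-up weight lists, on the assumption that each distinct weight value is treated as one symbol by the entropy coder. The function name and the sample values are hypothetical.

```python
from collections import Counter
from math import log2
from typing import Sequence


def empirical_entropy_bits(symbols: Sequence[float]) -> float:
    """Shannon entropy (bits per symbol) of the observed value distribution."""
    counts = Counter(symbols)
    total = len(symbols)
    return -sum((n / total) * log2(n / total) for n in counts.values())


# Hypothetical flattened weight tensors (illustrative values only).
dense_weights = [0.31, -0.12, 0.27, 0.05, -0.40, 0.18, 0.09, -0.22]
# Sparse tensor whose non-zero elements are all equal to each other.
sparse_equal_weights = [0.0, 0.0, 0.25, 0.0, 0.0, 0.25, 0.0, 0.0]

dense_bits = empirical_entropy_bits(dense_weights)
sparse_bits = empirical_entropy_bits(sparse_equal_weights)
```

With eight distinct values the dense list needs 3 bits per symbol, while the sparse list with a single shared non-zero value needs less than 1 bit per symbol, which is why such a tensor is cheaper to entropy code.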
-
Publication No.: US20210168395A1
Publication date: 2021-06-03
Application No.: US16973620
Filing date: 2019-07-08
Applicant: Nokia Technologies Oy
Inventor: Francesco CRICRI , Antti HALLAPURO , Miska HANNUKSELA , Jani LAINEMA , Emre AKSU , Caglar AYTEKIN , Ramin GHAZNAVI YOUVALARI
IPC: H04N19/50 , H04N19/172 , H04N19/154 , H04N19/14 , H04N19/196 , H04N19/105 , G06N3/04 , G06N3/08
Abstract: An apparatus, a method and a computer program product are described comprising: obtaining or receiving video data; providing a current frame and/or one or more previous frames of the obtained or received video data to an input of a neural network; generating a predicted output at an output of the neural network, wherein the predicted output comprises at least one of one or more predicted future frames of the video data and predicted properties of one or more future frames of the video data; determining one or more processing decisions based, at least in part, on the predicted output; and processing the current frame of the video data at least partially according to the one or more processing decisions.
-
Publication No.: US20190012581A1
Publication date: 2019-01-10
Application No.: US16017742
Filing date: 2018-06-25
Applicant: Nokia Technologies Oy
Inventor: Mikko HONKALA , Francesco CRICRI , Xingyang NI
Abstract: The invention relates to a method comprising receiving a set of input samples, said set of input samples comprising real images and generated images; extracting a set of feature maps from multiple layers of a pre-trained neural network for both the real images and the generated images; determining statistics for each feature map of the set of feature maps; comparing statistics of the feature maps for the real images to statistics of the feature maps for the generated images by using a distance function to obtain a vector of distances; and averaging the distances of the vector of distances to obtain a value indicating a diversity of the generated images. The invention also relates to technical equipment for implementing the method.
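The statistics, distance, and averaging steps of the abstract above can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: the feature maps are assumed to be already extracted and flattened into lists of numbers, the per-map statistics are assumed to be mean and standard deviation, and the distance function is assumed to be Euclidean. All function names are hypothetical.

```python
from statistics import fmean, pstdev
from typing import List, Sequence, Tuple

FeatureMap = Sequence[float]  # one flattened feature map (assumed pre-extracted)


def map_statistics(feature_map: FeatureMap) -> Tuple[float, float]:
    """Assumed statistics for one feature map: mean and population std."""
    return (fmean(feature_map), pstdev(feature_map))


def euclidean(a: Sequence[float], b: Sequence[float]) -> float:
    """Assumed distance function between two statistics tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def diversity_score(real_maps: List[FeatureMap],
                    generated_maps: List[FeatureMap]) -> float:
    """Average the per-feature-map distances between real and generated statistics."""
    distances = [
        euclidean(map_statistics(r), map_statistics(g))
        for r, g in zip(real_maps, generated_maps)  # maps aligned by index
    ]
    return fmean(distances)
```

For example, `diversity_score([[0.0, 0.0]], [[1.0, 1.0]])` compares statistics (0, 0) with (1, 0) and returns 1.0, while identical inputs give 0.0.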
-
Publication No.: US20180336702A1
Publication date: 2018-11-22
Application No.: US15951976
Filing date: 2018-04-12
Applicant: Nokia Technologies Oy
Inventor: Francesco CRICRI , Miika TUPALA
IPC: G06T7/73 , H04N13/243 , G10L25/57 , G10L25/30
CPC classification number: G06T7/75 , G06K9/00771 , G06K9/00899 , G10L25/30 , G10L25/57 , H04N5/225 , H04N7/15 , H04N13/243 , H04N2013/0085
Abstract: A method for analyzing a presence of objects within a space provided with a capturing system comprising a plurality of camera devices and a playback system for reproducing audio and/or visual signals in the space, the method comprising obtaining a first 3D volumetric representation of a scene within the space, generated on the basis of input streams of at least a first and a second camera device, said first 3D volumetric representation showing at least one object within the scene; sending probe signals to a processing unit; controlling the processing unit to reproduce, using the playback system, one or more audio and/or visual signals on the basis of the probe signals into the space; controlling the processing unit to capture a second 3D volumetric representation of the scene including reproductions of the one or more audio and/or visual signals within the space; and analyzing whether the reproductions of the one or more audio and/or visual signals captured within the space correspond to a presumed location of the at least one object shown in the first 3D volumetric representation.
-
Publication No.: US20180276476A1
Publication date: 2018-09-27
Application No.: US15918881
Filing date: 2018-03-12
Applicant: Nokia Technologies Oy
Inventor: Antti ERONEN , Francesco CRICRI , Arto LEHTINIEMI , Jussi LEPPÄNEN , Juha ARRASVUORI
IPC: G06K9/00 , H04N21/439 , G02B27/01 , G06T19/00
Abstract: A method comprising: rendering a first media scene based upon media content provided by a content-rendering application via one or more rendering devices worn by the user; determining a priority for an event that occurs near the user, the event being independent of the content-rendering application; and automatically modifying the rendered first media scene, to render a modified second media scene based at least in part upon media content provided by the content-rendering application and at least in part upon other media content associated with the event.
-
Publication No.: US20180007486A1
Publication date: 2018-01-04
Application No.: US15621243
Filing date: 2017-06-13
Applicant: Nokia Technologies Oy
Inventor: Francesco CRICRI , Jukka SAARINEN
CPC classification number: H04S7/301 , G06F3/011 , G06F3/017 , G06F3/165 , H04R1/326 , H04R5/027 , H04R2420/07 , H04S2400/11 , H04S2400/15
Abstract: This specification describes a method comprising determining whether an estimated position of an audio capture device which captures audio data is within boundaries of a predetermined area, and in response to a determination that the estimated position is not within the boundaries of the predetermined area, associating the captured audio data with an adjusted position.
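The boundary check and position adjustment described in the abstract above can be sketched as follows. The abstract does not say how the adjusted position is chosen; the sketch assumes a rectangular predetermined area and assumes the adjusted position is the nearest point inside that rectangle. The `Area` class and `position_for_audio` function are hypothetical names.

```python
from dataclasses import dataclass
from typing import Tuple

Point = Tuple[float, float]


@dataclass(frozen=True)
class Area:
    """Assumed axis-aligned rectangular predetermined area."""
    x_min: float
    y_min: float
    x_max: float
    y_max: float

    def contains(self, x: float, y: float) -> bool:
        return self.x_min <= x <= self.x_max and self.y_min <= y <= self.y_max


def position_for_audio(estimated: Point, area: Area) -> Point:
    """Return the position to associate with the captured audio data."""
    x, y = estimated
    if area.contains(x, y):
        return estimated  # estimate is plausible, keep it
    # Assumption: adjust by clamping to the nearest point within the area.
    return (min(max(x, area.x_min), area.x_max),
            min(max(y, area.y_min), area.y_max))
```

For a stage defined as `Area(0, 0, 10, 5)`, an estimate of `(3, 4)` is kept unchanged, while an estimate of `(12, -1)` is adjusted to `(10, 0)`.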
-
Publication No.: US20230110503A1
Publication date: 2023-04-13
Application No.: US17759550
Filing date: 2021-02-04
Applicant: Nokia Technologies Oy
Inventor: Jani LAINEMA , Emre Baris AKSU , Miska Matias HANNUKSELA , Alireza ZARE , Francesco CRICRI
IPC: H04N19/42 , H04N19/176 , H04N19/172 , H04N19/463 , H04N19/117 , G06N20/00
Abstract: The embodiments relate to a method for encoding and decoding, wherein the method for encoding comprises receiving an input block of a video frame for encoding; applying at least a learning-based model (702) for said input block as a processing step for encoding the block; combining (703) an output of the learning-based model with one or more data sources (712, 713) by a combination process; encoding the block to a bitstream (40); using a result of the combination process as additional input for the learning-based model for encoding a subsequent block; and encoding to the bitstream combination information (720) used in the combination process, said combination information comprising at least one or more combination parameters. The embodiments also relate to technical equipment for implementing the methods.
-
Publication No.: US20230062752A1
Publication date: 2023-03-02
Application No.: US17760017
Filing date: 2021-02-12
Applicant: Nokia Technologies Oy
Inventor: Jani LAINEMA , Francesco CRICRI , Emre Baris AKSU , Alireza ZARE , Miska Matias HANNUKSELA
IPC: H04N19/149 , H04N19/159 , H04N19/176 , G06N3/04
Abstract: The embodiments relate to a method for encoding and decoding, and apparatuses for the same. The method for encoding comprises receiving a block of a video frame for encoding (1510); making a decision on whether or not a learning-based model is to be applied as a processing step for encoding the block (1520); applying the learning-based model for said input block according to the decision, where the learning-based model has been selectively fine-tuned according to information relating to activation of the learning-based model of previously-decoded blocks (1530); encoding a signal corresponding to the decision on usage of the learning-based model into a bitstream (1540); and encoding the block into a bitstream with information on whether the block is to be used for fine-tuning (1550).
-
Publication No.: US20220164995A1
Publication date: 2022-05-26
Application No.: US17430987
Filing date: 2020-01-29
Applicant: Nokia Technologies Oy
Inventor: Caglar AYTEKIN , Francesco CRICRI , Mikko HONKALA
IPC: G06T9/00 , H04N19/15 , H04N19/132 , H04N19/196 , G06N3/08
Abstract: The embodiments relate to a method comprising compressing input data (I) by means of at least a neural network (E, 310); determining a compression rate for data compression; running the neural network (E, 310) with the input data (I) to produce an output data (c); removing a number of elements from the output data (c) according to the compression rate to result in a reduced form of the output data (me); and providing the reduced form of the output data (me) and the compression rate to a decoder (D, 320). The embodiments also relate to a method comprising receiving input data (me) for decompression; decompressing the input data (me) by means of at least a neural network (D, 320); determining a decompression rate for decompressing the input data (me); running the neural network (D, 320) with input data (me) to produce a decompressed output data (ï); padding a number of elements to the compressed input data (me) according to the decompression rate to produce an output data (ï); and providing the output data (ï).
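The element-removal and padding steps of the abstract above can be sketched as follows, leaving out the encoder and decoder neural networks entirely. The sketch rests on several assumptions not stated in the abstract: the compression rate is taken to be the fraction of elements removed, elements are removed from the end of the code, padding uses zeros, and the decoder knows the full code length. The function names are hypothetical.

```python
from typing import List, Sequence


def reduce_code(code: Sequence[float], rate: float) -> List[float]:
    """Encoder side: drop trailing elements of the encoder output.

    `rate` is assumed to be the fraction of elements to remove (0.0 to 1.0).
    """
    removed = int(rate * len(code))
    return list(code[:len(code) - removed])


def pad_code(reduced: Sequence[float], code_length: int) -> List[float]:
    """Decoder side: zero-pad the reduced code back to the assumed full length."""
    missing = code_length - len(reduced)
    return list(reduced) + [0.0] * missing


# Hypothetical encoder output with four elements, half of them removed.
code = [1.0, 2.0, 3.0, 4.0]
reduced = reduce_code(code, rate=0.5)
restored = pad_code(reduced, code_length=len(code))
```

Here `reduced` is `[1.0, 2.0]` and `restored` is `[1.0, 2.0, 0.0, 0.0]`; in the described method the padded code would then be passed to the decoder network.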
-