-
1.
公开(公告)号:US20240265240A1
公开(公告)日:2024-08-08
申请号:US18567736
申请日:2022-06-17
Applicant: Nokia Technologies Oy
Inventor: Honglei ZHANG , Francesco CRICRÌ , Ramin GHAZNAVI YOUVALARI , Hamed REZAZADEGAN TAVAKOLI , Nannan ZOU , Vinod Kumar MALAMAL VADAKITAL , Miska Matias HANNUKSELA , Yat Hong LAM , Jani LAINEMA , Emre Baris AKSU
IPC: G06N3/0455
CPC classification number: G06N3/0455
Abstract: An example apparatus includes at least one processor; and at least one non-transitory memory comprising computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to perform; learn importance of one or more parameters by using a training dataset; define one or more masks for indicating the importance of the one or more parameters for a model finetuning; share at least one mask of the one or more masks with at least one of an encoder or a decoder; finetune at least one parameter of the one or more parameters based at least on the at least one mask; send or signal one or more weight updates corresponding to the at least one parameter in a bitstream to the decoder.
-
2.
公开(公告)号:US20240121387A1
公开(公告)日:2024-04-11
申请号:US18470973
申请日:2023-09-20
Applicant: Nokia Technologies Oy
Inventor: Jani LAINEMA , Francesco CRICRÌ , Ramin GHAZNAVI YOUVALARI , Miska Matias HANNUKSELA , Honglei ZHANG , Maria Claudia SANTAMARIA GOMEZ , Ruiying YANG
IPC: H04N19/117 , H04N19/103 , H04N19/136 , H04N19/157 , H04N19/172 , H04N19/176 , H04N19/182 , H04N19/42 , H04N19/80
CPC classification number: H04N19/117 , H04N19/103 , H04N19/136 , H04N19/157 , H04N19/172 , H04N19/176 , H04N19/182 , H04N19/42 , H04N19/80
Abstract: An example method includes: receiving an input extended block comprising an input block and input margins, wherein the input block is derived from an image or a video frame, and wherein the input block and the input margins are input to a filter, and wherein the input margins comprise pixels of the image or the video frame; filtering the input extended block to obtain a filtered extended block comprising a filtered block and filtered margins; receiving other blocks, wherein the other blocks are derived from the image or video frame, and wherein at least one pixel of the other blocks overlaps with at least one pixel of the filtered margins of the filtered extended block; and blending the filtered extended block with the other blocks, wherein an operation to blend the filtered extended block with the other blocks is applied to two or more overlapping pixels
-
3.
公开(公告)号:US20230269387A1
公开(公告)日:2023-08-24
申请号:US18001987
申请日:2021-06-11
Applicant: Nokia Technologies Oy
Inventor: Francesco CRICRÌ , Hamed REZAZADEGAN TAVAKOLI , Honglei ZHANG , Nannan ZOU
IPC: H04N19/42 , H04N19/124 , H04N19/176 , G06N3/0455 , G06N3/0985
CPC classification number: H04N19/42 , G06N3/0455 , G06N3/0985 , H04N19/124 , H04N19/176
Abstract: In example embodiments, an apparatus, a method, and a computer program product are provided. An example apparatus include processing circuitry; and at least one memory including computer program code, the at least one memory and the computer program code configured to, with the processing circuitry, cause the apparatus at least to: overfit a neural network on each media item, from a batch of media items, for a number of iterations to obtain an overfitted neural network model for the each media item; evaluate the overfitted neural network model on the each media item to obtain evaluation errors; and update parameters of the neural network to be based on the evaluation errors.
-
公开(公告)号:US20240357104A1
公开(公告)日:2024-10-24
申请号:US18640520
申请日:2024-04-19
Applicant: Nokia Technologies Oy
Inventor: Honglei ZHANG , Francesco CRICRÌ , Alireza AMINLOU , Miska Matias HANNUKSELA , Nam Hai LE , Jukka Ilari AHONEN , Hamed REZAZADEGAN TAVAKOLI
IPC: H04N19/119 , H04N19/167 , H04N19/176
CPC classification number: H04N19/119 , H04N19/167 , H04N19/176
Abstract: Various embodiments describe an apparatus, a method, and a computer program product. An example apparatus includes at least one processor; and at least one memory storing instructions that, when executed by the at least one processor, cause the apparatus at least to perform: encoding an input picture by using a first encoder or first encoding parameters; encoding the input picture by using a second encoder or second encoding parameters; generating a first reconstructed picture based on the encoding of the input picture by using the first encoder or the first encoding parameters; and generating a second reconstructed picture based on the encoding of the input picture by using the second encoder or the second encoding parameters.
-
公开(公告)号:US20240249514A1
公开(公告)日:2024-07-25
申请号:US18560430
申请日:2022-05-13
Applicant: Nokia Technologies Oy
Inventor: Jani LAINEMA , Francesco CRICRÌ , Honglei ZHANG , Hamed REZAZADEGAN TAVAKOLI , Yat Hong LAM , Miska Matias HANNUKSELA , Nannan ZOU
IPC: G06V10/82 , G06V10/771 , H04N19/117 , H04N19/159 , H04N19/172 , H04N19/70 , H04N19/82
CPC classification number: G06V10/82 , G06V10/771 , H04N19/117 , H04N19/159 , H04N19/172 , H04N19/70 , H04N19/82
Abstract: Various embodiments provide an apparatus, a method, and a computer program product. The apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to perform; train or finetune one or more additional parameters of at least one neural network (NN) or a portion of the at least one NN, wherein the one or more additional parameters comprise one or more scaling parameters; and encode or decode one or more media elements based on the at least one neural network or a portion of the at least one NN comprising the trained or finetuned one or more additional parameters.
-
公开(公告)号:US20230209092A1
公开(公告)日:2023-06-29
申请号:US17996040
申请日:2021-04-13
Applicant: Nokia Technologies Oy
IPC: H04N19/70 , H04N19/124 , H04N19/42
CPC classification number: H04N19/70 , H04N19/42 , H04N19/124
Abstract: In example embodiments, an apparatus, a method, and a computer program product are provided. The apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to perform: encode or decode a high-level bitstream syntax for at least one neural network; wherein the high-level bitstream syntax comprises at least one information unit, wherein the at least one information unit comprises syntax definitions for the at least one neural network or a portion of the at least one neural network; and wherein a serialized bitstream comprises one or more of the at least one information units.
-
公开(公告)号:US20220256227A1
公开(公告)日:2022-08-11
申请号:US17649915
申请日:2022-02-03
Applicant: Nokia Technologies Oy
IPC: H04N21/435 , H04N21/44
Abstract: An example method is provided to include receiving a media bitstream comprising one or more media units and a first enhancement information message, wherein the first enhancement information message comprises at least two independently parsable structures, a first independently parsable structure comprising information about at least one purpose of one or more neural networks (NNs) to be applied to the one or more media units, and a second independently parsable structure comprising or identifying one or more neural networks; decoding the one or more media units; and using the one or more neural networks to enhance or filter one or more frames of the decoded the one or more media units, based on the at least one purpose. An example method includes. Corresponding apparatuses and computer program products are also provided.
-
公开(公告)号:US20240289590A1
公开(公告)日:2024-08-29
申请号:US18572100
申请日:2022-06-16
Applicant: Nokia Technologies Oy
Inventor: Francesco CRICRÌ , Nannan ZOU , Honglei ZHANG , Hamed REZAZADEGAN TAVAKOLI
IPC: G06N3/045
CPC classification number: G06N3/045
Abstract: Various embodiments provide a method, an apparatus, and computer program product. The method comprising: defining an attention block comprising: a set of initial neural network layers, wherein each layer is caused to process an output of a previous layer, and wherein a first layer processes an input of a dense split attention block; core attention blocks process one or more outputs of the set of initial neural network layers; a concatenation block for concatenating one or more outputs of the core attention blocks and at least one intermediate output of the set of initial neural network layers; one or more final neural network layers process at least the output of the concatenation block; and a summation block caused to sum an output of the final neural network layers and an input to the attention block; and providing an output of the summation block as a final output of the attention block.
-
9.
公开(公告)号:US20240202507A1
公开(公告)日:2024-06-20
申请号:US18555479
申请日:2022-04-15
Applicant: Nokia Technologies Oy
Inventor: Francesco CRICRÌ , Jani LAINEMA , Ramin GHAZNAVI YOUVALARI , Honglei ZHANG , Yat Hong LAM , Maria Claudia SANTAMARIA GOMEZ , Hamed REZAZADEGAN TAVAKOLI , Miska Matias HANNUKSELA
Abstract: An apparatus with a corresponding method and computer program product are provided. The apparatus includes at least one processor; and at least one non-transitory memory including computer program code: wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to perform the steps (1600) of train or finetune at least one neural network (NN) based at least on a temporal persistence scope; and encode or decode one or more media frames elements based at least on the trained or finetuned at least one neural network. A further apparatus with a corresponding method and computer program product are provided. The further apparatus configured to carry out the steps (1700) of receive a weight-update prediction error from an encoder-side, predict a weight-update based on one or more reference weight updates, and a prediction function or algorithm, and reconstruct a weight update by combining the predicted weight-update and the prediction error.
-
公开(公告)号:US20240013046A1
公开(公告)日:2024-01-11
申请号:US18247200
申请日:2021-09-02
Applicant: Nokia Technologies Oy
Inventor: Nam LE , Francesco CRICRÌ , Honglei ZHANG , Hamed REZAZADEGAN TAVAKOLI , Ramin GHAZNAVI YOUVALARI
IPC: G06N3/08
CPC classification number: G06N3/08
Abstract: A method is provided for computing predetermined loss terms based on original data and decoded data; training one or more neural networks of a system by using the predetermined loss terms; updating weights for one or more of other loss terms; and determining trade-offs between predetermined objectives of the system. Corresponding apparatuses and computer program products are also provided.
-
-
-
-
-
-
-
-
-