-
公开(公告)号:US11997296B2
公开(公告)日:2024-05-28
申请号:US17571175
申请日:2022-01-07
Applicant: Huawei Technologies Co., Ltd.
Inventor: Timofey Mikhailovich Solovyev , Roman Igorevich Chernyak , Alexander Alexandrovich Karabutov , Sergey Yurievich Ikonin
IPC: H04N19/176 , H04N19/433 , H04N19/52 , H04N19/96
CPC classification number: H04N19/433 , H04N19/52 , H04N19/96
Abstract: A method for deriving motion information for a current block includes: determining whether the current block and a top neighboring block of the current block are in a same coding tree unit (CTU) or not; obtaining a position of a spatial motion information candidate for the current block; obtaining motion information corresponding to the position of the spatial motion information candidate, depending on the result of whether the current block and top neighboring block of the current block are in the same CTU; adding the obtained motion information to a motion information candidate list for the current block; obtaining a value of a first index of a motion information candidate; and deriving motion information of the current block based on the motion information candidate list and the value of the first index.
-
公开(公告)号:US11924457B2
公开(公告)日:2024-03-05
申请号:US17407548
申请日:2021-08-20
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Vasily Alexeevich Rufitskiy , Timofey Mikhailovich Solovyev , Alexey Konstantinovich Filippov , Jianle Chen
IPC: H04N19/513 , H04N19/159 , H04N19/176 , H04N19/186 , H04N19/30
CPC classification number: H04N19/513 , H04N19/159 , H04N19/176 , H04N19/186 , H04N19/30
Abstract: The present disclosure relates to a method and apparatus for motion vector derivation for affine based inter prediction of chroma subblocks based on a chroma format. The method includes: determining chroma scaling factors in horizontal and vertical directions based on chroma format information, wherein the chroma format information indicates a chroma format of a current picture which the current image block belongs to; determining a set of luma subblocks of the luma block based on values of the chroma scaling factors; and determining a motion vector for a chroma subblock of the co-located chroma block based on motion vectors of one or more luma subblocks in the set of luma subblocks.
-
公开(公告)号:US20240048756A1
公开(公告)日:2024-02-08
申请号:US18446265
申请日:2023-08-08
Inventor: Timofey Mikhailovich Solovyev , Fabian Brand , Jürgen Seiler , Andre Kaup , Elena Alexandrovna Alshina
IPC: H04N19/513 , H04N19/543
CPC classification number: H04N19/521 , H04N19/543
Abstract: A sparse motion field exchanges motion information between encoder and decoder. Side information indicates a suitable interpolation approach for interpolating the motion vectors of the dense motion field different from the signaled sparse motion field motion vectors. The side information is provided on a segment basis. The segments are derived at the encoder side and the decoder side similarly, based on the reference images and the sparse motion field.
-
公开(公告)号:US11863748B2
公开(公告)日:2024-01-02
申请号:US17547169
申请日:2021-12-09
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Sergey Yurievich Ikonin , Roman Igorevich Chernyak , Timofey Mikhailovich Solovyev , Alexander Alexandrovich Karabutov , Elena Alexandrovna Alshina
IPC: H04N19/124 , H04N19/186 , H04N19/196
CPC classification number: H04N19/124 , H04N19/186 , H04N19/196
Abstract: A method of obtaining quantization parameter (QP) for chrominance components based on QP for luminance component, wherein the method is performed by a decoder or encoder, and wherein the method comprises obtaining a luminance QP for a luma component of a coding unit; obtaining a QP index (QPi) based at least in a part on the luminance QP; and obtaining a chrominance QP (QPc) for a chroma component of the coding unit based on the QP index by using a preset lookup table, wherein the maximum difference between the QP index and the chrominance QP is equal to 3.
-
公开(公告)号:US20230353764A1
公开(公告)日:2023-11-02
申请号:US18340704
申请日:2023-06-23
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Sergey Yurievich Ikonin , Mikhail Vyacheslavovich Sosulnikov , Alexander Alexandrovich Karabutov , Timofey Mikhailovich Solovyev , Biao Wang , Elena Alexandrovna Alshina
IPC: H04N19/139 , H04N19/59 , H04N19/70 , H04N19/33
CPC classification number: H04N19/33 , H04N19/139 , H04N19/59 , H04N19/70
Abstract: A method and apparatus for decoding data for still or video processing into a bitstream are provided. In particular, two or more sets of feature map elements are obtained from the bitstream. Each set of feature map elements relates to a feature map. Each of the two or more sets of feature map elements is then respectively inputted into two or more feature map processing layers out of a plurality of cascaded layers. The decoded data for picture or video processing is then obtained as a result of the processing by the plurality of cascaded layers. According to the present disclosure, the data may be decoded from the bitstream in an efficient manner in the layered structure.
-
公开(公告)号:US20230336759A1
公开(公告)日:2023-10-19
申请号:US18339783
申请日:2023-06-22
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Sergey Yurievich Ikonin , Mikhail Vyacheslavovich Sosulnikov , Alexander Alexandrovich Karabutov , Timofey Mikhailovich Solovyev , Biao Wang , Elena Alexandrovna Alshina
IPC: H04N19/46 , H04N19/132
CPC classification number: H04N19/46 , H04N19/132
Abstract: The present disclosure relates to methods and apparatuses for decoding data for (still or video processing into a bitstream). Two or more sets of segmentation information elements are obtained from the bitstream. Then, each of the two or more sets of segmentation information elements are inputted respectively into two or more segmentation information processing layers out of a plurality of cascaded layers. In each of the two or more segmentation information processing layers, the respective sets of segmentation information are processed. The decoded data for picture or video processing are obtained based on the segmentation information processed by the plurality of cascaded layers. Accordingly, the data may be decoded from the bitstream in an efficient manner in the layered structure.
-
公开(公告)号:US20230336758A1
公开(公告)日:2023-10-19
申请号:US18339772
申请日:2023-06-22
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Sergey Yurievich Ikonin , Mikhail Vyacheslavovich Sosulnikov , Alexander Alexandrovich Karabutov , Timofey Mikhailovich Solovyev , Biao Wang , Elena Alexandrovna Alshina
IPC: H04N19/46 , H04N19/132
CPC classification number: H04N19/46 , H04N19/132
Abstract: The present disclosure relates to methods and apparatuses for encoding data for (still or video processing into a bitstream). In particular, the data are processed by a network which includes a plurality of cascaded layers. In the processing, feature maps are generated by the layers. The feature maps processed (output) by at least two different layers have different resolutions. In the processing, a layer is selected, out of the cascaded layers, which is different from the layer generating the feature map of the lowest resolution (e.g. latent space). The bitstream includes information related to the selected layer. With this approach, scalable processing which may operate on different resolutions is provided so that the bitstream may convey information relating to such different resolutions. Accordingly, the data may be efficiently coded within the bitstream, depending on the resolution which may vary depending on the content of the picture data coded.
-
公开(公告)号:US20220103822A1
公开(公告)日:2022-03-31
申请号:US17547169
申请日:2021-12-09
Applicant: HUAWEI TECHNOLOGIES CO.,LTD.
Inventor: Sergey Yurievich Ikonin , Roman Igorevich Chernyak , Timofey Mikhailovich Solovyev , Alexander Alexandrovich Karabutov , Elena Alexandrovna Alshina
IPC: H04N19/124 , H04N19/186 , H04N19/196
Abstract: A method of obtaining quantization parameter (QP) for chrominance components based on QP for luminance component, wherein the method is performed by a decoder or encoder, and wherein the method comprises obtaining a luminance QP for a luma component of a coding unit; obtaining a QP index (QPi) based at least in a part on the luminance QP; and obtaining a chrominance QP (QPc) for a chroma component of the coding unit based on the QP index by using a preset lookup table, wherein the maximum difference between the QP index and the chrominance QP is equal to 3.
-
公开(公告)号:US20240414361A1
公开(公告)日:2024-12-12
申请号:US18744323
申请日:2024-06-14
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Georgii Petrovich Gaikov , Sergey Yurievich Ikonin , Ahmet Burakhan Koyuncu , Alexander Alexandrovich Karabutov , Timofey Mikhailovich Solovyev , Elena Alexandrovna Alshina
Abstract: A method of processing a current object is provided. A set of input data tensors representing the current object are inputted into a first neural layer of a transformer based neural network. Based on information about processing the current object, at least one auxiliary data tensor is inputted into the first neural layer or a second neural layer of the transformer based neural network, where the at least one auxiliary data tensor is different from each of the input data tensors of the set of input data tensors and represents at least one auxiliary input. The set of input data tensors are processed by the transformer based neural network using the at least one auxiliary data tensor in order to obtain a set of output data tensors.
-
公开(公告)号:US20240296594A1
公开(公告)日:2024-09-05
申请号:US18662752
申请日:2024-05-13
Inventor: Timofey Mikhailovich Solovyev , Fabian Brand , Jürgen Seiler , Andre Kaup , Elena Alexandrovna Alshina
IPC: G06T9/00 , H04N19/20 , H04N19/587
CPC classification number: G06T9/002 , H04N19/20 , H04N19/587
Abstract: This application provides methods and apparatuses for encoding image or video related data into a bitstream. The present disclosure may be applied in the field of artificial intelligence (AI)-based video or picture compression technologies, and in particular, to the field of neural network-based video compression technologies. A neural network (generalized difference) is applied to a signal and a predicted signal during the encoding to obtain a generalized residual. During the decoding another neural network (generalized sum) may be applied to a reconstructed generalized residual and the predicted signal to obtain a reconstructed signal.
-
-
-
-
-
-
-
-
-