Patent search ap:("HUAWEI TECHNOLOGIES CO. Page LTD.") AND inv:"Panqi Jia"

1.

发明公开
CONFIGURABLE POSITIONS FOR AUXILIARY INFORMATION INPUT INTO A PICTURE DATA PROCESSING NEURAL NETWORK 审中-公开

公开(公告)号：US20240037802A1

公开(公告)日：2024-02-01

申请号：US18479507

申请日：2023-10-02

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Timofey Mikhailovich Solovyev , Biao Wang , Elena Alexandrovna Alshina , Han Gao , Panqi Jia , Esin Koyuncu , Alexander Alexandrovich Karabutov , Mikhail Vyacheslavovich Sosulnikov , Semih Esenlik , Sergey Yurievich Ikonin

IPC: G06T9/00 , G06T3/40

CPC classification number: G06T9/002 , G06T3/4046

Abstract: This application provides methods and apparatuses for processing of picture data or picture feature data using a neural network with two or more layers. The present disclosure may be applied in the field of artificial intelligence (AI)-based video or picture compression technologies, and in particular, to the field of neural network-based video compression technologies. According to some embodiments, position within the neural network, at which auxiliary information may be entered for processing is selectable based on a gathering condition. The gathering condition may assess whether some prerequisite is fulfilled. Some of the advantages may include better performance in terms of rate and/or disclosure due to the effect of increased flexibility in neural network configurability.

2.

发明公开
DECODING AND ENCODING OF NEURAL-NETWORK-BASED BITSTREAMS 审中-公开

公开(公告)号：US20230336784A1

公开(公告)日：2023-10-19

申请号：US18336735

申请日：2023-06-16

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Semih Esenlik , Panqi Jia , Elena Alexandrovna Alshina

IPC: H04N19/132 , H04N19/70 , H04N19/119 , H04N19/42 , H04N19/167 , H04N19/172

CPC classification number: H04N19/70 , H04N19/119 , H04N19/132 , H04N19/167 , H04N19/172 , H04N19/42

Abstract: For picture decoding and encoding of neural-network-based bitstreams, a picture is represented by an input set of samples which is obtained from the bitstream. The picture is reconstructed from output subsets, which are generated as a result of processing the input set L. The input set is divided into multiple input subsets Li. The input subsets are each subject to processing with a neural network having one or more layers. The neural network uses as input multiple samples of an input subset and generates one sample of an output subset. By combining the output subsets, the picture is reconstructed. In particular, the size of at least one input subset is smaller than a size that is required to obtain the size of the respective output subset, after processing by the one or more layers of the neural network.

3.

发明申请
PARALLEL PROCESSING OF IMAGE REGIONS WITH NEURAL NETWORKS – DECODING, POST FILTERING, AND RDOQ 有权

公开(公告)号：US20250142066A1

公开(公告)日：2025-05-01

申请号：US19007327

申请日：2024-12-31

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Johannes Sauer , Panqi Jia , Elena Alexandrovna Alshina , Atanas Boev

IPC: H04N19/119 , H04N19/176 , H04N19/186

Abstract: The present disclosure relates to picture encoding and decoding of image regions on tile-basis. In particular, multiple components of an input tensor including a first and second component in spatial dimensions is processed within multiple pipelines. The processing of the first component includes dividing the first component in the spatial dimensions into a first plurality of tiles. Likewise, the processing of the second component includes dividing the second component in the spatial dimensions into a second plurality of tiles. The respective first and second plurality of tiles are then processed each separately. Among the first and second plurality of tiles there are at least two respective collocated tiles differing in size. In case of compression, the processing of the first and/or second component includes picture encoding, rate distortion optimization quantization, and picture filtering. In case of decompression, the processing includes picture decoding and picture filtering.

4.

发明授权
Decoding and encoding of neural-network-based bitstreams 有权

公开(公告)号：US12284390B2

公开(公告)日：2025-04-22

申请号：US18336735

申请日：2023-06-16

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Semih Esenlik , Panqi Jia , Elena Alexandrovna Alshina

IPC: H04N19/70 , H04N19/119 , H04N19/132 , H04N19/167 , H04N19/172 , H04N19/42

Abstract: For picture decoding and encoding of neural-network-based bitstreams, a picture is represented by an input set of samples which is obtained from the bitstream. The picture is reconstructed from output subsets, which are generated as a result of processing the input set L. The input set is divided into multiple input subsets Li. The input subsets are each subject to processing with a neural network having one or more layers. The neural network uses as input multiple samples of an input subset and generates one sample of an output subset. By combining the output subsets, the picture is reconstructed. In particular, the size of at least one input subset is smaller than a size that is required to obtain the size of the respective output subset, after processing by the one or more layers of the neural network.

5.

发明申请
PARALLEL PROCESSING OF IMAGE REGIONS WITH NEURAL NETWORKS – DECODING, POST FILTERING, AND RDOQ 有权

公开(公告)号：US20250142099A1

公开(公告)日：2025-05-01

申请号：US19007203

申请日：2024-12-31

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Johannes Sauer , Panqi Jia , Elena Alexandrovna Alshina , Atanas Boev

IPC: H04N19/436 , H04N19/119 , H04N19/172 , H04N19/70 , H04N19/80 , H04N19/85

Abstract: Neural-network-based picture encoding and decoding of image regions may be performed on a tile-basis. An input tensor representing picture data is processed by the neural network, which includes at least a first and second subnetwork. The first subnetwork is applied to a first tensor where the first tensor is divided in a spatial dimensions into a first plurality of tiles. The first tiles are then further processed by the first subnetwork. After application of the first subnetwork, the second subnetwork is applied to a second tensor where the second tensor is divided in the spatial dimensions into a second plurality of tiles. The second tiles are then further processed by the second subnetwork. Among the first and second plurality of tiles there are at least two respective collocated tiles differing in size.

6.

发明公开
Conditional Image Compression 审中-公开

公开(公告)号：US20240296593A1

公开(公告)日：2024-09-05

申请号：US18661245

申请日：2024-05-10

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Alexander Alexandrovich Karabutov , Panqi Jia , Atanas Boev , Han Gao , Biao Wang , Elena Alexandrovna Alshina , Johannes Sauer

IPC: G06T9/00 , H04N19/13 , H04N19/167 , H04N19/186

CPC classification number: G06T9/002 , H04N19/13 , H04N19/167 , H04N19/186

Abstract: A conditional coding of components of an image is described. A method of encoding at least a portion of an image is provided, which comprises encoding a primary component of the image independently from at least one secondary component and encoding the at least one secondary component of the image using information from the primary component. Further, it is provided a method of encoding at least a portion of an image, comprising providing a residual comprising a primary residual component for a primary component of the image and at least one secondary residual component for at least one secondary component of the image that is different from the primary component, encoding the primary residual component independently from the at least one secondary residual component and encoding the at least one secondary residual component using information from the primary residual component.

7.

发明公开
INDEPENDENT POSITIONING OF AUXILIARY INFORMATION IN NEURAL NETWORK BASED PICTURE PROCESSING 审中-公开

公开(公告)号：US20240161488A1

公开(公告)日：2024-05-16

申请号：US18479611

申请日：2023-10-02

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Timofey Mikhailovich Solovyev , Elena Alexandrovna Alshina , Biao Wang , Alexander Alexandrovich Karabutov , Mikhail Vyacheslavovich Sosulnikov , Georgy Petrovich Gaikov , Han Gao , Panqi Jia , Esin Koyuncu , Sergey Yurievich Ikonin , Semih Esenlik

IPC: G06V10/82 , G06V10/77 , G06V20/40 , H04N19/513 , H04N19/91

CPC classification number: G06V10/82 , G06V10/7715 , G06V20/46 , H04N19/521 , H04N19/91

Abstract: This application provides methods and apparatuses for processing of picture data or picture feature data using a neural network with two or more layers. The present disclosure may be applied in the field of artificial intelligence (AI)-based video or picture compression technologies, and in particular, to the field of neural network-based video compression technologies. According to some embodiments, two kinds of data are combined during the processing including processing by the neural network. The two kinds of data are obtained from different stages of processing by the network. Some of the advantages may include greater scalability and a more flexible design of the neural network architecture which may further lead to better encoding/decoding performance.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification