Patent search ap:("QUALCOMM Incorporated") AND inv:"Taco Sebastiaan COHEN" Page 1

1.

Invention publication
NEURAL IMAGE COMPRESSION WITH CONTROLLABLE SPATIAL BIT ALLOCATION (status: under examination, published)

Publication (announcement) number: US20230156207A1

Publication (announcement) date: 2023-05-18

Application number: US17987844

Application date: 2022-11-15

Applicant: QUALCOMM Incorporated

Inventor: Yang YANG, Hoang Cong Minh LE, Yinhao ZHU, Reza POURREZA, Amir SAID, Yizhe ZHANG, Taco Sebastiaan COHEN

IPC: H04N19/436, H04N19/124, H04N19/147, H04N19/17, H04N19/119

CPC classification number: H04N19/436, H04N19/124, H04N19/147, H04N19/17, H04N19/119

Abstract: A processor-implemented method for image compression using an artificial neural network (ANN) includes receiving, at an encoder of the ANN, an image and a spatial segmentation map corresponding to the image. The spatial segmentation map indicates one or more regions of interest. The encoder compresses the image according to a controllable spatial bit allocation. The controllable spatial bit allocation is based on a learned quantization bin size.
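
The idea of spending more bits inside regions of interest can be illustrated with a minimal sketch, assuming a uniform scalar quantizer whose bin size depends on the segmentation label. The function name `quantize_with_roi`, the toy latent values, and the fixed bin sizes are all hypothetical; the abstract describes a learned bin size, which is not modelled here.

```python
def quantize_with_roi(latent, seg_map, bin_sizes):
    """Quantize each latent value with the bin size assigned to its region.

    latent    : 2-D list of floats (a toy stand-in for an encoder output)
    seg_map   : 2-D list of region labels, same shape as latent
    bin_sizes : dict mapping region label -> quantization bin size
    """
    indices, recon = [], []
    for lat_row, seg_row in zip(latent, seg_map):
        # A smaller bin size gives finer quantization, hence more bits.
        idx_row = [round(v / bin_sizes[r]) for v, r in zip(lat_row, seg_row)]
        indices.append(idx_row)
        recon.append([i * bin_sizes[r] for i, r in zip(idx_row, seg_row)])
    return indices, recon


latent = [[0.9, 2.2], [3.6, -1.4]]
seg_map = [[1, 0], [0, 1]]        # 1 marks a region of interest
bin_sizes = {0: 2.0, 1: 0.5}      # assumed values; the patent learns these
indices, recon = quantize_with_roi(latent, seg_map, bin_sizes)
print(indices)
print(recon)
```

Values inside the region of interest are reconstructed on a 0.5 grid, while background values fall on a coarser 2.0 grid.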

2.

Invention application
MULTI-SCALE OPTICAL FLOW FOR LEARNED VIDEO COMPRESSION (status: in force)

Publication (announcement) number: US20220303568A1

Publication (announcement) date: 2022-09-22

Application number: US17207244

Application date: 2021-03-19

Applicant: QUALCOMM Incorporated

Inventor: Reza POURREZA, Amir SAID, Yang YANG, Yinhao ZHU, Taco Sebastiaan COHEN

IPC: H04N19/51, H04N19/172, H04N19/137, H04N19/107, H04N19/593, G06N3/08

Abstract: Systems and techniques are described for encoding and/or decoding data based on motion estimation that applies variable-scale warping. An encoding device can receive an input frame and a reference frame that depict a scene at different times. The encoding device can generate an optical flow identifying movements in the scene between the two frames. The encoding device can generate a weight map identifying how finely or coarsely the reference frame can be warped for input frame prediction. The encoding device can generate encoded video data based on the optical flow and the weight map. A decoding device can generate a reconstructed optical flow and a reconstructed weight map from the encoded data. A decoding device can generate a prediction frame by warping the reference frame based on the reconstructed optical flow and the reconstructed weight map. The decoding device can generate a reconstructed input frame based on the prediction frame.

3.

Invention application
DATA COMPRESSION WITH A MULTI-SCALE AUTOENCODER (status: in force)

Publication (announcement) number: US20220292725A1

Publication (announcement) date: 2022-09-15

Application number: US17200694

Application date: 2021-03-12

Applicant: QUALCOMM Incorporated

Inventor: Hoang Cong Minh LE, Reza POURREZA, Yang YANG, Yinhao ZHU, Amir SAID, Yizhe ZHANG, Taco Sebastiaan COHEN

IPC: G06T9/00, G06T3/40, G06N3/08

Abstract: A method of image compression includes receiving an image. Multiple quantized latent representations are generated to represent features of the image. Each of the quantized latent representations has a different resolution and is generated at staggered timings. Each of the later generated quantized latent representations is conditioned on each of the prior generated quantized latent representations. The multiple quantized latent representations are decoded to reconstruct the image.

4.

Invention application
PROGRESSIVE DATA COMPRESSION USING ARTIFICIAL NEURAL NETWORKS (status: in force)

Publication (announcement) number: US20220237740A1

Publication (announcement) date: 2022-07-28

Application number: US17648808

Application date: 2022-01-24

Applicant: QUALCOMM Incorporated

Inventor: Yadong LU, Yang YANG, Yinhao ZHU, Amir SAID, Taco Sebastiaan COHEN

IPC: G06T3/40, G06T9/00

Abstract: Certain aspects of the present disclosure provide techniques for compressing content using a neural network. An example method generally includes receiving content for compression. The content is encoded into a first latent code space through an encoder implemented by an artificial neural network trained to generate a latent space representation of the content. A first compressed version of the encoded content is generated using a first quantization bin size of a series of quantization bin sizes. A refined compressed version of the encoded content is generated by scaling the first compressed version of the encoded content into one or more second quantization bin sizes smaller than the first quantization bin size, conditioned at least on a value of the first compressed version of the encoded content. The refined compressed version of the encoded content is output for transmission.
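
The coarse-then-refined quantization described in this abstract can be sketched as nested bins, assuming each coarse bin is split into an odd number of equal sub-bins. The helper names `coarse_quantize` and `refine`, the split factor of 3, and the sample value are illustrative assumptions; the entropy model that conditions on the coarse value is not modelled.

```python
def coarse_quantize(y, bin_size):
    """Return the coarse bin index and its reconstruction value."""
    index = round(y / bin_size)
    return index, index * bin_size


def refine(y, coarse_index, bin_size, splits=3):
    """Select a sub-bin inside the coarse bin (splits is assumed odd).

    The sub-bin index is only meaningful relative to coarse_index, which is
    the sense in which the refinement depends on the first compressed version.
    """
    fine = bin_size / splits
    center = coarse_index * bin_size
    half = splits // 2
    sub = round((y - center) / fine)
    sub = max(-half, min(half, sub))   # stay inside the coarse bin
    return sub, center + sub * fine


y = 4.2
q0, y0 = coarse_quantize(y, 3.0)   # first, coarse version
s, y1 = refine(y, q0, 3.0)         # refinement with bin size 3.0 / 3 = 1.0
print(q0, y0, s, y1)
```

A receiver that has only `q0` can already reconstruct `y0`; receiving `s` afterwards tightens the reconstruction to `y1` without resending the coarse index.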

5.

Invention publication
INSTANCE-ADAPTIVE IMAGE AND VIDEO COMPRESSION USING MACHINE LEARNING SYSTEMS (status: under examination, published)

Publication (announcement) number: US20240205427A1

Publication (announcement) date: 2024-06-20

Application number: US18420635

Application date: 2024-01-23

Applicant: QUALCOMM Incorporated

Inventor: Ties Jehan VAN ROZENDAAL, Iris Anne Marie HUIJBEN, Taco Sebastiaan COHEN

IPC: H04N19/184, G06N3/045, G06N3/088

CPC classification number: H04N19/184, G06N3/045, G06N3/088

Abstract: Techniques are described for compressing data using machine learning systems and tuning machine learning systems for compressing the data. An example process can include receiving, by a neural network compression system (e.g., trained on a training dataset), input data for compression by the neural network compression system. The process can include determining a set of updates for the neural network compression system, the set of updates including updated model parameters tuned using the input data. The process can include generating, by the neural network compression system using a latent prior, a first bitstream including a compressed version of the input data. The process can further include generating, by the neural network compression system using the latent prior and a model prior, a second bitstream including a compressed version of the updated model parameters. The process can include outputting the first bitstream and the second bitstream for transmission to a receiver.

6.

Invention publication
LEARNED B-FRAME CODING USING P-FRAME CODING SYSTEM (status: under examination, published)

Publication (announcement) number: US20240022761A1

Publication (announcement) date: 2024-01-18

Application number: US18343618

Application date: 2023-06-28

Applicant: QUALCOMM Incorporated

Inventor: Reza POURREZA, Taco Sebastiaan COHEN

IPC: H04N19/59, G06N3/063, G06N3/088, G06N3/045

CPC classification number: H04N19/59, G06N3/063, G06N3/088, G06N3/045

Abstract: Techniques are described for processing video data, such as by performing learned bidirectional coding using a unidirectional coding system and an interpolated reference frame. For example, a process can include obtaining a first reference frame and a second reference frame. The process can include generating a third reference frame at least in part by performing interpolation between the first reference frame and the second reference frame. The process can include performing unidirectional inter-prediction on an input frame based on the third reference frame, such as by estimating motion between an input frame and the third reference frame, and generating a warped frame at least in part by warping one or more pixels of the third reference frame based on the estimated motion. The process can include generating, based on the warped frame and a predicted residual, a reconstructed frame representing the input frame, the reconstructed frame including a bidirectionally-predicted frame.
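
The pipeline in this abstract (interpolate two references, then run unidirectional prediction against the interpolated frame) can be sketched on toy 1-D "frames". Everything here is a simplifying assumption: averaging stands in for the interpolation step, a cyclic integer shift stands in for motion-compensated warping, and the residual is kept lossless, whereas a learned codec would compress it. All function names are hypothetical.

```python
def interpolate(ref_a, ref_b):
    """Build a third reference frame by averaging the two references."""
    return [(a + b) / 2 for a, b in zip(ref_a, ref_b)]


def shift(frame, s):
    """Cyclically shift a 1-D frame by s samples (a stand-in for warping)."""
    s %= len(frame)
    return frame[-s:] + frame[:-s] if s else list(frame)


def estimate_motion(frame, ref, max_shift=2):
    """Pick the integer shift of ref that best matches frame."""
    def err(s):
        return sum((x - w) ** 2 for x, w in zip(frame, shift(ref, s)))
    return min(range(-max_shift, max_shift + 1), key=err)


def code_b_frame(frame, ref_a, ref_b):
    ref = interpolate(ref_a, ref_b)           # third reference frame
    motion = estimate_motion(frame, ref)      # unidirectional motion estimate
    warped = shift(ref, motion)               # warped prediction
    residual = [x - w for x, w in zip(frame, warped)]
    recon = [w + r for w, r in zip(warped, residual)]
    return motion, residual, recon


ref_a = [0.0, 4.0, 8.0, 4.0, 0.0, 0.0]
ref_b = [0.0, 0.0, 4.0, 8.0, 4.0, 0.0]
frame = [0.0, 0.0, 2.0, 6.0, 6.0, 2.0]
motion, residual, recon = code_b_frame(frame, ref_a, ref_b)
print(motion, residual, recon)
```

Because the bidirectional information is folded into the single interpolated reference, the remaining steps are the same as for an ordinary P-frame.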

7.

Invention application
DATA AND COMPUTE EFFICIENT EQUIVARIANT CONVOLUTIONAL NETWORKS (status: in force)

Publication (announcement) number: US20210248467A1

Publication (announcement) date: 2021-08-12

Application number: US17170745

Application date: 2021-02-08

Applicant: QUALCOMM Incorporated

Inventor: Mirgahney Husham Awadelkareem MOHAMED, Gabriele CESA, Taco Sebastiaan COHEN, Max WELLING

IPC: G06N3/08, G06K9/62, G06F9/345

Abstract: Certain aspects of the present disclosure provide a method of performing machine learning, comprising: generating a neural network model; and training the neural network model for a task with a first set of input data, wherein: the training uses a total loss function $\mathcal{L}_{total}$ including an equivariance loss component $\mathcal{L}_{equivariance}$ according to $\mathcal{L}_{total} = \mathcal{L}_{task} + \alpha \mathcal{L}_{equivariance}$, and $\alpha > 0$.
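
A minimal sketch of a total loss of the form task loss plus α times an equivariance loss, assuming 1-D signals, a cyclic shift as the transformation, and mean squared error for both terms. The abstract does not say how the equivariance term is measured; comparing "transform then apply the model" against "apply the model then transform" is one common choice and is an assumption here, as are all function names.

```python
def shift(x, k):
    """Cyclically shift a list by k positions."""
    k %= len(x)
    return x[-k:] + x[:-k] if k else list(x)


def mse(a, b):
    return sum((p - q) ** 2 for p, q in zip(a, b)) / len(a)


def total_loss(model, x, target, alpha, k=1):
    """Task loss plus alpha times an equivariance penalty (alpha > 0)."""
    task = mse(model(x), target)
    # Equivariance: shifting the input should shift the output the same way.
    equivariance = mse(model(shift(x, k)), shift(model(x), k))
    return task + alpha * equivariance


def circular_diff(x):
    """Shift-equivariant toy model: circular first difference."""
    return [x[i] - x[i - 1] for i in range(len(x))]


def position_scaled(x):
    """Non-equivariant toy model: scales each value by its position."""
    return [i * v for i, v in enumerate(x)]


x = [1.0, 2.0, 4.0, 7.0]
loss_equivariant = total_loss(circular_diff, x, circular_diff(x), alpha=0.5)
loss_broken = total_loss(position_scaled, x, position_scaled(x), alpha=0.5)
print(loss_equivariant, loss_broken)
```

Both toy models have zero task loss on their own outputs, so any difference between the two totals comes from the equivariance term alone.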

8.

Invention application
VIDEO COMPRESSION USING DEEP GENERATIVE MODELS (status: under examination, published)

Publication (announcement) number: US20200304802A1

Publication (announcement) date: 2020-09-24

Application number: US16826221

Application date: 2020-03-21

Applicant: QUALCOMM Incorporated

Inventor: Amirhossein HABIBIAN, Ties Jehan VAN ROZENDAAL, Taco Sebastiaan COHEN

IPC: H04N19/14, H04N19/124, H04N19/179, H04N19/186, H04N19/46, H04N5/247, G06K9/00, G06K9/62, G06N3/04, G06N3/08

Abstract: Certain aspects of the present disclosure are directed to methods and apparatus for compressing video content using deep generative models. One example method generally includes receiving video content for compression. The received video content is generally encoded into a latent code space through an encoder, which may be implemented by a first artificial neural network. A compressed version of the encoded video content is generally generated through a trained probabilistic model, which may be implemented by a second artificial neural network, and output for transmission.

9.

Invention publication
VIDEO COMPRESSION USING DEEP GENERATIVE MODELS (status: under examination, published)

Publication (announcement) number: US20230336754A1

Publication (announcement) date: 2023-10-19

Application number: US18337331

Application date: 2023-06-19

Applicant: QUALCOMM Incorporated

Inventor: Amirhossein HABIBIAN, Ties Jehan VAN ROZENDAAL, Taco Sebastiaan COHEN

IPC: H04N19/20, H04N19/14, H04N19/124, H04N19/179, H04N19/186, H04N19/46, G06N3/084, G06V20/40, G06F18/21, G06N3/044, G06N3/045, G06N3/047, H04N23/90, G06V10/764, G06V10/82

CPC classification number: H04N19/20, H04N19/14, H04N19/124, H04N19/179, H04N19/186, H04N19/46, G06N3/084, G06V20/46, G06F18/21, G06N3/044, G06N3/045, G06N3/047, H04N23/90, G06V10/764, G06V10/82

Abstract: Certain aspects of the present disclosure are directed to methods and apparatus for compressing video content using deep generative models. One example method generally includes receiving video content for compression. The received video content is generally encoded into a latent code space through an encoder, which may be implemented by a first artificial neural network. A compressed version of the encoded video content is generally generated through a trained probabilistic model, which may be implemented by a second artificial neural network, and output for transmission.

10.

Invention application
TRANSFORMER-BASED ARCHITECTURE FOR TRANSFORM CODING OF MEDIA (status: in force)

Publication (announcement) number: US20230100413A1

Publication (announcement) date: 2023-03-30

Application number: US17486732

Application date: 2021-09-27

Applicant: QUALCOMM Incorporated

Inventor: Yinhao ZHU, Yang YANG, Taco Sebastiaan COHEN

IPC: H04N19/60

Abstract: Systems and techniques are described herein for processing media data using a neural network system. For instance, a process can include obtaining a latent representation of a frame of encoded image data and generating, by a plurality of decoder transformer layers of a decoder sub-network using the latent representation of the frame of encoded image data as input, a frame of decoded image data. At least one decoder transformer layer of the plurality of decoder transformer layers includes: one or more transformer blocks for generating one or more patches of features and determining self-attention locally within one or more window partitions and shifted window partitions applied over the one or more patches; and a patch un-merging engine for decreasing a respective size of each patch of the one or more patches.
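
The window partitions and shifted window partitions mentioned in this abstract can be sketched on a small grid of patch indices. The abstract does not say how the shift is produced; the cyclic roll used below, and the requirement that the grid height and width be divisible by the window size, are assumptions. The names `window_partition` and `cyclic_shift` are hypothetical, and the attention computation itself is omitted.

```python
def window_partition(grid, m):
    """Split an H x W grid into non-overlapping m x m windows (row-major).

    Assumes H and W are multiples of m. Self-attention would then be
    computed separately inside each returned window.
    """
    h, w = len(grid), len(grid[0])
    return [
        [grid[r][c] for r in range(r0, r0 + m) for c in range(c0, c0 + m)]
        for r0 in range(0, h, m)
        for c0 in range(0, w, m)
    ]


def cyclic_shift(grid, s):
    """Roll the grid by s rows and s columns toward the top-left."""
    rows = grid[s:] + grid[:s]
    return [row[s:] + row[:s] for row in rows]


# A 4 x 4 grid of patch indices 0..15.
grid = [[4 * r + c for c in range(4)] for r in range(4)]
windows = window_partition(grid, 2)
shifted = window_partition(cyclic_shift(grid, 1), 2)
print(windows)
print(shifted)
```

In the shifted partition the first window groups patches 5, 6, 9, and 10, which belonged to four different windows before the shift, so alternating the two partitions lets information cross window boundaries.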
