Patent search ap:("QUALCOMM Incorporated") AND inv:"Yinhao ZHU" Page 2

11.

发明公开
FLOW-AGNOSTIC NEURAL VIDEO COMPRESSION 审中-公开

公开(公告)号：US20230169694A1

公开(公告)日：2023-06-01

申请号：US17975471

申请日：2022-10-27

Applicant: QUALCOMM Incorporated

Inventor： Hoang Cong Minh LE , Reza POURREZA , Yang YANG , Yinhao ZHU , Amir SAID , Taco Sebastiaan COHEN

IPC: G06T9/00

CPC classification number: G06T9/002

Abstract: A processor-implemented method for video compression using an artificial neural network (ANN) includes receiving a video via the ANN. The ANN extracts a first set of features of a current frame of the video and a second set of features of a reference frame of the video. The ANN determines an estimate of correlation features between the first set of features of the current frame and the second set of features of the reference frame. The estimate of the correlation features are encoded and transmitted to a receiver.

12.

发明公开
SCALING FOR DEPTH ESTIMATION 审中-公开

公开(公告)号：US20240177329A1

公开(公告)日：2024-05-30

申请号：US18481050

申请日：2023-10-04

Applicant: QUALCOMM Incorporated

Inventor： Hong CAI , Yinhao ZHU , Jisoo JEONG , Yunxiao SHI , Fatih Murat PORIKLI

IPC: G06T7/593 , G06T3/40 , G06T7/246 , G06T7/579

CPC classification number: G06T7/593 , G06T3/40 , G06T7/248 , G06T7/579 , G06T2207/10012 , G06T2207/20081 , G06T2207/20084

Abstract: Systems and techniques are provided for processing sensor data. For example, a process can include determining, using a trained machine learning system, a predicted depth map for an image, the predicted depth map including a respective predicted depth value for each pixel of the image. The process can further include obtaining depth values for the image, the depth values including depth values for less than all pixels of the image from a tracker configured to determine the depth values based on one or more feature points between frames. The process can further include scaling the predicted depth map for the image using and the depth values. The output of the process can be scale-correct depth prediction values.

13.

发明公开
NEURAL IMAGE COMPRESSION WITH CONTROLLABLE SPATIAL BIT ALLOCATION 审中-公开

公开(公告)号：US20230156207A1

公开(公告)日：2023-05-18

申请号：US17987844

申请日：2022-11-15

Applicant: QUALCOMM Incorporated

Inventor： Yang YANG , Hoang Cong Minh LE , Yinhao ZHU , Reza POURREZA , Amir SAID , Yizhe ZHANG , Taco Sebastiaan COHEN

IPC: H04N19/436 , H04N19/124 , H04N19/147 , H04N19/17 , H04N19/119

CPC classification number: H04N19/436 , H04N19/124 , H04N19/147 , H04N19/17 , H04N19/119

Abstract: A processor-implemented method for image compression using an artificial neural network (ANN) includes receiving, at an encoder of the ANN, an image and a spatial segmentation map corresponding to the image. The spatial segmentation map indicates one or more regions of interest. The encoder compresses the image according to a controllable spatial bit allocation. The controllable spatial bit allocation is based on a learned quantization bin size.

14.

发明申请
MULTI-SCALE OPTICAL FLOW FOR LEARNED VIDEO COMPRESSION 有权

公开(公告)号：US20220303568A1

公开(公告)日：2022-09-22

申请号：US17207244

申请日：2021-03-19

Applicant: QUALCOMM Incorporated

Inventor： Reza POURREZA , Amir SAID , Yang YANG , Yinhao ZHU , Taco Sebastiaan COHEN

IPC: H04N19/51 , H04N19/172 , H04N19/137 , H04N19/107 , H04N19/593 , G06N3/08

Abstract: Systems and techniques are described for encoding and/or decoding data based on motion estimation that applies variable-scale warping. An encoding device can receive an input frame and a reference frame that depict a scene at different times. The encoding device can generate an optical flow identifying movements in the scene between the two frames. The encoding device can generate a weight map identifying how finely or coarsely the reference frame can be warped for input frame prediction. The encoding device can generate encoded video data based on the optical flow and the weight map. A decoding device can generate a reconstructed optical flow and a reconstructed weight map from the encoded data. A decoding device can generate a prediction frame by warping the reference frame based on the reconstructed optical flow and the reconstructed weight map. The decoding device can generate a reconstructed input frame based on the prediction frame.

15.

发明申请
DATA COMPRESSION WITH A MULTI-SCALE AUTOENCODER 有权

公开(公告)号：US20220292725A1

公开(公告)日：2022-09-15

申请号：US17200694

申请日：2021-03-12

Applicant: QUALCOMM Incorporated

Inventor： Hoang Cong Minh LE , Reza POURREZA , Yang YANG , Yinhao ZHU , Amir SAID , Yizhe ZHANG , Taco Sebastiaan COHEN

IPC: G06T9/00 , G06T3/40 , G06N3/08

Abstract: A method of image compression includes receiving an image. Multiple quantized latent representations are generated to represent features of the image. Each of the quantized latent representations has a different resolution and is generated at staggered timings. Each of the later generated quantized latent representations is conditioned on each of the prior generated quantized latent representations. The multiple quantized latent representations are decoded to reconstruct the image.

16.

发明申请
PROGRESSIVE DATA COMPRESSION USING ARTIFICIAL NEURAL NETWORKS 有权

公开(公告)号：US20220237740A1

公开(公告)日：2022-07-28

申请号：US17648808

申请日：2022-01-24

Applicant: QUALCOMM Incorporated

Inventor： Yadong LU , Yang YANG , Yinhao ZHU , Amir SAID , Taco Sebastiaan COHEN

IPC: G06T3/40 , G06T9/00

Abstract: Certain aspects of the present disclosure provide techniques for compressing content using a neural network. An example method generally includes receiving content for compression. The content is encoded into a first latent code space through an encoder implemented by an artificial neural network trained to generate a latent space representation of the content. A first compressed version of the encoded content is generated using a first quantization bin size of a series of quantization bin sizes. A refined compressed version of the encoded content is generated by scaling the first compressed version of the encoded content into one or more second quantization bin sizes smaller than the first quantization bin size, conditioned at least on a value of the first compressed version of the encoded content. The refined compressed version of the encoded content is output for transmission.

Patent Agency Ranking