Patent search ap:("QUALCOMM Incorporated") AND inv:"Reza POURREZA" Page 1

1.

发明公开
LEARNED B-FRAME CODING USING P-FRAME CODING SYSTEM 审中-公开

公开(公告)号：US20240022761A1

公开(公告)日：2024-01-18

申请号：US18343618

申请日：2023-06-28

Applicant: QUALCOMM Incorporated

Inventor： Reza POURREZA , Taco Sebastiaan COHEN

IPC: H04N19/59 , G06N3/063 , G06N3/088 , G06N3/045

CPC classification number: H04N19/59 , G06N3/063 , G06N3/088 , G06N3/045

Abstract: Techniques are described for processing video data, such as by performing learned bidirectional coding using a unidirectional coding system and an interpolated reference frame. For example, a process can include obtaining a first reference frame and a second reference frame. The process can include generating a third reference frame at least in part by performing interpolation between the first reference frame and the second reference frame. The process can include performing unidirectional inter-prediction on an input frame based on the third reference frame, such as by estimating motion between an input frame and the third reference frame, and generating a warped frame at least in part by warping one or more pixels of the third reference frame based on the estimated motion. The process can include generating, based on the warped frame and a predicted residual, a reconstructed frame representing the input frame, the reconstructed frame including a bidirectionally-predicted frame.

2.

发明申请
TEACHING LANGUAGE MODELS TO DRAW SKETCHES 有权

公开(公告)号：US20240394936A1

公开(公告)日：2024-11-28

申请号：US18466747

申请日：2023-09-13

Applicant: QUALCOMM Incorporated

Inventor： Reza POURREZA , Roland MEMISEVIC , Apratim BHATTACHARYYA , Sunny Praful Kumar PANCHAL , Mingu LEE , Pulkit MADAN

IPC: G06T11/20 , G06N3/0464 , G06N3/084 , G06T11/60

Abstract: A processor-implemented method for image generation using an artificial neural network (ANN) includes receiving an input including one or more of an image or a text prompt. The ANN processes the input to determine one or more virtual brush strokes to generate an output image or one or more commands for controlling an image drawing application to generate the output image. A list of the one or more virtual brush strokes to generate the output image or the one or more commands for controlling the image drawing application to generate the output image. The one or more virtual brush strokes or commands may be executed to generate a sketch based on the input.

3.

发明申请
VARIABLE BIT RATE COMPRESSION USING NEURAL NETWORK MODELS 有权

公开(公告)号：US20220224926A1

公开(公告)日：2022-07-14

申请号：US17573568

申请日：2022-01-11

Applicant: QUALCOMM Incorporated

Inventor： Yadong LU , Yang YANG , Yinhao ZHU , Amir SAID , Reza POURREZA , Taco Sebastiaan COHEN

IPC: H04N19/42 , H04N19/30 , H04N19/13 , H04N19/136 , H04N19/124

Abstract: A computer-implemented method for operating an artificial neural network (ANN) includes receiving an input by the ANN. The ANN generates a latent representation of the input. The latent representation is communicated according to a bit rate based on a learned latent scaling parameter. The latent scaling parameter is learned based on a channel index and a tradeoff parameter value that corresponds to a value that balances the bit rate and a distortion.

4.

发明公开
LOW-LATENCY MACHINE LEARNING-BASED STEREO STREAMING 审中-公开

公开(公告)号：US20240364925A1

公开(公告)日：2024-10-31

申请号：US18636126

申请日：2024-04-15

Applicant: QUALCOMM Incorporated

Inventor： Hoang Cong Minh LE , Qiqi HOU , Farzad FARHADZADEH , Amir SAID , Auke Joris WIGGERS , Guillaume Konrad SAUTIERE , Reza POURREZA

IPC: H04N19/597 , H04N19/137 , H04N19/436

CPC classification number: H04N19/597 , H04N19/137 , H04N19/436

Abstract: Systems and techniques are described herein for processing video data. For example, a machine-learning based stereo video coding system can obtain video data including at least a right-view image of a right view of a scene and a left-view image of a left view of the scene. The machine-learning based stereo video coding system can compress the right-view image and the left-view image in parallel to generate a latent representation of the right-view image and the left-view image. The right-view image and the left-view image can be compressed in parallel based on inter-view information between the right-view image and the left-view image, determined using one or more parallel autoencoders.

5.

发明公开
VIDEO CODING USING CAMERA MOTION COMPENSATION AND OBJECT MOTION COMPENSATION 审中-公开

公开(公告)号：US20240013441A1

公开(公告)日：2024-01-11

申请号：US17862149

申请日：2022-07-11

Applicant: QUALCOMM Incorporated

Inventor： Hoang Cong Minh LE , Reza POURREZA , Amir SAID

IPC: G06T9/00 , G06T3/40 , G06T7/246 , G06T7/50 , G06T3/00

CPC classification number: G06T9/00 , G06T3/40 , G06T7/248 , G06T7/50 , G06T3/0093 , G06T2207/20224 , G06T2207/20084

Abstract: Systems and techniques are provided for coding (e.g., encoding and/or decoding) video data using camera motion information. For example, a decoding device can obtain a frame of encoded video data associated with an input frame, the frame of encoded video data including camera information associated with generating the video data and a residual. A camera motion compensated frame can be generated based on a reference frame and the camera information. Optical flow information associated with object motion determined based on at least the input frame and the reference frame can be generated. A motion compensated frame can be generated by warping the camera motion compensated frame based on the optical flow information. A reconstructed input frame can be generated based on the motion compensated frame and the residual.

6.

发明申请
VIDEO COMPRESSION USING RECURRENT-BASED MACHINE LEARNING SYSTEMS 有权

公开(公告)号：US20210281867A1

公开(公告)日：2021-09-09

申请号：US17091570

申请日：2020-11-06

Applicant: QUALCOMM Incorporated

Inventor： Adam Waldemar GOLINSKI , Yang YANG , Reza POURREZA , Guillaume Konrad SAUTIERE , Ties Jehan VAN ROZENDAAL , Taco Sebastiaan COHEN

IPC: H04N19/42 , H04N19/137 , H04N19/172 , H04N19/85 , G06N3/08

Abstract: Techniques are described herein for coding video content using recurrent-based machine learning tools. A device can include a neural network system including encoder and decoder portions. The encoder portion can generate output data for the current time step of operation of the neural network system based on an input video frame for a current time step of operation of the neural network system, reconstructed motion estimation data from a previous time step of operation, reconstructed residual data from the previous time step of operation, and recurrent state data from at least one recurrent layer of a decoder portion of the neural network system from the previous time step of operation. A decoder portion of the neural network system can generate, based on the output data and recurrent state data from the previous time step of operation, a reconstructed video frame for the current time step of operation.

7.

发明公开
NEURAL IMAGE COMPRESSION WITH CONTROLLABLE SPATIAL BIT ALLOCATION 审中-公开

公开(公告)号：US20230156207A1

公开(公告)日：2023-05-18

申请号：US17987844

申请日：2022-11-15

Applicant: QUALCOMM Incorporated

Inventor： Yang YANG , Hoang Cong Minh LE , Yinhao ZHU , Reza POURREZA , Amir SAID , Yizhe ZHANG , Taco Sebastiaan COHEN

IPC: H04N19/436 , H04N19/124 , H04N19/147 , H04N19/17 , H04N19/119

CPC classification number: H04N19/436 , H04N19/124 , H04N19/147 , H04N19/17 , H04N19/119

Abstract: A processor-implemented method for image compression using an artificial neural network (ANN) includes receiving, at an encoder of the ANN, an image and a spatial segmentation map corresponding to the image. The spatial segmentation map indicates one or more regions of interest. The encoder compresses the image according to a controllable spatial bit allocation. The controllable spatial bit allocation is based on a learned quantization bin size.

8.

发明申请
MULTI-SCALE OPTICAL FLOW FOR LEARNED VIDEO COMPRESSION 有权

公开(公告)号：US20220303568A1

公开(公告)日：2022-09-22

申请号：US17207244

申请日：2021-03-19

Applicant: QUALCOMM Incorporated

Inventor： Reza POURREZA , Amir SAID , Yang YANG , Yinhao ZHU , Taco Sebastiaan COHEN

IPC: H04N19/51 , H04N19/172 , H04N19/137 , H04N19/107 , H04N19/593 , G06N3/08

Abstract: Systems and techniques are described for encoding and/or decoding data based on motion estimation that applies variable-scale warping. An encoding device can receive an input frame and a reference frame that depict a scene at different times. The encoding device can generate an optical flow identifying movements in the scene between the two frames. The encoding device can generate a weight map identifying how finely or coarsely the reference frame can be warped for input frame prediction. The encoding device can generate encoded video data based on the optical flow and the weight map. A decoding device can generate a reconstructed optical flow and a reconstructed weight map from the encoded data. A decoding device can generate a prediction frame by warping the reference frame based on the reconstructed optical flow and the reconstructed weight map. The decoding device can generate a reconstructed input frame based on the prediction frame.

9.

发明申请
DATA COMPRESSION WITH A MULTI-SCALE AUTOENCODER 有权

公开(公告)号：US20220292725A1

公开(公告)日：2022-09-15

申请号：US17200694

申请日：2021-03-12

Applicant: QUALCOMM Incorporated

Inventor： Hoang Cong Minh LE , Reza POURREZA , Yang YANG , Yinhao ZHU , Amir SAID , Yizhe ZHANG , Taco Sebastiaan COHEN

IPC: G06T9/00 , G06T3/40 , G06N3/08

Abstract: A method of image compression includes receiving an image. Multiple quantized latent representations are generated to represent features of the image. Each of the quantized latent representations has a different resolution and is generated at staggered timings. Each of the later generated quantized latent representations is conditioned on each of the prior generated quantized latent representations. The multiple quantized latent representations are decoded to reconstruct the image.

10.

发明申请
USING GROUNDED RATIONALES TO IMPROVE VISUAL REASONING 有权

公开(公告)号：US20240386712A1

公开(公告)日：2024-11-21

申请号：US18500986

申请日：2023-11-02

Applicant: QUALCOMM Incorporated

Inventor： Apratim BHATTACHARYYA , Roland MEMISEVIC , Sunny Praful Kumar PANCHAL , Reza POURREZA , Mingu LEE , Pulkit MADAN

IPC: G06V10/82 , G06F40/10 , G06F40/284

Abstract: A processor-implemented method for generating grounded rationales for visual reasoning tasks includes receiving, by a first artificial neural network (ANN), an interleaved sequence of images and textual information. The first ANN extracts grid features of the images of the interleaved sequence of the images and the textual information to generate a representation of the interleaved sequence of the images and the textual information based on the grid features. A second ANN maps the grid features to a textual domain. The second ANN extracts visual information of the interleaved sequence of the images and the textual information based on the grid features in the textual domain. The second ANN determines a rationale based on the visual information. The visual information comprises one or more lower-level surrogate tasks.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification