-
公开(公告)号:US12294705B2
公开(公告)日:2025-05-06
申请号:US17779692
申请日:2019-12-23
Applicant: Google LLC
Inventor: Debargha Mukherjee , Lester Lu , Elliott Karpilovsky
IPC: H04N19/126 , H04N19/119 , H04N19/12 , H04N19/176 , H04N19/42 , H04N19/463 , H04N19/60
Abstract: Residual coding using vector quantization (VQ) is described. A flag indicating whether a residual block for the current block is encoded using VQ. In response to the flag indicating that the residual block is encoded using VQ, a parameter indicating an entry in a codebook is decoded, and the residual block is decoded using the entry. In response to the flag indicating that the residual block is not encoded using VQ, the residual block is decoded based on a skip flag indicating whether the current block is encoded using transform skip. The current block is reconstructed using the residual block.
-
公开(公告)号:US20240388690A1
公开(公告)日:2024-11-21
申请号:US18576733
申请日:2021-07-15
Applicant: Google LLC
Inventor: Yue Chen , Yu Wang , Hui Su , Debargha Mukherjee , Yunqing Wang
IPC: H04N19/105 , H04N19/139 , H04N19/176 , H04N19/52
Abstract: Video coding using warped motion compensation is described. Extended rotations for the warped motion compensation can be explicitly signaled. For example, motion parameters for predicting the current block and a rotation angle can be decoded. A warping matrix is obtained using the motion parameters and the rotation angle, and a prediction block is obtained by projecting the current block to a quadrilateral in a reference frame. Also described is determining a prediction model of the current block and obtaining a prediction block by projecting the current block to a quadrilateral in a reference frame. Determining the prediction model can include determining whether to predict the current block using a motion vector, a local warping model, or a global motion model, obtaining motion parameters of the prediction model, decoding a rotation angle, and obtaining a warping matrix using the motion parameters and the rotation angle.
-
公开(公告)号:US20240314345A1
公开(公告)日:2024-09-19
申请号:US18577831
申请日:2021-07-15
Applicant: GOOGLE LLC
Inventor: Hui Su , Debargha Mukherjee
IPC: H04N19/513 , H04N19/105 , H04N19/119 , H04N19/139 , H04N19/159 , H04N19/176
CPC classification number: H04N19/513 , H04N19/105 , H04N19/119 , H04N19/139 , H04N19/159 , H04N19/176
Abstract: A method for inter-prediction includes coding a first block of a current frame using a first motion vector (MV) and a reference frame type; storing, in at least one MV buffer, the first MV and the reference frame type; identifying MV candidates for coding a current block using the reference frame type; responsive to a determination that a cardinality of the MV candidates is less than a maximum number of MV candidates identifying the first motion vector in the at least one MV buffer, and responsive to a determination that the first MV is not included in the MV candidates, adding the first MV as an MV candidate; and selecting one of the MV candidates for coding the current block.
-
公开(公告)号:US12075089B2
公开(公告)日:2024-08-27
申请号:US17793618
申请日:2020-05-14
Applicant: GOOGLE LLC
Inventor: James Bankoski , Debargha Mukherjee
IPC: H04N19/593 , H04N19/105 , H04N19/11 , H04N19/159 , H04N19/167 , H04N19/176 , H04N19/182 , H04N19/196 , H04N19/59
CPC classification number: H04N19/593 , H04N19/105 , H04N19/11 , H04N19/159 , H04N19/167 , H04N19/176 , H04N19/182 , H04N19/196 , H04N19/59
Abstract: A method for coding a current block using an intra-prediction mode includes obtaining a focal point, the focal point having coordinates (a, b) in a coordinate system; and generating, using first peripheral pixels and second peripheral pixels, a prediction block for the current block, where the first peripheral pixels form a first peripheral pixel line constituting an x-axis, and where the second peripheral pixels form a second peripheral pixel line constituting a y-axis. Generating the prediction block includes, for each location of the prediction block at a location (i, j) of the prediction block, determining at least one of an x-intercept or a y-intercept; and determining a prediction pixel value for the each location of the prediction block using the at least one of the x-intercept or the y-intercept.
-
公开(公告)号:US11870993B2
公开(公告)日:2024-01-09
申请号:US17359767
申请日:2021-06-28
Applicant: GOOGLE LLC
Inventor: Urvang Joshi , Debargha Mukherjee
IPC: H04N19/122 , H04N19/176 , H04N19/82 , H04N19/18 , H04N19/159 , H04N19/124
CPC classification number: H04N19/122 , H04N19/124 , H04N19/159 , H04N19/176 , H04N19/18 , H04N19/82
Abstract: Improved transforms are used to encode and decode large video and image blocks. During encoding, a prediction residual block having a large size (e.g., larger than 32×32) is generated. The pixel values of the prediction residual block are transformed to produce transform coefficients. After determining that the transform coefficients exceed a threshold cardinality representative of a maximum transform block size (e.g., 32×32), a number of the transform coefficients are discarded such that a remaining number of transform coefficients does not exceed the threshold cardinality. A transform block is then generated using the remaining number. During decoding, after determining that the transform coefficients exceed the threshold cardinality, a number of new coefficients are added to the transform coefficients such that a total number of transform coefficients exceeds the threshold cardinality. The transform coefficients are then inverse transformed into a prediction residual block having a large size.
-
公开(公告)号:US20230058845A1
公开(公告)日:2023-02-23
申请号:US17793618
申请日:2020-05-14
Applicant: GOOGLE LLC
Inventor: James Bankoski , Debargha Mukherjee
IPC: H04N19/593 , H04N19/11 , H04N19/176 , H04N19/182
Abstract: A method for coding a current block using an intra-prediction mode includes obtaining a focal point, the focal point having coordinates (a, b) in a coordinate system; and generating, using first peripheral pixels and second peripheral pixels, a prediction block for the current block, where the first peripheral pixels form a first peripheral pixel line constituting an x-axis, and where the second peripheral pixels form a second peripheral pixel line constituting a y-axis. Generating the prediction block includes, for each location of the prediction block at a location (i, j) of the prediction block, determining at least one of an x-intercept or a y-intercept; and determining a prediction pixel value for the each location of the prediction block using the at least one of the x-intercept or the y-intercept.
-
公开(公告)号:US11558631B2
公开(公告)日:2023-01-17
申请号:US16836078
申请日:2020-03-31
Applicant: GOOGLE LLC
Inventor: Urvang Joshi , Debargha Mukherjee , Andrew Simpson
IPC: H04N19/117 , H04N19/176 , H04N19/50 , H04N7/01 , H04N19/184 , H04N19/61
Abstract: A super-resolution coding mode is described. Encoded image can be decoded by decoding, from an encoded bitstream, a flag indicating whether an image was encoded using the super-resolution mode. The image is encoded at a first resolution. Responsive to the flag indicating that the image was encoded using the super-resolution mode, bits indicating an amount of scaling of the image are decoded. The image is decoded from the encoded bitstream to obtain a reconstructed image at the first resolution, and the reconstructed image is upscaled to a second resolution using the amount of scaling to obtain an upscaled reconstructed image. The second resolution is higher than the first resolution. Loop restoration filtering is applied to the upscaled reconstructed image using loop restoration parameters to obtain a loop restored image at the second resolution.
-
公开(公告)号:US20220094950A1
公开(公告)日:2022-03-24
申请号:US17543126
申请日:2021-12-06
Applicant: Google LLC
Inventor: Rohit Singh , Debargha Mukherjee , Elliott Karpilovsky , Lester Lu
IPC: H04N19/184 , H04N19/124 , H04N19/61
Abstract: Transform modes are derived for inter-predicted blocks using side information. A prediction residual is generated for a current video block using a reference frame. Side information associated with one or both of the current video block or the reference frame is identified. A trained transform is determined from amongst multiple trained transforms based on the side information, in which each of the trained transforms is determined using individual side information types and combinations of the individual side information types and the side information represents values of one of the individual side information types or one of the combinations of the individual side information types. The prediction residual is transformed according to the trained transform, and data associated with the transformed prediction residual and the side information are encoded to a bitstream.
-
公开(公告)号:US11284071B2
公开(公告)日:2022-03-22
申请号:US16712057
申请日:2019-12-12
Applicant: GOOGLE LLC
Inventor: Lester Lu , Debargha Mukherjee , Elliott Karpilovsky
IPC: H04N19/107 , H04N19/122 , H04N19/176 , H04N19/50
Abstract: Coding a block of video data includes determining a prediction mode for the block, which is an inter-prediction or intra-prediction mode, determining a transform type for the block, and coding the block using the prediction mode and the transform type. The transform type is one of a first plurality of transform types when the prediction mode is the inter-prediction mode, and is one of a second plurality of transform types when the prediction mode is the intra-prediction mode. The first plurality of transform types includes first fixed transform types and first mode-dependent transform types that are based on a first learned transform generated using inter-predicted blocks. The second plurality of transform types includes second fixed transform types and second mode-dependent transform types that are based on a second learned transform generated using intra-predicted blocks. The first and second fixed transform types have at least some fixed transform types in common.
-
公开(公告)号:US10771783B2
公开(公告)日:2020-09-08
申请号:US16004929
申请日:2018-06-11
Applicant: GOOGLE LLC
Inventor: Urvang Joshi , Debargha Mukherjee
IPC: H04N19/122 , H04N19/176 , H04N19/82 , H04N19/18 , H04N19/159 , H04N19/124
Abstract: Improved transforms are used to encode and decode large video and image blocks. During encoding, a prediction residual block having a large size (e.g., larger than 32×32) is generated. The pixel values of the prediction residual block are transformed to produce transform coefficients. After determining that the transform coefficients exceed a threshold cardinality representative of a maximum transform block size (e.g., 32×32), a number of the transform coefficients are discarded such that a remaining number of transform coefficients does not exceed the threshold cardinality. A transform block is then generated using the remaining number. During decoding, after determining that the transform coefficients exceed the threshold cardinality, a number of new coefficients are added to the transform coefficients such that a total number of transform coefficients exceeds the threshold cardinality. The transform coefficients are then inverse transformed into a prediction residual block having a large size.
-
-
-
-
-
-
-
-
-