Abstract:
A video coder receives data from a bitstream for a block of pixels to be encoded or decoded as a current block of a current picture of a video. Upon determining that an applied block setting of the current block satisfies a threshold condition, the video coder generates a first prediction based on a first motion information for a first prediction unit of the current block. The video coder generates a second prediction based on a second motion information for a second prediction unit of the current block. The video coder generates a third prediction based on the first and second motion information for an overlap prediction region that is defined based on a partitioning between the first prediction unit and the second prediction unit. The video coder encodes or decodes the current block by using the first, second, and third predictions.
Abstract:
A video coder that uses a shared candidate list to encode or decode multiple blocks of pixels within a shared boundary is provided. The video coder identifies a shared boundary encompassing a plurality of blocks of pixels of a current picture in a video sequence. The video coder identifies one or more prediction candidates as a shared candidate list based on neighbors of a region defined by the shared boundary. The video coder codes one or more blocks of pixels encompassed by the shared boundary by using one or more prediction candidates selected from the shared candidate list.
Abstract:
Aspects of the disclosure provide a method for denoising a reconstructed picture in a video coding system. The method can include providing two or more candidate non-local denoising technologies, selecting a target non-local denoising technology from the two or more candidate non-local denoising technologies for a patch group, and denoising the patch group of the reconstructed picture with the target non-local denoising technology. Besides, two parallel processing methods are provided for forming patch groups with predictor-based searching algorithms.
Abstract:
A video coder implementing affine prediction is provided. The video coder receives input data associated with a current block to be coded. The video coder derives a control point motion vector (CPMV) of the current block according to an affine model. The video coder derives a set of motion compensation motion vectors (MCMVs) for a set of sub-blocks of the current block based on the derived CPMV of the current block according to the affine model. The video coder performs motion compensation for the current block based on the set of MCMVs. The CPMV of the current block is derived based on a CPMV of a first reference block that is a neighboring block located in a same region of the current block or an MCMV of a second reference block that is not a neighboring block located in the same region of the current block.
Abstract:
A combined prediction mode for encoding or decoding a pixel block of a video picture is provided. When it is determined that the combined prediction mode is used, a video codec generates an intra predictor for the current block based on a selected intra-prediction mode and a merge-indexed predictor for the current block based on a selected merge candidate from a merge candidates list. The video codec then generates a final predictor for the current block based on the intra predictor and the merge-indexed predictor. The final predictor is then used to encode or decode the current block.
Abstract:
A method and apparatus related to secondary transform operations. In which, the non-zero transform coefficients in a particular subset of the transform coefficients of the block of pixels is counted and compared with a particular threshold so as to determine whether or not to perform the secondary transform on the block of pixels. Besides, the optimal secondary transform kernel size is derived implicitly based on the size of the block, or derived explicitly based on a flag coded by using an adaptive code word.
Abstract:
A multi-hypotheses motion prediction mode for video coding is provided. Multi-hypotheses motion prediction conveys prediction for motion compensation based on a selection of multiple predictions for motion compensation (hypotheses), which are respectively obtained using motion predictors or MVP selected from a list of candidate motion predictors. When coding a block of pixels, a video coder implementing multi-hypotheses motion prediction selects a first motion predictor and a second motion predictor from a list of candidate motion predictors for the block of pixels. The video coder encodes or decodes a motion prediction code word that identifies the first and second motion predictors. The video coder computes a combined prediction for motion compensation based on first and second sets of pixels that are obtained using the selected first and second motion predictors, respectively. The video coder encodes or decodes the block of pixels by using the combined prediction for motion compensation.
Abstract:
The techniques described herein relate to methods, apparatus, and computer readable media configured to receive compressed video data, wherein the compressed video data is related to a set of frames. A decoder-side predictor refinement technique is used to calculate a new motion vector for a current frame from the set of frames, wherein the new motion vector estimates motion for the current frame based on one or more reference frames. An existing motion vector associated with a different frame from a motion vector buffer is retrieved. The new motion vector is calculated based on the existing motion vector using a decoder-side motion vector prediction technique, such that the existing motion vector is in the motion vector buffer after calculating the new motion vector.
Abstract:
A method for cross-color Intra prediction using the LM Intra mode using multi-row or multi-column neighboring reconstructed pixels for LM parameter derivation or using only top pixels or left pixels of neighboring pixels is disclosed. Multiple LM Intra modes can be used. For example, three LM Intra modes can be used and the LM parameters for the three LM Intra modes can be determined based on only the top pixels, only the left pixels and both the top pixels and left pixels of neighboring reconstructed pixels respectively. To remove the need for additional buffer requirement for deriving the LM parameters based on using multi-row or multi-column neighboring reconstructed pixels, the current method re-uses existing buffers, where the buffers are used for deblocking. A syntax element can be used to indicate one of the multi-LM modes selected.
Abstract:
A method and apparatus for processing 2N×2N transform units (TUs) are disclosed. In one embodiment, the method comprises determining a first-layer scanning order among four N×N sub-blocks of the 2N×2N TU; determining a second-layer scanning pattern for said four N×N sub-blocks; and providing scanned 2N×2N transform coefficients of the intra-coded or the inter-coded 2N×2N TU using double scanning based on the first-layer scanning order and the second-layer scanning pattern. In another embodiment, said determining the first-layer scanning order is dependent on the second-layer scanning pattern. The second-layer scanning pattern can be diagonal, horizontal or vertical. In an embodiment, the first-layer scanning order can be from an upper-left sub-block, to an upper-right sub-block, to a lower-left sub-block and to a lower-right sub-block for the second-layer horizontal scanning pattern and from an upper-left sub-block, to a lower-left sub-block, to an upper-right sub-block and to a lower-right sub-block for other second-layer scanning patterns.