Abstract:
A method for performing parallel coding with ordered entropy slices includes: providing a plurality of entropy slices within a picture to a plurality of processing elements, wherein each entropy slice comprises a plurality of blocks; and respectively starting each of the processing elements to perform a Context-based Adaptive Binary Arithmetic Coding (CABAC) processing of a coding procedure for a corresponding entropy slice; wherein CABAC states of the current entropy slice is initialized as the CABAC states of a previous entropy slice after processing DB blocks of the previous entropy slice, DB is a positive integer, and the CABAC processing for the current entropy slice and the CABAC processing for the previous entropy slices are performed in parallel during at least a portion of processing time.
Abstract:
Various schemes pertaining to video coding parallelization techniques are described. An apparatus receives video data. The apparatus subsequently calculates a plurality of figures of merits (FOMs), each of the FOM representing how well a particular coding tool may perform in encoding the video data. The apparatus further determines a coding tool that may be suitable for encoding the video data by comparing the FOMs. In determining the coding tool, the apparatus utilizes time-interleaving techniques to parallelly process the video data. The video data may include an array of coding blocks, and the apparatus may receive the video data using a snake-like processing order scanning through the array of coding blocks.
Abstract:
Exemplary video processing methods and apparatuses for encoding or decoding a current block by inter prediction are disclosed. Input data of a current block is received and partitioned into sub-partitions and motion refinement is independently performed on each sub-partition. A reference block for each sub-partition is obtained from one or more reference pictures according to an initial motion vector (MV). A refined MV for each sub-partition is derived by searching around the initial MV with N-pixel refinement. One or more boundary pixels of the reference block for a sub-partition is padded for motion compensation of the sub-partition. A final predictor for the current block is generated by performing motion compensation for each sub-partition according to its refined MV. The current block is then encoded or decoded according to the final predictor.
Abstract:
Various schemes pertaining to video coding parallelization techniques are described. An apparatus receives video data. The apparatus subsequently calculates a plurality of figures of merits (FOMs), each of the FOM representing how well a particular coding tool may perform in encoding the video data. The apparatus further determines a coding tool that may be suitable for encoding the video data by comparing the FOMs. In determining the coding tool, the apparatus utilizes time-interleaving techniques to parallelly process the video data. The video data may include an array of coding blocks, and the apparatus may receive the video data using a snake-like processing order scanning through the array of coding blocks.
Abstract:
A video coding system generating candidates for Merge Mode with Motion Vector Difference (MMVD) with reduced resource usage is provided. The system receives data to be encoded or decoded as a current block of a current picture of a video. The system identifies multiple MMVD candidates for different offset positions based on a merge candidate of the current block. The system generates reference samples for the identified MMVD candidates. The system reconstructs the current block or encodes the current block into a bitstream by using the generated reference samples. The system processes the MMVD candidates in separate groups: a first group of vertical MMVD candidates and a second group of horizontal MMVD candidates. The system generates the reference samples for the identified MMVD candidates by applying a vertical filter to source reference samples of horizontal MMVD candidates and then applying a horizontal filter to outputs of the vertical filter.
Abstract:
In one method, the current block is partitioned into multiple final sub-blocks using one or more stages of sub-tree partition comprising ternary tree partition and at least one other-type partition, where ternary partition tree is excluded from the sub-tree partition if a current sub-tree depth associated with a current sub-block is greater than a first threshold and the first threshold is an integer greater than or equal to 1. In another method, if a test condition is satisfied, the current block is encoded or decoded using a current Inter mode selected from a modified group of Inter tools, where the modified group of Inter tools is derived from an initial group of Inter tools by removing one or more first Inter tools from the initial group of Inter tools, replacing one or more second Inter tools with one or more complexity-reduced Inter tools, or both.
Abstract:
A method and apparatus for video coding incorporating enhanced multiple transform (EMT) are disclosed. According to one method, the number of candidates for EMT in the horizontal direction or vertical direction is reduced depending on the current block size. According to another method, an EMT flag, one or more EMT indices or both are signalled only if the total number of non-zero coefficients in at least a part of the 2D coefficient block is greater than a threshold, where the threshold is equal to 1 or larger. A method and apparatus for video coding using non-separable secondary transform (NSST) are disclosed. According to this method, a total number of non-zero first coefficients in a partial block of the 2D coefficient block is determined and used to determine whether to apply the NSST process.
Abstract:
A method and apparatus of video encoding video coding for a video encoder or decoder using Neural Network (NN) are disclosed. According to this method, the multiple frames in a video sequence comprises multiple segments, where each of the multiple segments comprises a set of frames. The NN (Neural Network) processing is applied to a target signal in one or more encoded frames of a target segment in the encoder side or to the target signal in one or more decoded frames of the target segment in the decoder side using one NN parameter set for the target segment. The target signal may correspond to reconstructed residual, reconstructed output, de-blocked output, SAO (sample adaptive offset) output, ALF (adaptive loop filter) output, or a combination thereof. In another embodiment, the NN processing is applied to a target signal only in one or more specific encoded or decoded frames.
Abstract:
Video processing methods and apparatuses include receiving input data associated with a current picture composed of multiple Coding Tree Units (CTUs) for encoding or decoding, partitioning the current picture into one or more slices including raster scan slices or rectangular slices, and for each slice in the current picture, encoding or decoding each slice by always processing CTUs within each slice in a raster scan order. Each slice includes an integer number of complete CTUs in the current picture that are exclusively contained in a single Network Access Layer (NAL) unit. Tile partition and tile concept may be removed to reduce the coding complexity of video processing.
Abstract:
A video system that partitions slices of video pictures into slice chunks is provided. A video decoder receives data from a bitstream to be decoded as a current picture of a video. When a first syntax element in a picture parameter set (PPS) of the current picture indicates that one or more slices of the current picture are allowed to be divided into multiple slice chunks and when a second syntax element in a slice header of a current slice of the current picture indicates that data of the current slice is encoded and delivered in two or more slice chunks, the video decoder parses and specifies sizes and positions of the slice chunks of the current slice and reconstructs the current picture based on the slice chunks of the current slice.