Abstract:
A method and apparatus of video coding incorporating Deep Neural Network are disclosed. A target signal is processed using DNN (Deep Neural Network), where the target signal provided to DNN input corresponds to the reconstructed residual, output from the prediction process, the reconstruction process, one or more filtering processes, or a combination of them. The output data from DNN output is provided for the encoding process or the decoding process. The DNN can be used to restore pixel values of the target signal or to predict a sign of one or more residual pixels between the target signal and an original signal. An absolute value of one or more residual pixels can be signalled in the video bitstream and used with the sign to reduce residual error of the target signal.
Abstract:
Method and apparatus of video coding using decoder derived motion information based on bilateral matching or template matching are disclosed. According to one method, an initial motion vector (MV) index is signalled in a video bitstream at an encoder side or determined from the video bitstream at a decoder side. A selected MV is then derived using bilateral matching, template matching or both to refine an initial MV associated with the initial MV index. In another method, when both MVs for list 0 and list 1 exist in template matching, the smallest-cost MV between the two MVs may be used for uni-prediction template matching if the cost is lower than the bi-prediction template matching. According to yet another method, the refinement of the MV search is dependent on the block size. According to yet another method, merge candidate MV pair is always used for bilateral matching or template matching.
Abstract:
Methods of palette coding to reduce the required coding process are disclosed. According to one method, smaller blocks are derived from a large block. The histogram of the large block is derived based on the histograms of smaller blocks in the large block. According to another method, one or more palette tables are derived based on multiple blocks. One palette table is used for each of the multiple blocks. According to yet another method, index map transpose is performed in the parsing stage according to the transpose flag of the index map. Accordingly, a buffer to store the transpose flags can be saved. According to still yet another method, the palette predictor update is performed using an index mapping table to avoid the need for shuffling the contents of the palette predictor stored in a palette buffer.
Abstract:
A method and apparatus for color index coding of a block of video data using index prediction based on reconstructed neighboring pixels are disclosed. In one embodiment, color index encoding or decoding is applied to the current pixel indices by deriving index prediction from neighboring pixel indices of the reconstructed neighboring pixels. The reconstructed neighboring pixel values are first transformed into the neighboring pixel indices according to a quantization table to map between major color indices and major color values of the current block. The quantization table can be based on a major color table for the current block or can also be derived at a decoder side. The decoding process may reconstruct the current pixels by converting decoded current pixel indices to reconstructed current pixel values according to the quantization table or by directly copying the reconstructed neighboring pixel values indicated by decoded current pixel indices.
Abstract:
A method and apparatus for coding a block of video data using index or pixel value prediction including a copy-by-pattern-search mode are disclosed. According to the present invention, a current search pattern is determined based on one or more previous coded pixels. One or more predictors are derived according to the current search pattern for a current index or pixel value of a current pixel in the current block. Encoding or decoding is then applied to one or more following indices or pixel values of one or more following pixels including the current pixel using the predictors. In a simplest case, the one or more previous coded pixels correspond to a single previous coded pixel at the left side of the current pixel and the one or more following pixels contain only the current pixel.
Abstract:
Methods and apparatus for a coding system using palette mode coding for colour video are disclosed. According to method, when the palette mode is enabled, separate coding of luma and chroma components is disabled or if separate coding of luma and chroma components, the palette mode is disabled. According to another method, when the palette mode is enabled for the current image, palette mode coding is only applied to the luma blocks and non-palette mode coding is applied to the chroma blocks. According to yet another method, palette mode coding is applied to the luma blocks using luma palettes if the palette mode is selected for the luma blocks and palette mode coding applying to the chroma blocks using chroma palettes if the palette mode is selected for the chroma blocks. According to yet another method, the scan direction for a non-square CU depends on the shape of the non-square.
Abstract:
A method and apparatus for a video coding system with the current picture referencing (CPR) mode enabled are disclosed. According to one method, the luma and chroma blocks of the current image are jointly coded using a same coding unit (CU) structure if the CPR mode is selected for the luma and the chroma blocks. Alternatively, if the luma and chroma components are partitioned into the luma and the chroma blocks separately using separate CU structures, the luma and chroma blocks are encoded or decoded using a coding mode selected from a coding mode group excluding the CPR mode. According to another method, the luma and chroma blocks of the current image are coded separately using a different CU structure if the CPR mode is selected for the luma and chroma blocks. In yet another method, reconstructed reference data is disclosed for the CPR mode with CU equal to PU.
Abstract:
A method for color index coding of blocks in a picture is disclosed. In one embodiment for the decoder side, a current coding mode is parsed from the input coded bitstream for one or more first currently decoded pixels in a current block starting from a first starting position. If the current coding mode corresponds to a generalized copy-previous-row mode, one or more first reference pixels starting from a first starting reference position located at T-th row above the first currently decoded pixels are identified, where T is an integer equal to or greater than one. The first reference pixels starting from the first starting reference position are then used to decode the first currently decoded pixels starting from a first starting position by copying the first reference pixels starting from the first starting reference position respectively. The system may also include a copy-left mode or escape mode.
Abstract:
A method and apparatus of video coding incorporating Deep Neural Network are disclosed. A target signal is processed using DNN (Deep Neural Network), where the target signal provided to DNN input corresponds to the reconstructed residual, output from the prediction process, the reconstruction process, one or more filtering processes, or a combination of them. The output data from DNN output is provided for the encoding process or the decoding process. The DNN can be used to restore pixel values of the target signal or to predict a sign of one or more residual pixels between the target signal and an original signal. An absolute value of one or more residual pixels can be signalled in the video bitstream and used with the sign to reduce residual error of the target signal.
Abstract:
In order to overcome the issue caused by a decoded block vector (BV) pointing to a reference block overlapping with an unavailable area, various methods are disclosed. According to one method, if the reference block overlaps with an unavailable area, the reference pixels in the unavailable area are generated for IntraBC prediction of the current block by padding from neighbouring available pixels. The padding can be done in the horizontal direction and then the vertical direction. The padding may also done in the vertical direction first and then horizontal direction. In another method, if the reference block overlaps with an unavailable area, the reference pixels in the unavailable area are generated for IntraBC prediction of the current block by using previous decoded pixels in the unavailable area. A pre-defined value may also be used for the unavailable area.