Abstract:
Methods and apparatus of processing 360-degree virtual reality images are disclosed. According to one method, a 2D (two-dimensional) frame is divided into multiple blocks. The multiple blocks are encoded or decoded using quantization parameters by restricting a delta quantization parameter to be within a threshold for any two blocks corresponding to two neighboring blocks on a 3D sphere. According to another embodiment, one or more guard bands are added to one or more edges that are discontinuous in the 2D frame but continuous in the 3D sphere. Fade-out process is applied to said one or more guard bands to generate one or more faded guard bands. At the decoder side, the reconstructed 2D frame is generated from the decoded extended 2D frame by cropping said one or more decoded faded guard bands or by blending said one or more decoded faded guard bands and reconstructed duplicated areas.
Abstract:
A method and apparatus of video encoding or decoding for a video encoding or decoding system applied to multi-face sequences corresponding to a 360-degree virtual reality sequence are disclosed. According to embodiments of the present invention, at least one face sequence of the multi-face sequences is encoded or decoded using face-independent coding, where the face-independent coding encodes or decodes a target face sequence using prediction reference data derived from previous coded data of the target face sequence only. Furthermore, one or more syntax elements can be signaled in a video bitstream at an encoder side or parsed from the video bitstream at a decoder side, where the syntax elements indicate first information associated with a total number of faces in the multi-face sequences, second information associated with a face index for each face-independent coded face sequence, or both the first information and the second information.
Abstract:
An exemplary method for processing an input bitstream having a plurality of video frames includes the following steps: deriving an indication data from decoding of a current video frame, and controlling a video decoder to decode or skip a next video frame by referring to at least the indication data and a video decoder capability of the video decoder. A signal processing apparatus for processing an input bitstream including a plurality of video frames includes a video decoder, an indication data estimating unit, and a controller. The video decoder is arranged to decode a current video frame. The indication data estimating unit is for deriving an indication data from decoding of the current video frame. The controller is for controlling the video decoder to decode or skip a next video frame by referring to at least the indication data and a video decoder capability of the video decoder.
Abstract:
A method and apparatus for video coding of a block of depth data or texture data using a simple Intra mode is disclosed. The method determines a prediction process selected from a prediction process list for the current block, where the prediction process list comprises at least a single sample mode and at least a simplified Intra prediction mode. If the prediction process selected for the current block corresponds to one single sample mode, encoding or decoding the current block using a single sample value derived from one or more previously decoded pixels for a whole current block. If the prediction process selected for the current block corresponds to one simplified Intra prediction mode, encoding or decoding the current block using Intra prediction signal derived according to a corresponding Intra prediction mode with no residual coding for the current block.
Abstract:
A method of three-dimensional video encoding and decoding that adaptively incorporates camera parameters in the video bitstream according to a control flag is disclosed. The control flag is derived based on a combination of individual control flags associated with multiple depth-oriented coding tools. Another control flag can be incorporated in the video bitstream to indicate whether there is a need for the camera parameters for the current layer. In another embodiment, a first flag and a second flag are used to adaptively control the presence and location of camera parameters for each layer or each view in the video bitstream. The first flag indicates whether camera parameters for each layer or view are present in the video bitstream. The second flag indicates camera parameter location for each layer or view in the video bitstream.
Abstract:
A method and apparatus for 3D coding to support fast bi-prediction having identical motion in the advanced residual prediction (ARP) are disclosed. Embodiment of the present invention use one or more aligned operations including data clipping and reference picture selection associated with motion vector derivation during residual predictor generation. When ARP mode is enabled, the residual prediction process for uni-prediction and bi-prediction perform same data clipping process and same reference picture selection process effectively. A single clipping operation or two clipping operations can be performed for both uni-prediction and bi-prediction.
Abstract:
A method and apparatus of three-dimensional/multi-view coding using a candidate list including a second inter-view candidate in the candidate list for Merge mode, Skip mode or AMVP based (Advanced Motion Vector Prediction based) Inter mode are disclosed. The second inter-view candidate can be derived based on already coded or decoded texture data for the candidate list to include. For example, the second inter-view candidate can be determined from the motion information associated with a corresponding block in a reference view, where the corresponding block is located according to the location of the right-bottom neighboring block and a selected disparity vector. The right-bottom neighboring block is located across from a right-bottom corner of the current texture block. The second inter-view candidate can be inserted into the candidate list only when the number of previous available candidates is smaller than a pre-specified number.
Abstract:
A method and apparatus for three-dimensional video coding, multi-view video coding and scalable video coding are disclosed. Embodiments of the present invention use two stage motion data compression to reduce motion data buffer requirement. A first-stage motion data compression is applied after each texture picture or depth map is coded to reduce motion data buffer requirement. Accordingly, first compressed motion data is stored in reduced resolution in the buffer to reduce storage requirement and the first compressed motion data is used for coding process of other texture pictures or depth maps in the same access unit. After all pictures in an access unit are coded, motion data associated with the access unit is further compressed and the second compressed motion data is used during coding process of pictures in other access unit.
Abstract:
A method and apparatus of priority-based MVP (motion vector predictor) derivation for motion compensation in a video encoder or decoder are disclosed. According to this method, one or more final motion vector predictors (MVPs) are derived using priority-based MVP derivation process. The one or more final MVPs are derived by selecting one or more firstly available MVs from a priority-based MVP list for Inter prediction mode, Skip mode or Merge mode based on reference data of one or two target reference pictures that are reconstructed prior to the current block according to a priority order. Therefore, there is no need for transmitting information at the encoder side nor deriving information at the decoder side that is related to one or more MVP indices to identify the one or more final MVPs in the video bitstream.
Abstract:
Methods and apparatus of processing cube face images are disclosed. According to embodiments of the present invention, one or more discontinuous boundaries within each assembled cubic frame are determined and used for selective filtering, where the filtering process is skipped at said one or more discontinuous boundaries within each assembled cubic frame when the filtering process is enabled. Furthermore, the filtering process is applied to one or more continuous areas in each assembled cubic frame.