Abstract:
Techniques are described for determining a block in a reference picture in a reference view based on a disparity vector for a current block. The techniques start the disparity vector from a bottom-right pixel in a center 2×2 sub-block within the current block, and determine a location within the reference picture to which the disparity vector refers. The determined block covers the location referred to by the disparity vector based on the disparity vector starting from the bottom-right pixel in the center 2×2 sub-block within the current block.
Abstract:
Techniques and systems are provided for encoding video data. For example, a method of encoding video data includes obtaining a background picture that is generated based on a plurality of pictures captured by an image sensor. The background picture is generated to include background portions identified in each of the captured pictures. The method further includes encoding, into a video bitstream, a group of pictures captured by the image sensor. The group of pictures includes at least one random access picture. Encoding the group of pictures includes encoding at least a portion of the at least one random access picture using inter-prediction based on the background picture.
Abstract:
This disclosure includes techniques for signaling characteristics of a representation of multimedia content at a representation layer, such as frame packing arrangement information for the representation. In one example, an apparatus for receiving video data includes a processing unit configured to receive information indicative of whether a bitstream includes a packed frame of video data, wherein the packed frame comprises two frames corresponding to different views of a scene for three-dimensional playback, and wherein the information is present in a representation layer external to a codec layer of the bitstream, automatically determine whether the apparatus is capable of decoding and rendering the bitstream based on an analysis of the received information and decoding and rendering capabilities of the device, and retrieve the bitstream when the processing unit determines that the device is capable of decoding and rendering the bitstream.
Abstract:
Aspects of this disclosure relate to, in an example, a method that includes identifying a first block of video data in a first temporal location from a first view, wherein the first block is associated with a first disparity motion vector. The method also includes determining a motion vector predictor for a second motion vector associated with a second block of video data, wherein the motion vector predictor is based on the first disparity motion vector. When the second motion vector comprises a disparity motion vector, the method includes determining the motion vector predictor comprises scaling the first disparity motion vector to generate a scaled motion vector predictor, wherein scaling the first disparity motion vector comprises applying a scaling factor comprising a view distance of the second disparity motion vector divided by a view distance of the first motion vector to the first disparity motion vector.
Abstract:
A video coder generates a list of merging candidates for coding a video block of the 3D video. A maximum number of merging candidates in the list of merging candidates may be equal to 6. As part of generating the list of merging candidates, the video coder determines whether a number of merging candidates in the list of merging candidates is less than 5. If so, the video coder derives one or more combined bi-predictive merging candidates. The video coder includes the one or more combined bi-predictive merging candidates in the list of merging candidates.
Abstract:
In one example, a video coder (e.g., a video encoder or a video decoder) is configured to determine that a current block of video data is coded using a disparity motion vector, wherein the current block is within a containing block, based on a determination that a neighboring block to the current block is also within the containing block, substitute a block outside the containing block and that neighbors the containing block for the neighboring block in a candidate list, select a disparity motion vector predictor from one of a plurality of blocks in the candidate list, and code the disparity motion vector based on the disparity motion vector predictor. In this manner, the techniques of this disclosure may allow blocks within the containing block to be coded in parallel.
Abstract:
Techniques and systems are provided for coding video data. For example, a method of coding video data includes determining motion information for a current block and determining an illumination compensation status for the current block. The method further includes coding the current block based on the motion information and the illumination compensation status for the current block. In some examples, the method further includes determining the motion information for the current block based on motion information of a candidate block. In such examples, the method further includes determining an illumination compensation status of the candidate block and deriving the illumination compensation status for the current block based on the illumination compensation status of the candidate block.
Abstract:
A video decoder performs a neighboring-block based disparity vector (NBDV) derivation process to determine a disparity vector or performs a NBDV refinement (NBDV-R) process to determine the disparity vector. The video decoder uses the disparity vector as a disparity vector for a current block without using a median filtering process on multiple disparity motion vectors, wherein the current block is coded in either a skip mode or a direct mode. Furthermore, the video coder determines pixel values for the current block.
Abstract:
Systems, methods, and devices for coding multilayer video data are disclosed that may include, encoding, decoding, transmitting, or receiving a non-entropy encoded set of profile, tier, and level syntax structures, potentially at a position within a video parameter set (VPS) extension. The systems, methods, and devices may refer to one of the profile, tier, and level syntax structures for each of a plurality of output layer sets. The systems, methods, and devices may encode or decode video data of one of the output layer sets based on information from the profile, tier, and level syntax structure referred to for the output layer set.
Abstract:
This disclosure describes techniques for signaling and processing information indicating simplified depth coding (SDC) for depth intra-prediction and depth inter-prediction modes in a 3D video coding process, such as a process defined by the 3D-HEVC extension to HEVC. In some examples, the disclosure describes techniques for unifying the signaling of SDC for depth intra-prediction and depth inter-prediction modes in 3D video coding. The signaling of SDC can be unified so that a video encoder or video decoder uses the same syntax element for signaling SDC for both the depth intra-prediction mode and the depth inter-prediction mode. Also, in some examples, a video coder may and/or process a residual value generated in the SDC mode using the same syntax structure, or same type of syntax structure, for both the depth intra-prediction mode and depth inter-prediction mode.