Abstract:
Techniques for encapsulating video streams containing multiple coded views in a media file are described herein. In one example, a method includes parsing a track of video data, wherein the track includes one or more views. The method further includes parsing information to determine whether a texture view or a depth view of a reference view is required for decoding at least one of the one or more views in the track. Another example method includes composing a track of video data, wherein the track includes one or more views and composing information that indicates whether a texture view or a depth view of a reference view is required for decoding at least one of the one or more views in the track.
Abstract:
This disclosure describes techniques for improving coding efficiency of motion prediction in multiview and 3D video coding. In one example, a method of decoding video data comprises deriving one or more disparity vectors for a current block, the disparity vectors being derived from neighboring blocks relative to the current block, converting a disparity vector to one or more of inter-view predicted motion vector candidates and inter-view disparity motion vector candidates, adding the one or more inter-view predicted motion vector candidates and the one or more inter-view disparity motion vector candidates to a candidate list for a motion vector prediction mode, and decoding the current block using the candidate list.
Abstract:
In one example, a video coder, such as a video encoder or a video decoder, is configured to code a value for a layer identifier in a slice header for a current slice in a current layer of multi-layer video data, and, when the value for the layer identifier is not equal to zero, code a first set of syntax elements in accordance with a base video coding standard, and code a second set of one or more syntax elements in accordance with an extension to the base video coding standard. The second set of syntax elements may include a syntax element representative of a position for an identifier of an inter-layer reference picture of a reference layer in a reference picture list, and the video coder may construct the reference picture list such that the identifier of the inter-layer reference picture is located in the determined position.
Abstract:
A video processing device can be configured to process one or more initial syntax elements for a parameter set associated with a video bitstream; receive in the parameter set an offset syntax element for the parameter set that identifies syntax elements to be skipped within the parameter set; and based on the offset syntax element, skip the syntax elements within the parameter set and process one or more additional syntax elements in the parameter set that are after the skipped syntax elements in the parameter set.
Abstract:
In some example techniques for generating and updating depth map estimates used for inter-view motion prediction and/or inter-view residual prediction for coding multiview video data, each of a plurality of dependent views is associated with a dependent depth view estimate, which may be generated or updated based on coding of the texture data of the dependent view relative to a base view. In such examples, each of the dependent depth map estimates may be warped to the base view to produce a respective one of a plurality of base depth map estimates. Each dependent depth map estimate and the respective base depth map estimate for a depth map estimate pair associated with the respective one of the plurality of dependent views.
Abstract:
A video encoder generates a network abstraction layer (NAL) unit that includes at least a first syntax element and a second syntax element. The first syntax element indicates that the NAL unit belongs to a particular NAL unit type. Coded slices of texture view components and depth view components are encapsulated within NAL units that belong to the particular NAL unit type. The second syntax element indicates whether a NAL unit header of the NAL unit includes an Advanced Video Coding (AVC)-compatible 3-dimensional video (3DV) header extension or includes a Multiview Video Coding (MVC)-compatible 3DV header extension. The video encoder outputs a bitstream that includes the NAL unit. A video decoder receives the NAL unit and determines whether the second syntax element indicates that the NAL unit header of the NAL unit includes the AVC-compatible 3DV header extension or the MVC-compatible 3DV header extension.
Abstract:
Systems, methods, and devices for coding video data are described herein. In some aspects, a memory unit is configured to store the video data and a candidate list. The video data includes a base layer and an enhancement layer. The base layer includes a base layer prediction unit co-located with an enhancement layer prediction unit in the enhancement layer. The candidate list includes a list of motion vectors for use by the enhancement layer prediction unit. A processor is configured to store motion vectors originating from spatial neighbors of the enhancement layer prediction unit, and not motion vectors originating from temporal neighbors of the enhancement layer prediction unit, in the candidate list. The processor is also configured to store motion vectors originating from the base layer prediction unit in the candidate list.
Abstract:
In general, techniques are described for separately coding depth and texture components of video data. A video coding device for processing the video data comprising one or more processors may perform the techniques. The one or more processors may be configured to determine first sequence level information describing characteristics of the depth components, and determine second sequence level information describing characteristics of an operation point of the video data.
Abstract:
A video coding device, such as a video encoder or a video decoder, may be configured to code a parameter set group representing a first parameter set of a first type and a second parameter set of a second, different type, and code a slice of video data using information of the parameter set group, information of the first parameter set, and information of the second parameter set, wherein the slice includes information referring to the parameter set group. The video coding device may further code the first and second parameter sets.
Abstract:
Techniques are described related to receiving first and second sub-sequences of video, wherein the first sub-sequence includes one or more frames each having a first resolution, and the second sub-sequence includes one or more frames each having a second resolution, receiving a first sequence parameter set and a second sequence parameter set for the coded video sequence, wherein the first sequence parameter set indicates the first resolution of the one or more frames of the first sub-sequence, and the second sequence parameter set indicates the second resolution of the one or more frames of the second sub-sequence, and wherein the first sequence parameter set is different than the second sequence parameter set, and using the first sequence parameter set and the second sequence parameter set to decode the coded video sequence.