Abstract:
In one example, a device for coding video data includes a video coder configured to: code data indicating whether tile boundaries of different layers of video data are aligned and whether inter-layer prediction is allowed along or across tile boundaries of enhancement layer blocks; code an enhancement layer block in an enhancement layer tile of the video data without using inter-layer prediction from a collocated base layer block for which inter-layer filtering or reference layer filtering across tile boundaries is enabled in a reference layer picture of an access unit that includes both the enhancement layer tile and the base layer block; and code the collocated base layer block.
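The following is a minimal sketch, not the disclosure's implementation, of the per-block decision described above: inter-layer prediction from the collocated base layer block is skipped when filtering may cross tile boundaries in the reference layer picture. The function and flag names are illustrative assumptions.

```python
def may_use_inter_layer_prediction(tile_boundaries_aligned: bool,
                                   filtering_across_tile_boundaries: bool) -> bool:
    """Decide whether an enhancement-layer block may predict from the
    collocated base-layer block (illustrative flag names)."""
    if not tile_boundaries_aligned:
        # Tile boundaries of the layers are not aligned: disallow prediction
        # along or across tile boundaries.
        return False
    # If inter-layer or reference-layer filtering may cross tile boundaries in
    # the reference layer picture, code the block without inter-layer prediction.
    return not filtering_across_tile_boundaries


print(may_use_inter_layer_prediction(True, False))  # True  -> inter-layer prediction allowed
print(may_use_inter_layer_prediction(True, True))   # False -> coded without inter-layer prediction
```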
Abstract:
In general, techniques are described for separately coding depth and texture components of video data. A video coding device for coding video data that includes a view component comprised of a depth component and a texture component may perform the techniques. The video coding device may comprise, as one example, a processor configured to activate a parameter set as a texture parameter set for the texture component of the view component, and code the texture component of the view component based on the activated texture parameter set.
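As a rough illustration of the activation step, the sketch below keeps separately activated parameter sets for the texture and depth components of a view component; the class, method, and field names are hypothetical and do not reflect the disclosure's actual structures.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ParameterSet:
    ps_id: int
    payload: dict


class ViewComponentCoder:
    def __init__(self) -> None:
        self.active_texture_ps: Optional[ParameterSet] = None
        self.active_depth_ps: Optional[ParameterSet] = None

    def activate_texture_parameter_set(self, ps: ParameterSet) -> None:
        # Activate the parameter set specifically for the texture component.
        self.active_texture_ps = ps

    def code_texture_component(self, texture_samples) -> str:
        # Code the texture component based on the activated texture parameter set.
        assert self.active_texture_ps is not None, "no texture parameter set active"
        return f"texture coded with parameter set {self.active_texture_ps.ps_id}"


coder = ViewComponentCoder()
coder.activate_texture_parameter_set(ParameterSet(ps_id=0, payload={"pic_width": 1920}))
print(coder.code_texture_component(texture_samples=[]))
```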
Abstract:
An apparatus for coding video information may include computing hardware configured to: when a current picture is to be predicted using at least inter-layer motion prediction (ILMP): process a collocated reference index value associated with the current picture, wherein the collocated reference index value indicates a first reference picture that is used in predicting the current picture using inter-layer prediction (ILP); and determine whether the first reference picture indicated by the collocated reference index value is enabled for ILMP; when the current picture is to be predicted using at least inter-layer sample prediction (ILSP): process a reference index value associated with a block in the current picture, wherein the reference index value indicates a second reference picture that is used in predicting the block in the current picture using ILP; and determine whether the second reference picture indicated by the reference index value is enabled for ILSP.
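A minimal sketch of the two checks described above, assuming each reference picture record carries flags for whether it is enabled for ILMP and ILSP; the data layout and names are illustrative.

```python
def check_ilp_constraints(ref_pic_list, collocated_ref_idx, block_ref_idx):
    """Return whether the collocated reference picture is enabled for ILMP and
    whether the block's reference picture is enabled for ILSP."""
    ilmp_ok = ref_pic_list[collocated_ref_idx]["ilmp_enabled"]
    ilsp_ok = ref_pic_list[block_ref_idx]["ilsp_enabled"]
    return ilmp_ok, ilsp_ok


refs = [
    {"ilmp_enabled": True,  "ilsp_enabled": False},  # motion-only inter-layer reference
    {"ilmp_enabled": False, "ilsp_enabled": True},   # sample-only inter-layer reference
]
print(check_ilp_constraints(refs, collocated_ref_idx=0, block_ref_idx=1))  # (True, True)
```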
Abstract:
In general, techniques are described for coding picture order count values identifying long-term reference pictures. A video decoding device comprising a processor may perform the techniques. The processor may be configured to determine a number of bits used to represent the least significant bits of the picture order count value that identifies a long-term reference picture to be used when decoding at least a portion of a current picture, and to parse the determined number of bits from a bitstream representative of the encoded video data. The parsed bits represent the least significant bits of the picture order count value. The processor retrieves the long-term reference picture from a decoded picture buffer based on the least significant bits, and decodes at least the portion of the current picture using the retrieved long-term reference picture.
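A short sketch of the two steps, assuming the signaled LSB count and the decoded picture buffer are available as plain Python values; the helper names are illustrative.

```python
def parse_bits(bit_iter, num_bits):
    """Read num_bits bits (MSB first) from an iterator of 0/1 values."""
    value = 0
    for _ in range(num_bits):
        value = (value << 1) | next(bit_iter)
    return value


def find_long_term_reference(dpb_pocs, poc_lsb, num_bits):
    """Return the DPB picture whose POC least significant bits match the parsed value."""
    mask = (1 << num_bits) - 1
    for poc in dpb_pocs:
        if (poc & mask) == poc_lsb:
            return poc
    return None


bits = iter([1, 0, 1, 1])                              # 4 signaled LSB bits -> 0b1011 = 11
lsb = parse_bits(bits, 4)
print(find_long_term_reference([27, 40, 58], lsb, 4))  # 27, since 27 & 0xF == 11
```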
Abstract:
A device generates, in a container box of a file for storage of video contents, a SampleGroupDescription box that provides a sample group description for a sample grouping. The SampleGroupDescription box includes a grouping type syntax element having a particular value. Additionally, the SampleGroupDescription box includes one or more sample group entries. The device generates, in the same container box, a plurality of SampleToGroup boxes. Each respective SampleToGroup box of the plurality of SampleToGroup boxes includes a respective grouping type syntax element having the particular value, includes a respective sample count syntax element indicating a number of samples in a respective sample grouping, and includes a respective group description index syntax element indicating an index of an entry in the SampleGroupDescription box which describes samples of the respective sample grouping. Each sample of the respective sample grouping comprises a respective picture of the video contents.
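The layout below is an illustrative Python mirror of the boxes described above, not an ISO base media file format writer; the grouping type value "xxxx" and the entry contents are placeholders.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SampleGroupDescriptionBox:      # 'sgpd'
    grouping_type: str
    entries: List[dict]               # one sample group entry per description


@dataclass
class SampleToGroupBox:               # 'sbgp'
    grouping_type: str                # same value as in the SampleGroupDescription box
    sample_count: int                 # number of samples (pictures) in the grouping
    group_description_index: int      # 1-based index into the sgpd entries


sgpd = SampleGroupDescriptionBox("xxxx", entries=[{"desc": "group entry 1"},
                                                  {"desc": "group entry 2"}])
container = [
    sgpd,
    SampleToGroupBox("xxxx", sample_count=24, group_description_index=1),
    SampleToGroupBox("xxxx", sample_count=8,  group_description_index=2),
]
for box in container:
    print(box)
```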
Abstract:
This disclosure describes techniques for selection of coded picture buffer (CPB) parameters used to define a CPB for a video coding device for clean random access (CRA) pictures and broken link access (BLA) pictures in a video bitstream. A video coding device receives a bitstream including one or more CRA pictures or BLA pictures, and also receives a message indicating whether to use an alternative set of CPB parameters for at least one of the CRA pictures or BLA pictures. The message may be received from an external means, such as a processing means included in a streaming server or network entity. The video coding device sets a variable defined to indicate the set of CPB parameters for a given one of the pictures based on the received message, and selects the set of CPB parameters for the given one of the pictures based on the variable for the picture.
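A hedged sketch of the selection step: an externally supplied message sets a per-picture variable, and that variable picks the default or the alternative coded picture buffer (CPB) parameter set. The variable name and dictionary keys are illustrative.

```python
def select_cpb_params(default_cpb, alt_cpb, message_use_alt):
    # Variable set for the given CRA/BLA picture based on the received message.
    use_alt_cpb_params_flag = 1 if message_use_alt else 0
    return alt_cpb if use_alt_cpb_params_flag else default_cpb


default_cpb = {"initial_cpb_removal_delay": 90000, "cpb_size_value": 1_000_000}
alt_cpb     = {"initial_cpb_removal_delay": 45000, "cpb_size_value": 1_000_000}
print(select_cpb_params(default_cpb, alt_cpb, message_use_alt=True))
```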
Abstract:
As one example, techniques for decoding video data include receiving a bitstream that includes one or more pictures of a coded video sequence (CVS), decoding a first picture according to a decoding order, wherein the first picture is a random access point (RAP) picture that is not an instantaneous decoding refresh (IDR) picture, and decoding at least one other picture following the first picture according to the decoding order based on the decoded first picture. As another example, techniques for encoding video data include generating a bitstream that includes one or more pictures of a CVS, wherein a first picture according to the decoding order is a RAP picture that is not an IDR picture, and avoiding including in the bitstream at least one other picture, other than the first picture, that corresponds to a leading picture associated with the first picture.
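The sketch below illustrates the encoder-side example under simplifying assumptions: the first picture of the bitstream is a non-IDR RAP picture, and any picture that would be a leading picture of that first picture is omitted. The picture records, fields, and type strings are hypothetical.

```python
def build_bitstream(pictures):
    """pictures: list of dicts in decoding order with a 'nal_type' field and an
    'is_leading_of_first' flag (illustrative fields)."""
    first = pictures[0]
    assert first["nal_type"] in ("IDR", "CRA", "BLA"), "first picture must be a RAP picture"
    bitstream = []
    for pic in pictures:
        if first["nal_type"] != "IDR" and pic.get("is_leading_of_first", False):
            continue  # avoid including leading pictures of the non-IDR first RAP picture
        bitstream.append(pic)
    return bitstream


pics = [
    {"nal_type": "CRA", "poc": 8},
    {"nal_type": "RASL", "poc": 6, "is_leading_of_first": True},   # omitted
    {"nal_type": "TRAIL", "poc": 12},
]
print([p["poc"] for p in build_bitstream(pics)])   # [8, 12]
```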
Abstract:
Techniques for encapsulating video streams containing multiple coded views in a media file are described herein. In one example, a method includes parsing a track of multiview video data, wherein the track includes one or more views, including only one of a texture view and a depth view of a particular view. The method further includes parsing a track reference to determine a dependency of the track on a referenced track indicated in the track reference. Track reference types include ‘deps’, which indicates that the track includes the depth view and the referenced track includes the texture view; ‘tref’, which indicates that the track depends on the texture view stored in the referenced track; and ‘dref’, which indicates that the track depends on the depth view stored in the referenced track.
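A short interpretive sketch of the three track reference types listed above; the mapping restates the abstract, while the function name is illustrative.

```python
def describe_track_reference(ref_type):
    if ref_type == "deps":
        return "track carries the depth view; referenced track carries the texture view"
    if ref_type == "tref":
        return "track depends on the texture view stored in the referenced track"
    if ref_type == "dref":
        return "track depends on the depth view stored in the referenced track"
    raise ValueError(f"unknown track reference type: {ref_type!r}")


for ref_type in ("deps", "tref", "dref"):
    print(ref_type, "->", describe_track_reference(ref_type))
```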
Abstract:
A video decoder may be configured to decode a first value representative of a difference between a base most significant bits (MSBs) value of a picture order count (POC) value of a current picture of video data and a first MSBs value of a first POC value of a first long-term reference picture of the video data; decode a second value representative of a difference between a second MSBs value of a second POC value of a second long-term reference picture of the video data and the first MSBs value, wherein the first POC value and the second POC value have different least significant bits values; and decode at least a portion of the current picture of the video data relative to at least one of the first long-term reference picture and the second long-term reference picture.
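A worked sketch of the reconstruction the two signaled differences allow, assuming 4 POC LSB bits (so one MSB cycle is 16); the function name, the LSB width, and the example values are illustrative.

```python
MAX_POC_LSB = 16   # assume 4 LSB bits for the example


def reconstruct_ltrp_pocs(curr_poc, delta_msb_1, delta_msb_2, lsb_1, lsb_2):
    curr_msb = curr_poc - (curr_poc % MAX_POC_LSB)
    # First value: difference (in MSB cycles) between the current picture's MSBs
    # and the first long-term reference picture's MSBs.
    msb_1 = curr_msb - delta_msb_1 * MAX_POC_LSB
    # Second value: coded differentially, relative to the first picture's MSBs
    # rather than the current picture's MSBs.
    msb_2 = msb_1 - delta_msb_2 * MAX_POC_LSB
    return msb_1 + lsb_1, msb_2 + lsb_2


# Current POC 70 (MSBs 64); first LTRP at POC 35 (MSBs 32, LSBs 3) -> first value 2;
# second LTRP at POC 21 (MSBs 16, LSBs 5) -> second value 1 relative to the first.
print(reconstruct_ltrp_pocs(70, delta_msb_1=2, delta_msb_2=1, lsb_1=3, lsb_2=5))  # (35, 21)
```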
Abstract:
In an example, the disclosure provides for receiving a coded video sequence comprising encoded pictures of a video sequence and receiving timing parameters for the coded video sequence that include a time scale and a number of units in a clock tick at most once in a video parameter set (VPS) syntax structure referenced by the coded video sequence and at most once in a video usability information (VUI) part of a sequence parameter set (SPS) syntax structure referenced by the coded video sequence. Another example provides for encoding pictures of a video sequence to generate a coded video sequence and signaling timing parameters for the coded video sequence by signaling, at least in part, a time scale and a number of units in a clock tick at most once in a VPS syntax structure and at most once in a VUI part of an SPS syntax structure.
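As a brief illustration of what the two timing parameters convey, the clock tick duration is the number of units in a clock tick divided by the time scale; the example numbers below are arbitrary, not values required by the disclosure.

```python
def clock_tick_seconds(num_units_in_tick, time_scale):
    return num_units_in_tick / time_scale


# A time scale of 90000 with 3000 units per tick gives a 30 Hz picture clock.
tick = clock_tick_seconds(3000, 90000)
print(tick)        # 0.0333... seconds per tick
print(1 / tick)    # 30.0 ticks per second
```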