Abstract:
A computing device obtains a Network Abstraction Layer (NAL) unit header of a NAL unit of the multi-layer video data. The NAL unit header comprises a layer identifier syntax element having a value that specifies an identifier of a layer of the NAL unit. The layer identifier syntax element comprises a plurality of bits that represent the value within a defined range of values. A requirement of the bitstream conforming to a video coding standard is that the value of the layer identifier syntax element is less than the maximum value of the range of values.
Abstract:
In one example, a device for coding video data includes a video coder configured to determine a first target reference index representing a reference picture to which a candidate motion vector predictor for a current motion vector refers, determine, for the candidate motion vector predictor and during temporal motion vector prediction (TMVP) of the current motion vector, a value for an additional target reference index that is equal to a predetermined value, and code the current motion vector based at least in part on the value of the additional reference index.
Abstract:
In an example, a method of decoding video data includes decoding, by a video decoder, a first picture of multi-layer video data, where the first picture has a first picture type. The method also includes, based on the first picture having the first picture type, decoding, by the video decoder, data representative of picture order count (POC) most significant bits (MSB) value for the first picture.
Abstract:
A computing device generates a file that comprises a track box that contains metadata for a track in the file. Media data for the track comprises a sequence of samples. Each of the samples is a video access unit of multi-layer video data. As part of generating the file, the computing device generates, in the file, an additional box that documents all of the samples containing at least one Intra Random Access Point (IRAP) picture.
Abstract:
A system and method for coding a picture in a multi-layer bitstream is disclosed. In one aspect, the method includes encoding at least one layer of the multi-layer bitstream in accordance with a first coding scheme. The multi-layer bitstream may comprise a base layer. The method further includes encoding a conformance window flag and at least one position offset for the picture in a Video Parameter Set (VPS) of the base layer. The conformance window flag may indicate that the VPS comprises the at least one position offset.
Abstract:
A device generates, in a file storing a multi-layer bitstream, a track box that contains metadata for a track. The device generates, in the track box, a sample description box containing a sample group description entry. Additionally, the device generates, in the track box, a sample-to-group box for the track. The sample-to-group box mapping samples of the track into a sample group. The sample-to-group box specifies target layers among layers present in the track. Each of the target layers contains at least one picture belonging to a particular picture type. The sample group is one of a temporal sub-layer access sample group and the particular picture type is a temporal sub-layer access picture type, or a stepwise temporal sub-layer access sample group and the particular picture type is a step-wise temporal sub-layer access picture type.
Abstract:
In one example, a device for decoding video data includes a processor configured to determine, for a supplemental enhancement information (SEI) network abstraction layer (NAL) unit of a bitstream, whether a NAL unit type value for the SEI NAL unit indicates that the NAL unit comprises a prefix SEI NAL unit including a prefix SEI message or a suffix SEI NAL unit including a suffix SEI message, and decode video data of the bitstream following the SEI NAL unit based on whether the SEI NAL unit is the prefix SEI NAL unit or the suffix SEI NAL unit and data of the SEI NAL unit.
Abstract:
In one example, a device for coding video data includes a video coder configured to code a value for a syntax element representative of whether any two reference layer samples, collocated with two respective enhancement layer picture samples within a common enhancement layer tile, must be within a common reference layer tile, and code the enhancement layer picture samples based at least in part on the value of the syntax element.
Abstract:
An example method of decoding video data includes receiving a plurality of pictures, receiving, in a message associated with a first picture of the plurality of pictures, information indicating a picture order count (POC) value of a recovery point picture of a gradual decoder refresh (GDR) set, when a picture that follows the first picture in decoding order has the POC value that is equal to a POC value of the recovery point picture, identifying the picture having the POC value equal to the POC value of the recovery point picture, as a recovery point picture, and when none of the pictures that follow the first picture has the POC value that is equal to the POC value of the recovery point picture, identifying one of the pictures that has a POC value greater than the POC value of the recovery point picture as the recovery point picture.
Abstract:
A video processing device includes a memory storing video data and one or more processors configured to: receive a first network abstraction layer (NAL) unit comprising a first picture of an access unit; in response to determining the first NAL unit comprises an intra random access point (IRAP) picture and in response to a NAL unit type for the first NAL unit indicating the presence of an instantaneous decoding refresh (IDR) picture without any associated leading pictures for a second NAL unit of the access unit comprising another IRAP picture, determine a NAL unit type for the second NAL unit to be a NAL unit type indicating the presence of an IDR picture without any associated leading pictures; and, process the first NAL unit and the second NAL unit based on the NAL unit type for the second NAL unit.