Abstract:
A multi-layer video decoder is configured to determine, based on a list of triplet entries, whether the multi-layer video decoder is capable of decoding a bitstream that comprises an encoded representation of the multi-layer video data. The number of triplet entries in the list is equal to a number of single-layer decoders in the multi-layer video decoder. Each respective triplet entry in the list of triplet entries indicates a profile, a tier, and a level for a respective single-layer decoder in the multi-layer video decoder. The multi-layer video decoder is configured such that, based on the multi-layer video decoder being capable of decoding the bitstream, the multi-layer video decoder decodes the bitstream.
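For illustration, a minimal Python sketch of such a capability check, assuming invented names (PTL, covers, can_decode) and a simplified matching rule rather than the normative HEVC procedure:

    from dataclasses import dataclass
    from itertools import permutations

    @dataclass(frozen=True)
    class PTL:                  # one triplet entry: profile, tier, level
        profile: int
        tier: int               # 0 = Main tier, 1 = High tier
        level: int              # level_idc-style value, e.g. 120 for level 4.0

    def covers(dec: PTL, layer: PTL) -> bool:
        # A single-layer decoder can handle a layer when the profile matches
        # and its tier and level are at least as high as the layer requires.
        return (dec.profile == layer.profile
                and dec.tier >= layer.tier
                and dec.level >= layer.level)

    def can_decode(decoder_ptls, layer_ptls) -> bool:
        # Try every assignment of layers to single-layer decoders; the counts
        # involved are small enough for brute force.
        if len(layer_ptls) > len(decoder_ptls):
            return False
        return any(all(covers(d, l) for d, l in zip(assignment, layer_ptls))
                   for assignment in permutations(decoder_ptls, len(layer_ptls)))

    # A two-decoder device checking a two-layer bitstream.
    decoders = [PTL(1, 0, 123), PTL(1, 0, 120)]
    layers = [PTL(1, 0, 120), PTL(1, 0, 93)]
    print(can_decode(decoders, layers))   # True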
Abstract:
Methods for coding an inter-layer reference picture set (RPS) and coding end of bitstream (EoB) network abstraction layer (NAL) units in multi-layer coding are disclosed. In one aspect, the method includes determining whether a candidate inter-layer reference picture is present in the video information. The video information includes an inter-layer RPS including a plurality of subsets. The method further includes determining an inter-layer RPS subset to which the candidate inter-layer reference picture belongs in response to determining that the candidate inter-layer reference picture is not present, and indicating that no reference picture is present in the inter-layer RPS subset to which the candidate inter-layer reference picture belongs.
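A minimal Python sketch of that signalling, assuming a hypothetical two-subset inter-layer RPS and an invented classification rule (the real derivation follows the codec's syntax and semantics):

    def signal_inter_layer_rps(candidate, decoded_pocs):
        # Classify the candidate into one of two hypothetical subsets.
        subset = 0 if candidate["used_for_sample_prediction"] else 1
        if candidate["poc"] not in decoded_pocs:
            # Candidate is absent: indicate that no reference picture is
            # present in the subset the candidate belongs to.
            return {"subset": subset, "reference_present": False}
        return {"subset": subset, "reference_present": True, "poc": candidate["poc"]}

    print(signal_inter_layer_rps({"poc": 8, "used_for_sample_prediction": True},
                                 decoded_pocs={0, 4}))
    # {'subset': 0, 'reference_present': False}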
Abstract:
Bitstream restrictions or constraints on the partitioning of pictures across layers of video data are described. In some examples, the number of tiles per picture for each layer of a plurality of layers is constrained based on a maximum number of tiles per picture for the layer. In some examples, the number of tiles per picture for each layer of the plurality of layers is no greater than the maximum number of tiles per picture for the layer. In some examples, a sum of the numbers of tiles per picture for the plurality of layers is no greater than a sum of the maximum numbers of tiles per picture for the plurality of layers. In some examples, a second largest coding unit (LCU) or coding tree block (CTB) size for a second layer is constrained based on (e.g., to be equal to) a first LCU size for a first layer.
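These constraints lend themselves to a simple conformance check; a minimal Python sketch with invented function names:

    def tiles_constraints_ok(tiles_per_picture, max_tiles_per_picture):
        # Per-layer constraint: each layer's tile count does not exceed its maximum.
        per_layer = all(n <= m for n, m in zip(tiles_per_picture, max_tiles_per_picture))
        # Aggregate constraint: the sum over layers does not exceed the sum of the maxima.
        total = sum(tiles_per_picture) <= sum(max_tiles_per_picture)
        return per_layer and total

    def ctb_size_constraint_ok(first_layer_ctb, second_layer_ctb):
        # One example form of the constraint: the second layer's LCU/CTB size
        # must equal the first layer's.
        return second_layer_ctb == first_layer_ctb

    print(tiles_constraints_ok([4, 8], [10, 10]), ctb_size_constraint_ok(64, 64))  # True True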
Abstract:
In an example, a method of coding video data includes coding data of a video parameter set (VPS) of a multi-layer bitstream, including at least one of data that indicates whether any layers of the multi-layer bitstream have an inter-layer prediction restriction or data that indicates whether tile boundaries are aligned between at least two of the layers of the multi-layer bitstream, and coding the multi-layer bitstream in accordance with the data of the VPS.
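A minimal Python sketch of carrying the two indications as VPS flags; the field names loosely echo the SHVC/MV-HEVC draft syntax, but the packing shown here is purely illustrative:

    from dataclasses import dataclass

    @dataclass
    class VpsInfo:
        ilp_restricted_flag: bool           # any layer has an inter-layer prediction restriction
        tile_boundaries_aligned_flag: bool  # tile boundaries aligned between (at least two) layers

    def write_vps_flags(vps: VpsInfo) -> int:
        # Pack both indications into two bits of an illustrative VPS payload.
        return (int(vps.ilp_restricted_flag) << 1) | int(vps.tile_boundaries_aligned_flag)

    def read_vps_flags(bits: int) -> VpsInfo:
        return VpsInfo(bool((bits >> 1) & 1), bool(bits & 1))

    print(read_vps_flags(write_vps_flags(VpsInfo(True, False))))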
Abstract:
A computing device generates a file that comprises a track box that contains metadata for a track in the file. Media data for the track comprises a sequence of samples, each of the samples being a video access unit of multi-layer video data. As part of generating the file, the computing device generates, in the file, a sub-sample information box that contains flags that specify a type of sub-sample information given in the sub-sample information box. When the flags have a particular value, a sub-sample corresponding to the sub-sample information box contains exactly one coded picture and zero or more non-Video Coding Layer (VCL) Network Abstraction Layer (NAL) units associated with the coded picture.
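For concreteness, a minimal Python sketch that serializes a version-0 sub-sample information ('subs') box with a single sample entry, following the ISO base media file format layout; the flags value and sub-sample size used in the example are made up:

    import struct

    def subsample_information_box(flags: int, subsample_sizes) -> bytes:
        # FullBox header: version = 0, 24-bit flags selecting the type of
        # sub-sample information given in the box.
        body = struct.pack(">B", 0) + flags.to_bytes(3, "big")
        body += struct.pack(">I", 1)                          # entry_count = 1
        body += struct.pack(">IH", 1, len(subsample_sizes))   # sample_delta, subsample_count
        for size in subsample_sizes:
            # subsample_size, priority, discardable, codec_specific_parameters
            body += struct.pack(">HBBI", size, 0, 0, 0)
        return struct.pack(">I4s", 8 + len(body), b"subs") + body

    # One sub-sample holding a coded picture plus its associated non-VCL NAL units.
    box = subsample_information_box(flags=5, subsample_sizes=[1420])
    print(len(box), box[4:8])   # 30 b'subs'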
Abstract:
Processing high dynamic range and/or wide color gamut video data using a fixed-point implementation. A method of processing video data may include receiving one or more supplemental enhancement information (SEI) messages that contain information specifying how to determine parameters for performing an inverse dynamic range adjustment process, receiving decoded video data, and performing the inverse dynamic range adjustment process on the decoded video data using fixed-point computing in accordance with the information in the one or more SEI messages.
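A minimal Python sketch of an inverse dynamic range adjustment step carried out purely in fixed-point arithmetic; the precision, scale, and offset values are invented examples, not values taken from an actual SEI message:

    FRACTION_BITS = 10            # illustrative fixed-point precision

    def to_fixed(x: float) -> int:
        return int(round(x * (1 << FRACTION_BITS)))

    def inverse_dra(sample: int, scale_fp: int, offset_fp: int) -> int:
        # Inverse dynamic range adjustment of one decoded sample using only
        # integer arithmetic: out = sample * scale + offset, with rounding.
        acc = sample * scale_fp + offset_fp
        return (acc + (1 << (FRACTION_BITS - 1))) >> FRACTION_BITS

    # Parameters that would be derived from the SEI message (values here are made up).
    scale, offset = to_fixed(1.25), to_fixed(16.0)
    print(inverse_dra(512, scale, offset))   # 656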
Abstract:
In one example, a device for coding video data includes a video coder configured to code an intra random access point (IRAP) picture of a partially aligned IRAP access unit of video data, and code data that indicates, when performing random access from the partially aligned IRAP access unit, at least one picture of a video coding layer that is not correctly decodable. When the video coder comprises a video decoder, the video decoder may skip decoding of the pictures that are not correctly decodable, assuming random access has been performed starting from the partially aligned IRAP access unit.
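A minimal Python sketch of the decoder-side behavior, with an invented picture representation: after random access at a partially aligned IRAP access unit, pictures flagged as not correctly decodable are simply skipped:

    def decode_from_random_access(access_units, ra_index):
        decoded = []
        for au in access_units[ra_index:]:
            for pic in au:
                if not pic["correctly_decodable"]:
                    continue          # skip pictures that cannot be decoded correctly
                decoded.append((pic["layer"], pic["poc"]))
        return decoded

    aus = [
        [{"layer": 0, "poc": 8, "correctly_decodable": True},    # IRAP in the base layer
         {"layer": 1, "poc": 8, "correctly_decodable": False}],  # non-IRAP enhancement picture
        [{"layer": 0, "poc": 9, "correctly_decodable": True},
         {"layer": 1, "poc": 9, "correctly_decodable": True}],
    ]
    print(decode_from_random_access(aus, ra_index=0))   # [(0, 8), (0, 9), (1, 9)]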
Abstract:
In an example, a method of coding video data includes determining a location of a reference sample associated with a reference picture of video data based on one or more scaled offset values, where the reference picture is included in a first layer of a multi-layer bitstream and the one or more scaled offset values indicate a difference in scale between the first layer and a second, different layer. The method also includes determining a location of a collocated reference block of video data in the first layer based on the location of the reference sample, and coding a current block of video data in the second layer relative to the collocated reference block.
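A minimal Python sketch of the location derivation, assuming 16.16 fixed-point scale factors and a simplified treatment of the scaled offsets (the normative derivation is more involved):

    def reference_sample_location(x_cur, y_cur, scaled_offset_left, scaled_offset_top,
                                  scale_x, scale_y):
        # Map a position in the enhancement (second) layer to the reference
        # (first) layer: subtract the scaled offsets, then apply the
        # inter-layer scale factors (16.16 fixed point) with rounding.
        x_ref = ((x_cur - scaled_offset_left) * scale_x + (1 << 15)) >> 16
        y_ref = ((y_cur - scaled_offset_top) * scale_y + (1 << 15)) >> 16
        return x_ref, y_ref

    # 2x spatial scalability with no cropping offsets: scale factor = 0.5 in 16.16.
    print(reference_sample_location(128, 64, 0, 0, 1 << 15, 1 << 15))   # (64, 32)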
Abstract:
In one example, a device for decoding video data includes a video decoder configured to decode a value representative of a difference between most significant bits (MSBs) of a reference picture order count (POC) value and MSBs of a long-term reference picture (LTRP) POC value, wherein the reference POC value corresponds to a picture for which data must have been received in order to properly decode a current picture, determine the MSBs of the LTRP POC value based on the decoded value and the reference POC value, and decode at least a portion of the current picture relative to the LTRP based at least in part on the LTRP POC value. The picture for which data must have been received in order to properly decode the current picture may correspond to the current picture itself or a most recent random access point (RAP) picture.
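A minimal Python sketch of the MSB derivation, with a simplified MSB/LSB split; max_poc_lsb would come from the sequence parameters, and the names are illustrative:

    def derive_ltrp_poc(reference_poc, delta_poc_msb, poc_lsb_ltrp, max_poc_lsb=256):
        # MSBs of the long-term reference picture's POC are recovered from the
        # signalled MSB difference and the reference POC (the current picture
        # or the most recent RAP picture); the LSBs are signalled directly.
        reference_msb = reference_poc - (reference_poc % max_poc_lsb)
        ltrp_msb = reference_msb - delta_poc_msb * max_poc_lsb
        return ltrp_msb + poc_lsb_ltrp

    # Example: current POC 1035, MSB cycle difference of 2, LTRP LSBs of 17.
    print(derive_ltrp_poc(1035, 2, 17))   # 529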
Abstract:
Systems, methods, and devices for coding multilayer video data are disclosed that may include encoding, decoding, transmitting, or receiving multilayer video data. The systems, methods, and devices may receive or transmit a non-entropy coded representation format within a video parameter set (VPS). The systems, methods, and devices may code (encode or decode) video data based on the non-entropy coded representation format within the VPS, wherein the representation format includes one or more of chroma format, whether different color planes are separately coded, picture width, picture height, luma bit depth, and chroma bit depth.
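A minimal Python sketch of a representation format carried in fixed-length (non-entropy-coded) fields, so the values can be read without invoking the entropy decoder; the field widths chosen here are arbitrary:

    import struct
    from dataclasses import dataclass

    @dataclass
    class RepFormat:                     # representation format signalled in the VPS
        chroma_format_idc: int           # e.g. 1 = 4:2:0
        separate_colour_plane_flag: int  # whether colour planes are coded separately
        pic_width_in_luma_samples: int
        pic_height_in_luma_samples: int
        bit_depth_luma: int
        bit_depth_chroma: int

    def pack_rep_format(rf: RepFormat) -> bytes:
        # Fixed-length fields stand in for "non-entropy coded" signalling.
        return struct.pack(">BBHHBB", rf.chroma_format_idc, rf.separate_colour_plane_flag,
                           rf.pic_width_in_luma_samples, rf.pic_height_in_luma_samples,
                           rf.bit_depth_luma, rf.bit_depth_chroma)

    def unpack_rep_format(data: bytes) -> RepFormat:
        return RepFormat(*struct.unpack(">BBHHBB", data))

    rf = RepFormat(1, 0, 1920, 1080, 10, 10)
    print(unpack_rep_format(pack_rep_format(rf)) == rf)   # True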