Abstract:
Bitstream restrictions or constraints on the partitioning of pictures across layers of video data are described. In some examples, the number of tiles per picture for each layer of a plurality of layers is constrained based on a maximum number of tiles per picture for the layer. In some examples, the number of tiles per picture for each layer of the plurality of layers is no greater than the maximum number of tiles per picture for the layer. In some examples, a sum of the numbers of tiles per picture for the plurality of layers is no greater than a sum of the maximum numbers of tiles per picture for the plurality of layers. In some examples, a second largest coding unit (LCU) or coding tree block (CTB) size for a second layer is constrained based on a first LCU size for a first layer, e.g., constrained to be equal to the first LCU size.
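The partitioning restrictions above can be expressed as simple conformance checks. The following C++ sketch is illustrative only: the structure and field names (LayerTileInfo, numTilesPerPicture, maxTilesPerPicture) are assumptions, and the two functions correspond to the per-layer and aggregate examples, which the abstract presents as alternatives.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical per-layer description; actual limits would be derived from
// the layer's tier and level.
struct LayerTileInfo {
    uint32_t numTilesPerPicture;  // tiles actually used per picture
    uint32_t maxTilesPerPicture;  // maximum allowed for this layer
};

// Variant 1: every layer stays within its own per-picture tile limit.
bool perLayerTileLimitOk(const std::vector<LayerTileInfo>& layers) {
    for (const auto& l : layers)
        if (l.numTilesPerPicture > l.maxTilesPerPicture)
            return false;
    return true;
}

// Variant 2: the total number of tiles across layers does not exceed the sum
// of the per-layer maxima (one layer may use headroom left by another).
bool aggregateTileLimitOk(const std::vector<LayerTileInfo>& layers) {
    uint64_t used = 0, cap = 0;
    for (const auto& l : layers) {
        used += l.numTilesPerPicture;
        cap  += l.maxTilesPerPicture;
    }
    return used <= cap;
}
```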
Abstract:
This disclosure describes techniques for signaling deblocking filter parameters for a current slice of video data with reduced bitstream overhead. Deblocking filter parameters may be coded in one or more of a picture layer parameter set and a slice header. The techniques reduce a number of bits used to signal the deblocking filter parameters by coding a first syntax element that indicates whether deblocking filter parameters are present in both the picture layer parameter set and the slice header, and only coding a second syntax element in the slice header when both sets of deblocking filter parameters are present. Coding the second syntax element is eliminated when deblocking filter parameters are present in only one of the picture layer parameter set or the slice header. The second syntax element indicates which set of deblocking filter parameters to use to define a deblocking filter applied to a current slice.
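The conditional coding of the second syntax element can be pictured with a small parsing sketch. The reader and field names below (BitReader, useSliceHeaderParams, betaOffsetDiv2, tcOffsetDiv2) are assumptions made for illustration and are simplified relative to the actual parameter-set and slice-header syntax.

```cpp
#include <queue>

// Toy bitstream reader: values are pre-loaded instead of parsed from real
// bits; a real decoder would read flags and signed Exp-Golomb codes.
struct BitReader {
    std::queue<int> values;
    int next() { int v = values.front(); values.pop(); return v; }
    bool readFlag() { return next() != 0; }
    int  readSe()   { return next(); }   // signed Exp-Golomb stand-in
};

struct PpsDeblockingParams {
    bool presentInBothPpsAndSliceHeader;  // first syntax element (in the PPS)
    int  betaOffsetDiv2 = 0;
    int  tcOffsetDiv2   = 0;
};

struct SliceDeblockingParams {
    bool useSliceHeaderParams = false;    // second syntax element
    int  betaOffsetDiv2 = 0;
    int  tcOffsetDiv2   = 0;
};

// Parse slice-header deblocking information. The second syntax element is
// read only when the first syntax element indicates that parameters may be
// present in both places; otherwise it is never coded and the PPS values
// (or slice-header values) are used directly.
SliceDeblockingParams parseSliceDeblocking(BitReader& br,
                                           const PpsDeblockingParams& pps) {
    SliceDeblockingParams sh;
    if (pps.presentInBothPpsAndSliceHeader)
        sh.useSliceHeaderParams = br.readFlag();   // which set defines the filter
    if (sh.useSliceHeaderParams) {
        sh.betaOffsetDiv2 = br.readSe();
        sh.tcOffsetDiv2   = br.readSe();
    } else {
        sh.betaOffsetDiv2 = pps.betaOffsetDiv2;    // fall back to PPS values
        sh.tcOffsetDiv2   = pps.tcOffsetDiv2;
    }
    return sh;
}
```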
Abstract:
A method for processing video data in a real-time transport protocol (RTP) payload includes encapsulating video data in a single network abstraction layer (NAL) unit packet for an RTP session. The single NAL unit packet contains a single NAL unit. The method may also include encapsulating decoding order number information in the single NAL unit packet based on at least one of: the RTP session being in a multi-stream transmission (MST) mode, or a maximum number of NAL units that may precede the NAL unit in a de-packetization buffer in reception order and follow the NAL unit in decoding order being greater than 0.
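One way to picture the packetization: decoding order number (DON) information is included only when the session conditions above hold; otherwise the payload carries just the NAL unit. The field ordering and the 16-bit DON width in this sketch are assumptions for illustration, not the normative RTP payload layout.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Decide whether DON information must accompany the single NAL unit packet,
// per the two conditions described above. Parameter names are illustrative.
bool donRequired(bool multiStreamTransmission, int maxPrecedingNalUnits) {
    return multiStreamTransmission || maxPrecedingNalUnits > 0;
}

// Build an illustrative single NAL unit packet payload: the two-byte NAL unit
// header, an optional 16-bit DON field, then the remaining NAL unit bytes.
std::vector<uint8_t> buildSingleNalUnitPayload(const std::vector<uint8_t>& nalUnit,
                                               bool includeDon, uint16_t don) {
    assert(nalUnit.size() >= 2);   // NAL unit header is two bytes
    std::vector<uint8_t> payload(nalUnit.begin(), nalUnit.begin() + 2);
    if (includeDon) {
        payload.push_back(static_cast<uint8_t>(don >> 8));    // DON, high byte
        payload.push_back(static_cast<uint8_t>(don & 0xFF));  // DON, low byte
    }
    payload.insert(payload.end(), nalUnit.begin() + 2, nalUnit.end());
    return payload;
}
```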
Abstract:
A video encoder is configured to determine a picture size for one or more pictures included in a video sequence. The picture size associated with the video sequence may be a multiple of an aligned coding unit size for the video sequence. In one example, the aligned coding unit size for the video sequence may comprise a minimum coding unit size selected from a plurality of smallest coding unit sizes corresponding to different pictures in the video sequence. A video decoder is configured to obtain syntax elements to determine the picture size and the aligned coding unit size for the video sequence. The video decoder decodes the pictures included in the video sequence with the picture size, and stores the decoded pictures in a decoded picture buffer.
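The alignment relationship amounts to simple arithmetic: picture dimensions are checked against, or rounded up to, a multiple of the aligned coding unit size. A minimal sketch with hypothetical names for the quantities involved:

```cpp
#include <algorithm>
#include <cstdint>
#include <vector>

// Round a dimension up to the next multiple of the aligned coding unit size.
uint32_t alignDimension(uint32_t dim, uint32_t alignedCuSize) {
    return ((dim + alignedCuSize - 1) / alignedCuSize) * alignedCuSize;
}

// One possible choice of the aligned coding unit size for a sequence: the
// minimum of the smallest coding unit sizes used by its pictures
// (assumes a non-empty list).
uint32_t alignedCuSizeForSequence(const std::vector<uint32_t>& smallestCuSizes) {
    return *std::min_element(smallestCuSizes.begin(), smallestCuSizes.end());
}

// Check that a picture size is a multiple of the aligned coding unit size.
bool pictureSizeAligned(uint32_t width, uint32_t height, uint32_t alignedCuSize) {
    return width % alignedCuSize == 0 && height % alignedCuSize == 0;
}
```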
Abstract:
Systems, methods, and devices for coding multilayer video data are disclosed that may include encoding, decoding, transmitting, or receiving multilayer video data. The systems, methods, and devices may receive or transmit a non-entropy coded representation format within a video parameter set (VPS). The systems, methods, and devices may code (encode or decode) video data based on the non-entropy coded representation format within the VPS, wherein the representation format includes one or more of chroma format, whether different colour planes are separately coded, picture width, picture height, luma bit depth, and chroma bit depth.
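Because the representation format is non-entropy coded, its fields can be read with fixed-length descriptors. The structure and field widths below are assumptions for the sketch, not the normative VPS syntax.

```cpp
#include <cstddef>
#include <cstdint>

// Illustrative representation-format fields carried in the VPS.
struct RepFormat {
    uint8_t  chromaFormatIdc;          // chroma sampling format
    bool     separateColourPlaneFlag;  // colour planes coded separately or not
    uint16_t picWidthInLumaSamples;    // picture width
    uint16_t picHeightInLumaSamples;   // picture height
    uint8_t  bitDepthLumaMinus8;       // luma bit depth minus 8
    uint8_t  bitDepthChromaMinus8;     // chroma bit depth minus 8
};

// Toy fixed-length bit reader: every field has a known width, so the
// representation format can be parsed without entropy decoding.
struct FixedLenReader {
    const uint8_t* data;
    size_t bitPos = 0;
    uint32_t readBits(int n) {
        uint32_t v = 0;
        for (int i = 0; i < n; ++i, ++bitPos)
            v = (v << 1) | ((data[bitPos >> 3] >> (7 - (bitPos & 7))) & 1);
        return v;
    }
};

// Parse the representation format with assumed (not normative) field widths.
RepFormat parseRepFormat(FixedLenReader& r) {
    RepFormat f;
    f.chromaFormatIdc         = static_cast<uint8_t>(r.readBits(2));
    f.separateColourPlaneFlag = r.readBits(1) != 0;
    f.picWidthInLumaSamples   = static_cast<uint16_t>(r.readBits(16));
    f.picHeightInLumaSamples  = static_cast<uint16_t>(r.readBits(16));
    f.bitDepthLumaMinus8      = static_cast<uint8_t>(r.readBits(4));
    f.bitDepthChromaMinus8    = static_cast<uint8_t>(r.readBits(4));
    return f;
}
```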
Abstract:
In an example, the disclosure provides for receiving a coded video sequence comprising encoded pictures of a video sequence, and receiving, in a video parameter set (VPS) syntax structure referenced by the coded video sequence, timing parameters that include an indication of whether, for each picture in the coded video sequence other than the first picture in decoding order, the picture order count (POC) value is proportional to the output time of the picture relative to the output time of the first picture in the coded video sequence. Another example provides for encoding pictures of a video sequence to generate the coded video sequence comprising the encoded pictures, and signaling timing parameters for the coded video sequence by signaling the indication in the VPS syntax structure referenced by the coded video sequence.
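When the indication signals proportionality, relative output times can be derived directly from POC values. A minimal sketch under assumed parameter names; the timing fields below are simplifications of what a VPS-level timing structure might carry.

```cpp
#include <cstdint>

// Simplified timing parameters as they might be signaled in the VPS.
struct VpsTiming {
    bool     pocProportionalToTiming;  // the indication described above
    uint32_t numTicksPocDiffOne;       // clock ticks per unit POC difference
    uint32_t numUnitsInTick;           // clock tick numerator
    uint32_t timeScale;                // clock ticks per second
};

// If the proportionality indication is set, the output time of a picture
// relative to the first picture of the coded video sequence follows from its
// POC value. Returns the relative output time in seconds, or a negative
// value when it cannot be derived from POC alone in this sketch.
double relativeOutputTime(const VpsTiming& t, int32_t poc, int32_t pocOfFirstPic) {
    if (!t.pocProportionalToTiming)
        return -1.0;
    double tickSeconds = static_cast<double>(t.numUnitsInTick) / t.timeScale;
    return (poc - pocOfFirstPic) * static_cast<double>(t.numTicksPocDiffOne) * tickSeconds;
}
```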
Abstract:
An example method of decoding video data includes receiving a plurality of pictures; receiving, in a message associated with a first picture of the plurality of pictures, information indicating a picture order count (POC) value of a recovery point picture of a gradual decoder refresh (GDR) set; when a picture that follows the first picture in decoding order has a POC value equal to the POC value of the recovery point picture, identifying that picture as the recovery point picture; and when none of the pictures that follow the first picture has a POC value equal to the POC value of the recovery point picture, identifying one of the pictures having a POC value greater than the POC value of the recovery point picture as the recovery point picture.
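The recovery point selection amounts to a search over the pictures that follow the first picture in decoding order. A minimal sketch assuming a hypothetical list of POC values in decoding order; the fallback here picks the first picture with a greater POC, which is one possible choice since the abstract leaves the selection open.

```cpp
#include <cstdint>
#include <vector>

// Given the POC value signaled for the GDR recovery point and the POC values
// of pictures following the first picture in decoding order, pick the
// recovery point: an exact POC match if one exists, otherwise the first
// picture whose POC exceeds the signaled value. Returns an index into
// followingPocs, or -1 if no suitable picture exists.
int findRecoveryPoint(const std::vector<int32_t>& followingPocs,
                      int32_t recoveryPointPoc) {
    int firstGreater = -1;
    for (size_t i = 0; i < followingPocs.size(); ++i) {
        if (followingPocs[i] == recoveryPointPoc)
            return static_cast<int>(i);              // exact match
        if (followingPocs[i] > recoveryPointPoc && firstGreater < 0)
            firstGreater = static_cast<int>(i);      // fallback candidate
    }
    return firstGreater;
}
```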
Abstract:
Techniques for encapsulating video streams containing multiple coded views in a media file are described herein. In one example, a method includes parsing a track of video data, wherein the track includes one or more views. The method further includes parsing information to determine whether a texture view or a depth view of a reference view is required for decoding at least one of the one or more views in the track. Another example method includes composing a track of video data, wherein the track includes one or more views and composing information that indicates whether a texture view or a depth view of a reference view is required for decoding at least one of the one or more views in the track.
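A simplified view of the information a parser would examine: per referenced view, whether its texture and/or depth component is needed to decode the views carried in the track. The structure and field names below are hypothetical, not the actual file-format box syntax.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical dependency record for one reference view of a track.
struct ViewDependency {
    uint16_t refViewId;        // identifier of the reference view
    bool     textureRequired;  // texture component needed for decoding
    bool     depthRequired;    // depth component needed for decoding
};

// A track carrying one or more coded views plus the dependency information.
struct MultiviewTrack {
    std::vector<uint16_t>       viewIds;       // views carried in this track
    std::vector<ViewDependency> dependencies;  // what each view needs
};

// Returns true if decoding the views in the track requires the depth
// component of the given reference view.
bool depthOfViewRequired(const MultiviewTrack& track, uint16_t refViewId) {
    for (const auto& d : track.dependencies)
        if (d.refViewId == refViewId && d.depthRequired)
            return true;
    return false;
}
```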
Abstract:
Techniques are described for signaling decoding unit identifiers for decoding units of an access unit. A video decoder determines which network abstraction layer (NAL) units are associated with which decoding units based on the decoding unit identifiers. Techniques are also described for including one or more copies of supplemental enhancement information (SEI) messages in an access unit.
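The association between NAL units and decoding units can be modeled as a grouping keyed by the decoding unit identifier. A minimal sketch with hypothetical types; how each NAL unit obtains its identifier (e.g., from an associated SEI message) is abstracted away.

```cpp
#include <cstdint>
#include <map>
#include <vector>

// A NAL unit tagged with the identifier of the decoding unit it belongs to.
// Types and names are illustrative.
struct NalUnit {
    uint32_t             decodingUnitId;
    std::vector<uint8_t> payload;
};

// Group the NAL units of an access unit into decoding units by identifier,
// preserving the order in which the NAL units were received.
std::map<uint32_t, std::vector<const NalUnit*>>
groupByDecodingUnit(const std::vector<NalUnit>& accessUnit) {
    std::map<uint32_t, std::vector<const NalUnit*>> decodingUnits;
    for (const auto& nal : accessUnit)
        decodingUnits[nal.decodingUnitId].push_back(&nal);
    return decodingUnits;
}
```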
Abstract:
A device signals a property of a bitstream. The bitstream comprises a plurality of coded video sequences (CVSs). When the property has a particular value, all the CVSs of the bitstream conform to the same profile. A video processing device is able to determine, based on the property, whether to process the bitstream.
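The decision described above reduces to checking one signaled property instead of inspecting every CVS. A minimal sketch with assumed names for the property and profile values:

```cpp
#include <cstdint>
#include <vector>

// Illustrative bitstream-level property: when allCvssSameProfile is set,
// every coded video sequence (CVS) in the bitstream conforms to
// commonProfileIdc.
struct BitstreamProperties {
    bool    allCvssSameProfile;
    uint8_t commonProfileIdc;
};

// A processing device that supports only certain profiles can decide from the
// property alone whether to handle the bitstream, without parsing each CVS.
// In this sketch, bitstreams without the guarantee are conservatively rejected.
bool canProcess(const BitstreamProperties& props,
                const std::vector<uint8_t>& supportedProfiles) {
    if (!props.allCvssSameProfile)
        return false;   // would require per-CVS inspection
    for (uint8_t p : supportedProfiles)
        if (p == props.commonProfileIdc)
            return true;
    return false;
}
```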