Abstract:
A device generates, stores, or obtains a file that complies with a restriction specified in a file format to which the file conforms. The restriction requires that for each respective Intra Random Access Point (IRAP) picture, a parameter set required for decoding the respective IRAP picture is in at least one of the following: a sample entry of the respective IRAP picture's track that is applicable to the respective IRAP picture's sample, a sample entry of a non-corresponding track for the respective IRAP picture carrying a reference layer of the respective IRAP picture's layer, the respective IRAP picture's sample, or a temporally collocated sample of the respective IRAP picture's sample in a track of that carries the reference layer.
Abstract:
Systems, methods, and computer readable media are described for providing improved video coding, including improved video stream switching and random access. In some examples, systems, methods, and computer readable media include obtaining video data at an encoder and determining, from the video data, an intra random access point (IRAP) picture. A first set of one or more leading pictures associated with the IRAP picture are coded as random access decodable leading (RADL) pictures based on the first set of one or more leading pictures having a temporal identifier (TemporalId) that is less than or equal to a TemporalId threshold value. A second set of one or more leading pictures associated with the IRAP picture are coded as random access skipped leading (RASL) pictures based on the second set of one or more leading pictures having a TemporalId that is greater than the TemporalId threshold value. The video bitstream can then be generated to include the IRAP picture, the first set of one or more leading pictures, and the second set of one or more leading pictures.
Abstract:
In one example, a device for processing decoded video data a video decoder implemented by one or more hardware-based processing units comprising digital logic circuitry, and a postprocessing unit implemented by one or more hardware-based processing units comprising digital logic circuitry. The video decoder is configured to decode video data of a video bitsream according to a video coding standard, extract HDR postprocessing data from an SEI message of the video bitstream, and provide the decoded video data and the HDR postprocessing data to the postprocessing unit. The postprocessing unit is configured to process the decoded video data using the HDR postprocessing data according to the video coding standard. The device may additionally determine whether the video decoder is compliant with the video coding standard by comparing the processed video data with reference processed video data.
Abstract:
In one example, the disclosure is directed to techniques that include receiving a bitstream comprising at least a syntax element, a first network abstraction layer unit type, and a coded access unit comprising a plurality of pictures. The techniques further include determining a value of the syntax element which indicates whether the access unit was coded using cross-layer alignment. The techniques further include determining the first network abstraction layer unit type for a picture in the access unit and determining whether the first network abstraction layer unit type equals a value in a range of type values. The techniques further include setting a network abstraction layer unit type for all other pictures in the coded access unit to equal the value of the first network abstraction layer unit type if the first network abstraction layer unit type is equal to a value in the range of type values.
Abstract:
Systems and methods for separately defining and indicating inter-layer prediction dependencies for a first layer with respect to each of a number of enhancement layers associated with the first layer are described herein. One aspect of the subject matter described in the disclosure provides a video encoder comprising a memory unit configured to store a first picture associated with a first layer and enhancement layer pictures associated with a plurality of enhancement layers. The video encoder further comprises a processor in communication with the memory unit. The processor is configured to provide a separate indication for each of the enhancement layers that indicates whether the first picture can be used for inter-layer prediction of the enhancement layer picture in a respective enhancement layer.
Abstract:
A video coder can be configured to code a random access point (RAP) picture and code one or more decodable leading pictures (DLPs) for the RAP picture such that all pictures that are targeted for discard precede the DLPs associated with the RAP picture in display order.
Abstract:
A video encoder signals, in a slice header for a current slice of a current picture, a first long-term reference picture (LTRP) entry, the first LTRP entry indicating that a particular reference picture is in a long-term reference picture set of the current picture. Furthermore, the video encoder signals, in the slice header, a second LTRP entry only if second LTRP entry does not indicate that the particular reference picture is in the long-term reference picture set of the current picture.
Abstract:
Techniques for encapsulating video streams containing multiple coded views in a media file are described herein. In one example, a method includes parsing a track of video data, wherein the track includes one or more views. The method further includes parsing information to determine whether a texture view or a depth view of a reference view is required for decoding at least one of the one or more views in the track. Another example method includes composing a track of video data, wherein the track includes one or more views and composing information that indicates whether a texture view or a depth view of a reference view is required for decoding at least one of the one or more views in the track.
Abstract:
As one example, a method of coding video data includes storing one or more decoding units of video data in a picture buffer. The method further includes obtaining a respective buffer removal time for the one or more decoding units, wherein obtaining the respective buffer removal time comprises receiving a respective signaled value indicative of the respective buffer removal time for at least one of the decoding units. The method further includes removing the decoding units from the picture buffer in accordance with the obtained buffer removal time for each of the decoding units. The method further includes coding video data corresponding to the removed decoding units, wherein coding the video data comprises decoding the at least one of the decoding units.
Abstract:
In one example, a video coder is configured to code a value for a syntax element indicating whether at least a portion of a picture order count (POC) value of a picture is to be reset to a value of zero, when the value for the syntax element indicates that the portion of the POC value is to be reset to the value of zero, reset at least the portion of the POC value such that the portion of the POC value is equal to zero, and code video data using the reset POC value. Coding video data using the reset POC value may include inter-predicting a block of a subsequent picture relative to the picture, where the block may include a motion parameter that identifies the picture using the reset POC value. The block may be coded using temporal inter-prediction or inter-layer prediction.