Abstract:
A video coder codes a slice header for a slice of video data. The slice header includes a syntax element comprising identifying information for a long term reference picture, wherein the identifying information is explicitly signaled in the slice header or derived from a sequence parameter set corresponding to the slice. When the syntax element indicates that the identifying information for the long term reference picture is explicitly signaled, to code the slice header, the video coder is further configured to code a value for the identifying information for the long term reference picture in the slice header.
Abstract:
To extract a sub-bitstream from a 3-dimensional video (3DV) bitstream, a device determines a texture target view list that indicates views in the 3DV bitstream that have texture view components that are required for decoding pictures in a plurality of target views. The target views are a subset of the views in the bitstream that are to be decodable from the sub-bitstream. In addition, the device determines a depth target view list that indicates views in the 3DV bitstream that have depth view components that are required for decoding pictures in the plurality of target views. The device determines the sub-bitstream based at least in part on the texture target view list and the depth target view list.
Abstract:
A device includes, in a first track of the file, a first end of sequence (EOS) network abstraction layer (NAL) unit for a coded video sequence of a bitstream. The first EOS NAL unit is in a first access unit of the coded video sequence. The device also includes, in a second track of the file, a second EOS NAL unit for the coded video sequence. The second EOS NAL unit is in a second access unit of the coded video sequence, the second EOS NAL unit being different from the first EOS NAL unit. The device may perform similar actions for end of bitstream (EOB) NAL units.
Abstract:
A video coder may reconstruct a current picture of video data. A current region of the current picture is associated with a temporal index indicating a temporal layer to which the current region belongs. Furthermore, for each respective array of a plurality of arrays that correspond to different temporal layers, the video coder may store, in the respective array, sets of adaptive loop filtering (ALF) parameters used in applying ALF filters to samples of regions of pictures of the video data that are decoded prior to the current region and that are in the temporal layer corresponding to the respective array or a lower temporal layer than the temporal layer corresponding to the respective array. The video coder determines, based on a selected set of ALF parameters in the array corresponding to the temporal layer to which the current region belongs, an applicable set of ALF parameters.
Abstract:
Techniques and systems are provided for processing video data. For example, 360-degree video data can be obtained for processing by an encoding device or a decoding device. The 360-degree video data includes pictures divided into motion-constrained tiles. The 360-degree video data can be used to generate a media file including a plurality of tracks. Each of the plurality of tracks contain a set of at least one of the motion-constrained tiles. The set of at least one of the motion-constrained tiles corresponds to at least one of a plurality of viewports of the 360-degree video data. A first tile representation can be generated for the media file. The first tile representation encapsulates a first track of the plurality of tracks, and the first track includes a first set of at least one of the motion-constrained tiles at a first tile location in the pictures of the 360-degree video data. The first set of at least one of the motion-constrained tiles corresponds to a viewport of the 360-degree video data.
Abstract:
An apparatus configured to construct a reference picture list includes a memory and one or more processors in communication with the memory. The memory is configured to store one or more reference pictures. The one or more processors are configured to determine (i) a first number representative of a total number of reference pictures available for prediction in at least one of inter mode or intra block copy mode and (ii) a second number representative of a total number of reference pictures to be included in the reference picture list, and in response to a determination that the second number is greater than or equal to the first number, refrain from replacing a reference picture in the reference picture list with a current picture to be predicted. The one or more processors may encode or decode the current picture based on the reference picture list.
Abstract:
An apparatus for coding video information according to certain aspects includes a memory unit configured to store video information associated with a plurality of layers and a processor. The processor is configured to obtain information associated with a current access unit (AU) to be coded, the current AU containing pictures from one or more layers of the plurality of layers. The processor is further configured to determine whether the current AU includes a first layer containing an intra random access point (IRAP) picture. The process is additionally configured to reset a picture order count (POC) of the second layer at the current AU, in response to determining that the current AU includes (1) a first layer that contains an IRAP picture and (2) a second layer containing no picture or containing a discardable picture.
Abstract:
An apparatus configured to code video information includes a memory unit and a processor in communication with the memory unit. The memory unit is configured to store video information associated with a first layer and a second layer. The processor is configured to decode first layer pictures of the first layer, store the decoded first layer pictures in a decoded picture buffer, determine whether second layer pictures having no corresponding first layer pictures are to be coded, and in response to determining that second layer pictures having no corresponding first layer pictures are to be coded, process an indication that one or more decoded first layer pictures stored in the decoded picture buffer are to be removed. The processor may encode or decode the video information.
Abstract:
In one implementation, an apparatus is provided for encoding or decoding video information. The apparatus comprises a memory configured to store inter-layer reference pictures associated with a current picture that is being coded. The apparatus further comprises a processor operationally coupled to the memory. In one embodiment, the processor is configured to indicate a number of inter-layer reference pictures to use to predict the current picture using inter-layer prediction. The processor is also configured to indicate which of the inter-layer reference pictures to use to predict the current picture using inter-layer prediction. The processor is also configured to determine an inter-layer reference picture set associated with the current picture using the indication of the number of inter-layer reference pictures and the indication of which of the inter-layer reference pictures to use to predict the current picture using inter-layer prediction.
Abstract:
In one example, a device for decoding video data includes a processor configured to decapsulate a slice of a random access point (RAP) picture of a bitstream from a network abstraction layer (NAL) unit, wherein the NAL unit includes a NAL unit type value that indicates whether the RAP picture is of a type that can have associated leading pictures and whether the RAP picture is an instantaneous decoder refresh (IDR) picture or a clean random access (CRA) picture, determine whether the RAP picture can have associated leading pictures based on the NAL unit type value, and decode video data of the bitstream following the RAP picture based on the determination of whether the RAP picture can have associated leading pictures.