Abstract:
A coding device configured to code video data includes a buffer memory configured to store pictures of the video data and at least one processor, implemented in circuitry, that is in communication with the buffer memory. The processor is configured to code at least two pictures of a single coded video sequence (CVS) of the video data, where each of the at least two pictures is associated with an identical picture order count (POC) value and the at least two pictures are different from one another; associate respective data with each of the at least two pictures of the single CVS; and identify, for inclusion in a reference picture set, at least one picture among the at least two pictures based on the identical POC value associated with the at least two pictures and the respective data associated with the at least one picture.
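As an illustrative sketch only: the abstract does not say what the "respective data" is, so the example below models it as an integer tag stored next to the POC value. The names `Picture`, `tag`, and `select_for_rps` are hypothetical and are not taken from the source.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Picture:
    poc: int   # picture order count; may be shared by several pictures in one CVS
    tag: int   # hypothetical stand-in for the "respective data"
    name: str  # label used only to tell pictures apart in this example


def select_for_rps(buffer, poc, tag):
    """Return buffered pictures that match both the POC value and the tag."""
    return [p for p in buffer if p.poc == poc and p.tag == tag]


# Two different pictures share POC 8; the tag disambiguates them.
buffer = [Picture(8, 0, "A"), Picture(8, 1, "B"), Picture(9, 0, "C")]
selected = select_for_rps(buffer, poc=8, tag=1)
```

Matching on the POC value alone would return both "A" and "B" here, which is why the sketch needs the second key.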
Abstract:
Systems and methods are provided for specifying regional information such as a source and nature of a recommended viewport and a priority among multiple recommended viewports. Virtual reality video data can represent a 360-degree view of a virtual environment. In various examples, a region of the virtual reality video data can be determined, where the region includes a sub-section of the 360-degree view. A data structure can be generated for the region, where the data structure includes parameters that describe the region. The parameters can include a source associated with the region. The virtual reality video data and the data structure can be stored in a file.
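A minimal sketch of such a per-region data structure follows. Every field name, the convention that a smaller `priority` value ranks first, and the JSON serialization are assumptions made for illustration; the abstract states only that the parameters describe the region and can include a source.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class RegionInfo:
    # Hypothetical fields describing a sub-section of the 360-degree view.
    center_yaw_deg: float
    center_pitch_deg: float
    width_deg: float
    height_deg: float
    source: str    # who or what recommended this viewport
    priority: int  # assumption: smaller value means higher priority


def by_priority(regions):
    """Order recommended viewports from highest to lowest priority."""
    return sorted(regions, key=lambda r: r.priority)


regions = [
    RegionInfo(90.0, 0.0, 60.0, 40.0, source="user_statistics", priority=2),
    RegionInfo(0.0, 10.0, 60.0, 40.0, source="director", priority=1),
]
ordered = by_priority(regions)

# Stand-in for storing the data structure in a file alongside the video data.
serialized = json.dumps([asdict(r) for r in ordered])
```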
Abstract:
Video data bitstreams may contain bitstream conformance parameters, such as hypothetical reference decoder (HRD) parameters, which may be used to allow a decoder to test the conformance of a received bitstream. In multi-layer codecs transmitted using partitions, the video data may be associated with one or more layer sets. Each layer set may be associated with one or more output layer sets. Each output layer set may be further associated with one or more partitioning schemes. Conformance parameters are mapped to partitions of a partitioning scheme, based upon the output layer set that the partitioning scheme is associated with. This allows for a partition to be associated with different conformance parameters, depending upon the output layer set that is being used.
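The mapping described above can be pictured as a lookup keyed on the output layer set, the partitioning scheme, and the partition. The sketch below is hypothetical: the identifiers, the parameter names `bit_rate` and `cpb_size`, and the values are invented for illustration and do not come from the source.

```python
# (output layer set, partitioning scheme, partition) -> conformance parameters.
# All keys and values are made up for this example.
CONFORMANCE_TABLE = {
    ("ols0", "scheme0", "part0"): {"bit_rate": 1_000_000, "cpb_size": 400_000},
    ("ols1", "scheme0", "part0"): {"bit_rate": 2_500_000, "cpb_size": 900_000},
}


def conformance_params(ols, scheme, partition):
    """Look up the parameters that apply to a partition under a given output layer set."""
    return CONFORMANCE_TABLE[(ols, scheme, partition)]


# The same partition identifier resolves to different parameters per output layer set.
params_ols0 = conformance_params("ols0", "scheme0", "part0")
params_ols1 = conformance_params("ols1", "scheme0", "part0")
```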
Abstract:
An apparatus for coding video information according to certain aspects includes a processor configured to determine a value of a flag associated with a current picture of a current layer to be decoded, the flag indicating whether pictures in a decoded picture buffer (DPB) should be output, wherein the current picture is an intra random access point (IRAP) picture that starts a new coded video sequence (CVS) and wherein the determination of the value of the flag is based on at least one of: (1) the chroma format of the current picture and the chroma format of the preceding picture, (2) the bit depth of the luma samples of the current picture and the bit depth of the luma samples of the preceding picture, or (3) the bit depth of the chroma samples of the current picture and the bit depth of the chroma samples of the preceding picture.
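The three comparisons listed above can be sketched as follows. The abstract says only that the flag value is based on at least one of them and does not state which value results, so the sketch stops at reporting which properties differ. `PictureFormat` and `changed_properties` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PictureFormat:
    chroma_format: str     # e.g. "4:2:0"
    luma_bit_depth: int
    chroma_bit_depth: int


def changed_properties(current, preceding):
    """List which of the three compared properties differ between two pictures."""
    changed = []
    if current.chroma_format != preceding.chroma_format:
        changed.append("chroma_format")
    if current.luma_bit_depth != preceding.luma_bit_depth:
        changed.append("luma_bit_depth")
    if current.chroma_bit_depth != preceding.chroma_bit_depth:
        changed.append("chroma_bit_depth")
    return changed


preceding = PictureFormat("4:2:0", 8, 8)
current = PictureFormat("4:2:0", 10, 10)
diff = changed_properties(current, preceding)
```

A decoder following this approach would feed the result into its flag derivation; that mapping is left open here because the source does not specify it.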
Abstract:
An apparatus configured to code video information includes a memory unit and a processor in communication with the memory unit. The memory unit is configured to store video information associated with a reference layer (RL) and an enhancement layer (EL), the RL having an RL picture in a first access unit, and the EL having a first EL picture in the first access unit, wherein the first EL picture is associated with a first set of parameters. The processor is configured to determine whether the first EL picture is an intra random access point (IRAP) picture, determine whether the first access unit immediately follows a splice point where first video information is joined with second video information including the first EL picture, and perform, based on the determination of whether the first EL picture is an intra random access point (IRAP) picture and whether the first access unit immediately follows a splice point, one of (1) refraining from associating the first EL picture with a second set of parameters that is different from the first set of parameters, or (2) associating the first EL picture with a second set of parameters that is different from the first set of parameters. The processor may encode or decode the video information.
Abstract:
An apparatus configured to code video information includes a memory unit and a processor in communication with the memory unit. The memory unit is configured to store video information associated with a first video layer having a first picture. The processor is configured to process picture order count (POC) derivation information associated with the first picture, and determine, based on the POC derivation information associated with the first picture, a POC value of at least one other picture in the first video layer that precedes the first picture in decoding order. The processor may encode or decode the video information.
Abstract:
Systems and methods for inter-layer reference picture set derivation based on sub-layer reference prediction dependency are described herein. One aspect of the subject matter described in the disclosure provides a video encoder comprising a memory configured to store one or more direct reference layer pictures of one or more current pictures in a sequence, wherein the one or more current pictures are associated with a current layer, the current layer being associated with the one or more direct reference layers. The video encoder further comprises a processor in communication with the memory. The processor is configured to set an indication associated with a current picture to indicate whether all of the one or more direct reference layer pictures of the current picture that are not restricted for use in inter-layer prediction are included in an inter-layer reference picture set associated with the current picture.
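One way to read the indication is as a subset test: remove the restricted pictures from the direct reference layer pictures, then check that everything left is in the inter-layer reference picture set. The sketch below assumes pictures can be represented by hashable identifiers; the function name and the sample identifiers are hypothetical.

```python
def all_unrestricted_refs_included(direct_ref_pics, restricted_pics, inter_layer_rps):
    """True if every direct reference layer picture that is not restricted
    for inter-layer prediction appears in the inter-layer reference picture set."""
    unrestricted = set(direct_ref_pics) - set(restricted_pics)
    return unrestricted.issubset(inter_layer_rps)


# Three direct reference layer pictures, one of which is restricted.
direct = ["layer0_pic", "layer1_pic", "layer2_pic"]
restricted = ["layer2_pic"]

indication_full = all_unrestricted_refs_included(
    direct, restricted, ["layer0_pic", "layer1_pic"]
)
indication_partial = all_unrestricted_refs_included(
    direct, restricted, ["layer0_pic"]
)
```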
Abstract:
A method of coding video data includes receiving one or more layers of video information. Each layer may include at least one picture. The method can include determining a number of active reference layer pictures associated with at least one picture of the one or more layers. The method can further include determining a number of direct reference layers associated with the at least one of the one or more layers. Based on the number of direct reference layers equaling the number of active reference layer pictures, the method can further include refraining from further signaling inter-layer reference picture information in any video slice associated with at least one of a video parameter set (VPS), a sequence parameter set (SPS), or a picture parameter set (PPS). Additionally or alternatively, based on the number of direct reference layers equaling the number of active reference layer pictures, the method can include adding to the inter-layer reference picture set all direct reference layer pictures for any video slice associated with at least one of a video parameter set (VPS), a sequence parameter set (SPS), or a picture parameter set (PPS).
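The second alternative above can be sketched as a shortcut when building the inter-layer reference picture set: if the two counts are equal, no per-slice selection is needed and every direct reference layer picture is added. The function name, the `signaled_indices` parameter, and the fallback branch are assumptions made for illustration.

```python
def build_inter_layer_rps(direct_ref_layer_pics, num_active_ref_layer_pics,
                          signaled_indices=None):
    """Build the inter-layer reference picture set for one slice.

    When the number of direct reference layers equals the number of active
    reference layer pictures, all direct reference layer pictures are used
    and nothing further has to be signaled. Otherwise this sketch falls back
    to explicitly signaled indices (a hypothetical mechanism).
    """
    if len(direct_ref_layer_pics) == num_active_ref_layer_pics:
        return list(direct_ref_layer_pics)
    if signaled_indices is None:
        raise ValueError("explicit signaling is required when the counts differ")
    return [direct_ref_layer_pics[i] for i in signaled_indices]


pics = ["ref_layer0_pic", "ref_layer1_pic"]
rps_all = build_inter_layer_rps(pics, num_active_ref_layer_pics=2)
rps_some = build_inter_layer_rps(pics, num_active_ref_layer_pics=1,
                                 signaled_indices=[1])
```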
Abstract:
An apparatus for coding video information may include computing hardware configured to: when a current picture is to be predicted using at least inter layer motion prediction (ILMP): process a collocated reference index value associated with the current picture, wherein the collocated reference index value indicates a first reference picture that is used in predicting the current picture using inter layer prediction (ILP); and determine whether the first reference picture indicated by the collocated reference index value is enabled for ILMP; when the current picture is to be predicted using at least inter layer sample prediction (ILSP): process a reference index value associated with a block in the current picture, wherein the reference index value indicates a second reference picture that is used in predicting the block in the current picture using ILP; and determine whether the second reference picture indicated by the reference index value is enabled for ILSP.
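The two checks can be sketched as index lookups into a reference picture list whose entries carry per-picture enable flags. The `RefPicture` type, its flag names, and the use of a single list for both checks are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RefPicture:
    name: str
    ilmp_enabled: bool  # usable for inter layer motion prediction
    ilsp_enabled: bool  # usable for inter layer sample prediction


def collocated_ref_ok_for_ilmp(ref_list, collocated_ref_idx):
    """Check the picture selected by the collocated reference index."""
    return ref_list[collocated_ref_idx].ilmp_enabled


def block_ref_ok_for_ilsp(ref_list, ref_idx):
    """Check the picture selected by a block's reference index."""
    return ref_list[ref_idx].ilsp_enabled


ref_list = [
    RefPicture("motion_only", ilmp_enabled=True, ilsp_enabled=False),
    RefPicture("sample_only", ilmp_enabled=False, ilsp_enabled=True),
]
ilmp_ok = collocated_ref_ok_for_ilmp(ref_list, 0)
ilsp_ok = block_ref_ok_for_ilsp(ref_list, 0)
```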
Abstract:
A video encoder signals, in an encoded video bitstream, a video parameter set (VPS) that includes a plurality of Hypothetical Reference Decoder (HRD) parameter syntax structures that each include HRD parameters. For each respective HRD parameter syntax structure in the plurality of HRD parameter syntax structures, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameter syntax structure include a common set of HRD parameters in addition to a set of sub-layer-specific HRD parameter information specific to a particular sub-layer of the encoded video bitstream. The common set of HRD parameters is common to all sub-layers of the encoded video bitstream. A video decoder or other device decodes, from the encoded video bitstream, the VPS and performs an operation using the HRD parameters of at least one of the HRD parameter syntax structures.