Abstract:
Devices and methods for improving use of supplemental enhancement information (SEI) messages in multi-layer codecs for decoding multi-layer bitstreams containing video information are disclosed. In one aspect, a device may include a memory configured to store video information associated with a multi-layer bitstream, the stored video information comprising an SEI message, and a processor operably coupled to the memory. The processor is configured to determine whether the SEI message belongs to a first SEI message category or a second SEI message category based on an identifier in the SEI message. The processor is further configured to determine whether the SEI message applies to the operation points or the layers and, in response to the SEI message being of the first SEI message category and applying to the operation points, to associate each semantic of the SEI message with each operation point among the operation points.
Abstract:
In an example, a method of coding video data includes coding one or more non-video coding layer (VCL) network abstraction layer (NAL) units of a layer of a multi-layer bitstream, where the one or more non-VCL NAL units contain a decoded picture hash SEI message. The method also includes determining a set of layers of the multi-layer bitstream to which the decoded picture hash SEI message is applicable based on a layer identifier of the one or more non-VCL NAL units containing the decoded picture hash SEI message.
Abstract:
A method for coding a reference picture set (RPS) in multi-layer coding is disclosed. In one aspect, the method may involve determining whether a current picture of video information is a discardable picture. The method may also involve refraining from including the current picture in an RPS based on the determination that the current picture is a discardable picture. The method may further involve encoding the video information based at least in part on the RPS.
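The exclusion step can be illustrated with a minimal Python sketch. The `Picture` class, its `discardable` flag, and the `build_rps` function are hypothetical names introduced only for illustration; the abstract does not say how discardability is signalled or how the RPS is represented.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Picture:
    poc: int           # picture order count, used here as a simple identifier
    discardable: bool  # assumed per-picture flag; the signalling is not given in the abstract


def build_rps(coded_pictures: List[Picture]) -> List[int]:
    """Return the POCs placed in the RPS, leaving out discardable pictures."""
    rps = []
    for pic in coded_pictures:
        if pic.discardable:
            # Refrain from including a discardable picture in the RPS.
            continue
        rps.append(pic.poc)
    return rps


pictures = [Picture(0, False), Picture(1, True), Picture(2, False)]
print(build_rps(pictures))  # prints [0, 2]
```

In this sketch a discardable picture is never referenced, so dropping it from the bitstream would not leave a later picture with a missing reference.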
Abstract:
Methods for coding an inter-layer reference picture set (RPS) and coding end of bitstream (EoB) network abstraction layer (NAL) units in multi-layer coding are disclosed. In one aspect, the method includes determining whether a candidate inter-layer reference picture is present in the video information. The video information includes an inter-layer RPS including a plurality of subsets. The method further includes determining an inter-layer RPS subset to which the candidate inter-layer reference picture belongs in response to determining that the candidate inter-layer reference picture is not present, and indicating that no reference picture is present in the inter-layer RPS subset to which the candidate inter-layer reference picture belongs.
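A rough Python sketch of the inter-layer RPS handling follows. It assumes that each candidate is identified by a layer ID and that a caller-supplied function `subset_of` maps a layer ID to its subset index. Both assumptions, along with the function name `build_inter_layer_rps` and the use of `None` as the "no reference picture" marker, are illustrative and do not come from the abstract.

```python
from typing import Callable, Dict, List, Optional, Set


def build_inter_layer_rps(
    candidate_layers: List[int],
    present_layers: Set[int],
    subset_of: Callable[[int], int],
    num_subsets: int,
) -> Dict[int, List[Optional[int]]]:
    """Place each candidate in its subset; a missing candidate becomes None."""
    subsets: Dict[int, List[Optional[int]]] = {i: [] for i in range(num_subsets)}
    for layer_id in candidate_layers:
        # None stands in for "no reference picture" in this sketch.
        entry = layer_id if layer_id in present_layers else None
        subsets[subset_of(layer_id)].append(entry)
    return subsets


rps = build_inter_layer_rps(
    candidate_layers=[0, 1, 2],
    present_layers={0, 2},
    subset_of=lambda layer_id: layer_id % 2,  # hypothetical subset rule
    num_subsets=2,
)
print(rps)  # prints {0: [0, 2], 1: [None]}
```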
Abstract:
A computing device generates a file that comprises a track box that contains metadata for a track in the file. Media data for the track comprises a sequence of samples. Each of the samples is a video access unit of multi-layer video data. As part of generating the file, the computing device generates, in the file, an additional box that documents all of the samples containing at least one Intra Random Access Point (IRAP) picture.
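As an illustration of what such a box might record, the following Python sketch lists the sample numbers whose access unit contains at least one IRAP picture. It assumes HEVC-style NAL unit types, where values 16 through 23 denote IRAP pictures, and models each sample as a list of per-picture `nal_unit_type` values. The function name `irap_sample_numbers` and this in-memory representation are hypothetical; the actual box syntax is not given in the abstract.

```python
from typing import List

# HEVC nal_unit_type values 16..23 are reserved for IRAP pictures.
IRAP_NAL_TYPES = range(16, 24)


def irap_sample_numbers(samples: List[List[int]]) -> List[int]:
    """Return 1-based numbers of samples containing at least one IRAP picture.

    samples[i] holds the nal_unit_type of each picture in access unit i.
    """
    return [
        number
        for number, access_unit in enumerate(samples, start=1)
        if any(nal_type in IRAP_NAL_TYPES for nal_type in access_unit)
    ]


# Sample 1: IRAP in the base layer only; sample 3: IRAP in the enhancement layer only.
samples = [[19, 1], [1, 1], [1, 21], [0, 1]]
print(irap_sample_numbers(samples))  # prints [1, 3]
```

Because a sample qualifies when any layer carries an IRAP picture, access units with an IRAP picture in only one layer are listed as well.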
Abstract:
A method of decoding video data includes receiving an encoded video bitstream that includes a plurality of pictures and storing the plurality of pictures in one or more sub-decoded picture buffers (sub-DPBs). The method further includes receiving a respective set of sub-DPB parameters for each respective operation point of the encoded video bitstream, applying the respective set of sub-DPB parameters to all layers of an output layer set for each respective operation point, and performing a sub-DPB management process on the one or more sub-DPBs in accordance with the received respective single sets of sub-DPB parameters.
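The idea of one parameter set per operation point, shared by every layer of its output layer set, can be sketched in Python as below. The `SubDpbParams` fields and the `apply_params` function are hypothetical placeholders; the abstract does not enumerate the actual sub-DPB parameters.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class SubDpbParams:
    max_pictures: int  # hypothetical: sub-DPB capacity in pictures
    max_reorder: int   # hypothetical: allowed output reordering depth


def apply_params(
    params_by_op: Dict[int, SubDpbParams],
    layers_by_op: Dict[int, List[int]],
) -> Dict[int, Dict[int, SubDpbParams]]:
    """For each operation point, assign its single parameter set to all of its layers."""
    return {
        op: {layer: params for layer in layers_by_op[op]}
        for op, params in params_by_op.items()
    }


params_by_op = {0: SubDpbParams(4, 1), 1: SubDpbParams(6, 2)}
layers_by_op = {0: [0], 1: [0, 1]}
assigned = apply_params(params_by_op, layers_by_op)
print(assigned[1][0] == assigned[1][1])  # prints True
```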
Abstract:
A method, apparatus, and manufacture for processing video data are disclosed. A list of output layer sets in a video bitstream is received, and an index to at least one target output layer set in the list of output layer sets is received. Next, target output layers in the at least one target output layer set are determined based on the index. At least the target output layers from the video bitstream are decoded. Then, the decoded target output layers are output without outputting layers that are not targeted for output.
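The selection and output steps can be sketched in Python as follows. The function names `select_target_output_layers` and `output_pictures`, and the representation of decoded layers as a dictionary keyed by layer ID, are assumptions made for illustration only.

```python
from typing import Dict, List


def select_target_output_layers(
    output_layer_sets: List[List[int]], target_index: int
) -> List[int]:
    """Look up the target output layers by index into the received list."""
    return output_layer_sets[target_index]


def output_pictures(
    decoded: Dict[int, str], target_layers: List[int]
) -> Dict[int, str]:
    """Output only target layers; other decoded layers are withheld."""
    return {layer: pic for layer, pic in decoded.items() if layer in target_layers}


output_layer_sets = [[0], [1], [1, 2]]
targets = select_target_output_layers(output_layer_sets, 2)

# Layer 0 is decoded here (for example, as a reference layer) but is not a target.
decoded = {0: "base", 1: "enh1", 2: "enh2"}
print(output_pictures(decoded, targets))  # prints {1: 'enh1', 2: 'enh2'}
```

The sketch decodes more layers than it outputs, which mirrors the abstract's wording that at least the target output layers are decoded.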
Abstract:
In one example, a device for coding video data includes a video coder configured to code an intra random access point (IRAP) picture of a partially aligned IRAP access unit of video data, and code data that indicates, when performing random access from the partially aligned IRAP access unit, at least one picture of a video coding layer that is not correctly decodable. When the video coder comprises a video decoder, the video decoder may skip decoding of the pictures that are not correctly decodable, assuming random access has been performed starting from the partially aligned IRAP access unit.
Abstract:
Systems, methods, and devices for coding multilayer video data are disclosed that may include encoding, decoding, transmitting, or receiving multilayer video data. The systems, methods, and devices may receive or transmit a first output layer set for a layer set and receive or transmit a second output layer set for the layer set. The systems, methods, and devices may code (encode or decode) video data for at least one of the first output layer set and the second output layer set.
Abstract:
An apparatus for coding video information according to certain aspects includes a memory unit and a video processor in communication with the memory unit. The video processor is configured to identify a first picture included in a first set of pictures, wherein pictures within the first set of pictures having an output position after the output position of the first picture also have a decoding position after the decoding position of the first picture. The video processor is further configured to identify a second picture included in a second set of pictures, wherein pictures within the second set of pictures having an output position after the output position of the second picture also have a decoding position after the decoding position of the second picture. The video processor is also configured to code the identified first picture and the identified second picture via one syntax element into one access unit.