Abstract:
A method for processing video data in a real-time transport protocol (RTP) payload includes encapsulating video data in a single network abstraction layer (NAL) unit packet for an RTP session. The single NAL unit packet contains a single NAL unit. The method may also include encapsulating decoding order number information in the single NAL unit packet based on at least one of: the RTP session being in a multi-stream transmission (MST) mode, or a maximum number of NAL units that may precede the NAL unit in a de-packetization buffer in reception order and follow the NAL unit in decoding order being greater than 0.
Abstract:
A method for processing video data in a real-time transport protocol (RTP) payload includes encapsulating video data in a single network abstraction layer (NAL) unit packet for an RTP session. The single NAL unit packet contains a single NAL unit. The method may also include encapsulating decoding order number information in the single NAL unit packet based on at least one of: the RTP session being in a multi-stream transmission (MST) mode, or a maximum number of NAL units that may precede the NAL unit in a de-packetization buffer in reception order and follow the NAL unit in decoding order being greater than 0.
Abstract:
Techniques for coding data, such as, e.g., video data, include coding a first syntax element, conforming to a particular type of syntax element, of a first slice of video data, conforming to a first slice type, using an initialization value set. The techniques further include coding a second syntax element, conforming to the same type of syntax element, of a second slice of video data, conforming to a second slice type, using the same initialization value set. In this example, the first slice type may be different from the second slice type. Also in this example, at least one of the first slice type and the second slice type may be a temporally predicted slice type. For example, the at least one of the first and second slice types may be a unidirectional inter-prediction (P) slice type, or a bi-directional inter-prediction (B) slice type.
Abstract:
A video coder may be configured to code a plurality of syntax elements that indicate an absolute value of a transform coefficients in a block, wherein the plurality of syntax elements include a significance flag and four or more greater_than_N flags. The video coder may further determine first respective contexts for each syntax element in a first set of the plurality of syntax elements from respective context tables for each syntax element in the first set of the plurality of syntax elements, and determine second respective contexts for each syntax element in a second set of the plurality of syntax elements from a shared context table.
Abstract:
Example devices and techniques are described. An example device includes one or more memories configured to store video data and one or more processors configured to receive, in a bitstream, one or more first messages each specifying at least one respective film grain characteristic applicable to first video data of the video data and each corresponding to a different film grain synthesis technique. The one or more processors are configured to determine that a first film grain synthesis technique specified by a first message of the one or more first messages corresponds to a device supported film grain synthesis technique. The one or more processors are configured to apply, based on a determination that the first film grain synthesis technique corresponds to a device supported film grain synthesis technique, the device supported film grain synthesis technique to the first video data based on contents of the first message.
Abstract:
A video decoder may apply a first inverse residual modification function to first decoded modified chroma residual data to generate first inverse modified chroma residual data. Additionally, the video decoder may apply a second inverse residual modification function to second decoded modified chroma residual data to generate second inverse modified chroma residual data. The first decoded modified chroma residual data is associated with a first chroma component and the second decoded modified chroma residual data is associated with a second chroma component. The video decoder may reconstruct a block of video data based on the first inverse modified chroma residual data and the second inverse modified chroma residual data.
Abstract:
A video encoder may transform residual data by using a transform selected from a group of transforms. The transform is applied to the residual data to create a two-dimensional array of transform coefficients. A scanning mode is selected to scan the transform coefficients in the two-dimensional array into a one-dimensional array of transform coefficients. The combination of transform and scanning mode may be selected from a subset of combinations that is based on an intra-prediction mode. The scanning mode may also be selected based on the transform used to create the two-dimensional array. The transforms and/or scanning modes used may be signaled to a video decoder.
Abstract:
A video decoder may receive a supplemental enhancement information (SEI) processing order SEI message, decode a syntax element in the SEI processing order SEI message that indicates a preferred processing order of SEI messages in a bitstream, and process the SEI message present in the bitstream in accordance with the preferred processing order.
Abstract:
Aspects of the present disclosure provide techniques for using a radio access network (RAN) level negative acknowledgement (NAK) feedback to indicate at least one missing frame from an encoding device. The RAN level NAK feedback replaces or preempts a decoding device sending an end-to-end feedback to the encoding device using real-time transport protocol (RTP) that has a long latency and may cause freezes at the decoding device. For example, an encoding device may send to a network entity a request for a configuration that configures the encoding device to transmit media frames to the decoding device. The network entity may provide a NAK feedback indicating at least one missing frame. Having received the configuration in response to the request, the encoding device transmits media frames to the decoding device via the network entity, and monitors for NAK feedback from the network entity in accordance with the configuration.
Abstract:
A device for encoding video data includes a memory configured to store video data, and a video encoder implemented in circuitry and configured to encode a future picture of the video data having a first display order position, the future picture being included in an intra period (IP) of the video data, the IP comprising a plurality of groups of pictures (GOPs), and after encoding the future picture, encode a picture of an ordinal first GOP of the plurality of GOPs using the future picture as a reference picture, each picture of the ordinal first GOP having display order positions earlier than the first display order position. Encoding the future picture in this manner may result in encoding performance improvements with minimal increases in encoding and decoding complexity.