Network abstraction layer (NAL) unit header design for three-dimensional video coding

    Publication No.: US10447990B2

    Publication date: 2019-10-15

    Application No.: US13779468

    Filing date: 2013-02-27

    Abstract: A video encoder generates a network abstraction layer (NAL) unit that includes at least a first syntax element and a second syntax element. The first syntax element indicates that the NAL unit belongs to a particular NAL unit type. Coded slices of texture view components and depth view components are encapsulated within NAL units that belong to the particular NAL unit type. The second syntax element indicates whether a NAL unit header of the NAL unit includes an Advanced Video Coding (AVC)-compatible 3-dimensional video (3DV) header extension or includes a Multiview Video Coding (MVC)-compatible 3DV header extension. The video encoder outputs a bitstream that includes the NAL unit. A video decoder receives the NAL unit and determines whether the second syntax element indicates that the NAL unit header of the NAL unit includes the AVC-compatible 3DV header extension or the MVC-compatible 3DV header extension.
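
The decoder-side check described in this abstract can be sketched as follows. This is a minimal illustration, not the patented method: the one-byte base header layout (NAL unit type in the low 5 bits), the type value 21, and the placement of the flag in the top bit of the second byte are assumptions chosen for the example, and `classify_3dv_header` is a hypothetical helper name.

```python
def classify_3dv_header(nal_bytes: bytes, nal_type_3dv: int = 21) -> str:
    """Report which 3DV header extension a NAL unit header carries.

    Assumptions (illustrative only, not taken from the abstract):
    - byte 0 holds the NAL unit type in its low 5 bits (first syntax element);
    - nal_type_3dv is the type used for texture/depth coded slices;
    - the top bit of byte 1 is the second syntax element (extension flag).
    """
    if len(nal_bytes) < 2:
        raise ValueError("need at least two header bytes")

    # First syntax element: the NAL unit type.
    nal_unit_type = nal_bytes[0] & 0x1F
    if nal_unit_type != nal_type_3dv:
        return "other"

    # Second syntax element: selects the header extension that follows.
    flag = (nal_bytes[1] >> 7) & 1
    return "avc-compatible-3dv" if flag else "mvc-compatible-3dv"


with_flag = classify_3dv_header(bytes([0x75, 0x80]))
without_flag = classify_3dv_header(bytes([0x75, 0x00]))
other_type = classify_3dv_header(bytes([0x65, 0x80]))
```

Here `0x75` encodes type 21 in its low 5 bits, so the first two calls differ only in the flag bit, while `0x65` encodes a different type and is left unclassified.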

    Tile grouping in HEVC and L-HEVC file formats

    Publication No.: US10419768B2

    Publication date: 2019-09-17

    Application No.: US15470148

    Filing date: 2017-03-27

    Abstract: A device generates a Network Abstraction Layer (NAL) Unit Map Entry box in a file. The NAL Unit Map Entry box associates NAL units with a tile set that includes two or more tiles of a plurality of tiles into which a current picture is partitioned. A region of the current picture covered by the tile set may be non-rectangular. Additionally, the device generates a tile set group entry in the file. The tile set group entry includes a horizontal offset, a vertical offset, a width, and a height. The horizontal offset specifies a horizontal offset of a top-left pixel of a minimum rectangular region that covers tiles in the tile set. The vertical offset specifies a vertical offset of the top-left pixel of the minimum rectangular region. The width specifies a width of the minimum rectangular region. The height specifies a height of the minimum rectangular region.
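
The four fields of the tile set group entry can be derived from the tiles themselves. The sketch below assumes each tile is known by its top-left pixel position and size; the `Tile` and `TileSetGroupEntry` types and the `bounding_entry` function are hypothetical names introduced for illustration and do not reflect the actual file format syntax.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Tile:
    x: int       # horizontal position of the tile's top-left pixel
    y: int       # vertical position of the tile's top-left pixel
    width: int
    height: int


@dataclass(frozen=True)
class TileSetGroupEntry:
    horizontal_offset: int
    vertical_offset: int
    width: int
    height: int


def bounding_entry(tiles: Iterable[Tile]) -> TileSetGroupEntry:
    """Compute the minimum rectangle that covers every tile in the set."""
    tiles = list(tiles)
    if not tiles:
        raise ValueError("a tile set needs at least one tile")
    left = min(t.x for t in tiles)
    top = min(t.y for t in tiles)
    right = max(t.x + t.width for t in tiles)
    bottom = max(t.y + t.height for t in tiles)
    return TileSetGroupEntry(left, top, right - left, bottom - top)


# An L-shaped (non-rectangular) tile set made of three 64x64 tiles.
l_shaped = [Tile(0, 0, 64, 64), Tile(0, 64, 64, 64), Tile(64, 64, 64, 64)]
entry = bounding_entry(l_shaped)
```

The example tile set covers an L-shaped region, yet the entry still describes a plain rectangle: the smallest one that contains all three tiles.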

    Operation point for carriage of layered HEVC bitstream

    Publication No.: US10306269B2

    Publication date: 2019-05-28

    Application No.: US14878783

    Filing date: 2015-10-08

    Abstract: A device for processing a bitstream including video data, such as a demultiplexer, extracts a descriptor from the bitstream, wherein the bitstream includes layers of video data for operation points, separate from the descriptor, such that each operation point includes one or more of the layers of video data, and wherein the descriptor includes a set of profile, tier, and level (PTL) structures and data that associates each of the layers of each of the operation points with a corresponding one of the PTL structures, extracts video data for one of the operation points from the bitstream based at least in part on the PTL structures to which the layers of the one of the operation points correspond, and provides the extracted video data to a video decoder.

    Bitstream conformance constraints in scalable video coding

    Publication No.: US10264286B2

    Publication date: 2019-04-16

    Application No.: US14749527

    Filing date: 2015-06-24

    Abstract: An apparatus configured to code video information in a bitstream includes a memory and a processor in communication with the memory. The memory is configured to store video information associated with a plurality of video layers in the bitstream, the plurality of video layers in the bitstream divided into a plurality of bitstream partitions, wherein each bitstream partition contains at least one of the plurality of video layers. The processor is configured to process a bitstream conformance parameter associated with a first bitstream partition of the plurality of bitstream partitions, wherein the bitstream conformance parameter is applicable to the first bitstream partition but not to another portion of the bitstream not encompassed by the first bitstream partition. The processor may encode or decode the video information in the bitstream.

    Content source description for immersive media data

    Publication No.: US20190104326A1

    Publication date: 2019-04-04

    Application No.: US16149690

    Filing date: 2018-10-02

    Abstract: An example device for transferring media data including immersive media data includes a memory configured to store the media data, and one or more processors implemented in circuitry and configured to transfer metadata that systematically describes different formatting options for the immersive media data, process data representing one or more of the formatting options that a client device supports for rendering the immersive media data, and transfer the immersive media data having one of the formatting options that the client device supports. The device may be a server that sends the metadata and the media data to the client device, or the client device that receives the metadata and the media data. The client device may further configure a rendering environment according to the one of the formatting options to render the immersive media data.

    Multi-layer video coding

    Publication No.: US10244242B2

    Publication date: 2019-03-26

    Application No.: US14749526

    Filing date: 2015-06-24

    Abstract: A device for processing video data includes a memory configured to store at least a portion of a multi-layer bitstream of video data; and one or more processors configured to receive the portion of the multi-layer bitstream, the multi-layer bitstream comprising a plurality of layers, the plurality of layers comprising a non-independently decodable non-base layer (non-INBL) and an independently decodable non-base layer (INBL); receive a video parameter set (VPS) associated with the coded video data, the VPS comprising first representation format parameters; receive a sequence parameter set (SPS) associated with the INBL, the SPS comprising second representation format parameters; process the non-INBL based on the first representation format parameters; and process the INBL based on the second representation format parameters.

    View dependency in multi-view coding and 3D coding

    Publication No.: US10205961B2

    Publication date: 2019-02-12

    Application No.: US13867924

    Filing date: 2013-04-22

    Abstract: This disclosure describes techniques for coding layer dependencies for a block of video data. According to these techniques, a video encoder generates layer dependencies associated with a given layer. The video encoder also generates a type of prediction associated with one or more of the layer dependencies. In some examples, the video encoder generates a first syntax element to signal layer dependencies and a second syntax element to signal a type of prediction associated with one or more of the layer dependencies. A video decoder may obtain the layer dependencies associated with a given layer and the type of prediction associated with one or more of the layer dependencies.

    Carriage of video coding standard extension bitstream data using MPEG-2 systems

    Publication No.: US10205954B2

    Publication date: 2019-02-12

    Application No.: US14520562

    Filing date: 2014-10-22

    Abstract: In one example, a device for processing video data includes a memory for storing an enhancement layer of video data coded according to an extension of a video coding standard, and one or more processors configured to decode a hierarchy extension descriptor for an elementary stream including the enhancement layer, wherein the hierarchy extension descriptor includes data representative of two or more reference layers on which the enhancement layer depends, wherein the two or more reference layers include a first enhancement layer, conforming to a first scalability dimension, and a second enhancement layer, conforming to a second scalability dimension, and wherein the first scalability dimension is different than the second scalability dimension, and to process the video data based at least in part on the data representative of the two or more reference layers.

    High-level syntax extensions for high efficiency video coding

    Publication No.: US10200709B2

    Publication date: 2019-02-05

    Application No.: US13801731

    Filing date: 2013-03-13

    Abstract: In one example, a device includes a video coder configured to code a picture order count (POC) value for a first picture of video data, code a second-dimension picture identifier for the first picture, and code, in accordance with a base video coding specification or an extension to the base video coding specification, a second picture based at least in part on the POC value and the second-dimension picture identifier of the first picture. The video coder may comprise a video encoder or a video decoder. The second-dimension picture identifier may comprise, for example, a view identifier, a view order index, a layer identifier, or other such identifier. The video coder may code the POC value and the second-dimension picture identifier during coding of a motion vector for a block of the second picture, e.g., during advanced motion vector prediction or merge mode coding.

    Processing media data using a generic descriptor for file format boxes

    Publication No.: US20190014165A1

    Publication date: 2019-01-10

    Application No.: US16030387

    Filing date: 2018-07-09

    Abstract: In one example, a device for retrieving media data includes a memory configured to store media data, and a processor implemented in circuitry and configured to: process a descriptor of a manifest file of media content including the media data, the descriptor corresponding to an adaptation set of the media content and including data representative of a box of file level information that describes a property of a track of a media file of the media content including media data of the adaptation set, determine whether to retrieve the media data of the adaptation set according to the data representative of the file level information, and in response to determining to retrieve the media data, send a request to retrieve the media data.
