Method and apparatus for storing and signaling predictively coded image items

    公开(公告)号:US12149800B2

    公开(公告)日:2024-11-19

    申请号:US17045686

    申请日:2019-04-10

    Abstract: A method, apparatus and computer program product are provided to include information within a container that also includes a video bitstream as to whether individual image items corresponding to the video frames of the video bitstream bitstreams are self-decodable or, alternatively, are dependent upon one or more other image items. In an instance in which a respective image item is dependent upon one or more other image items, the method, apparatus and computer program product also include dependence information within the container identifying the other image item(s)upon which the decodability of the respective image item is dependent. As such, the method, apparatus and computer program product permit decoding relationships to be defined in the container between a predictively coded frame and other image item(s)upon which the predictively coded frame is dependent, thereby facilitating the decoding of the frame and, in turn, the video bitstream.

    Network-based spatial computing for extended reality (XR) applications

    公开(公告)号:US11748955B2

    公开(公告)日:2023-09-05

    申请号:US17495329

    申请日:2021-10-06

    CPC classification number: G06T19/006 G06F16/487 H04L67/131

    Abstract: Feature information is received from a client device and describes extended reality scene(s) in an extended reality (ER) environment at the client device. An ER description is formed in an ER description format corresponding to the feature information and is stored. Some of the stored ER description format is provided in a representational format upon request of the client device or other client devices viewing the one or more ER scenes and assisting the positioning of corresponding client devices in the ER environment. A client device captures environmental visual data and generates feature information, describing ER scene(s) in an extended reality environment at the client device, from the environmental visual data, and sends the generated feature information toward a server. A client device can localize itself in a 3D environment of an ER description, and generate ER anchors and objects, link them, and send them to a server.

    Method and apparatus for signaling and storing grouping types in an image container file

    公开(公告)号:US11700432B2

    公开(公告)日:2023-07-11

    申请号:US17045722

    申请日:2019-04-05

    CPC classification number: H04N21/8153 G06F16/51 H04N21/4312 H04N21/4342

    Abstract: A method, apparatus and computer program product are provided to store and signal pre-derivation properties in an image container file (24, 26). Relative to the construction of image package comprising an image container file, the method, apparatus and computer program product assign a pre-derivation property identifier data structure identifying one or more pre-derivation properties of one or more pre-derived images (22). With respect to the processing of an image container file, the method, apparatus and computer program product permit an image container file and a pre-derivation property identifier data structure identifying one or more pre-derivation properties of one or more pre-derived images in the image be processed to cause one or more pre-derived image items from the image container file to be rendered or edited and regenerated in accordance with the pre-derivation properties.

    Method and apparatus for enabling multiple timeline support for omnidirectional content playback

    公开(公告)号:US11587200B2

    公开(公告)日:2023-02-21

    申请号:US17267934

    申请日:2019-09-20

    Abstract: A method, apparatus and computer program product enable multiple timeline support in playback of omnidirectional media content with overlay. The method, apparatus and computer program product receive a visual overlay configured to be rendered as a multi-layer visual content with an omnidirectional media content file (30). The omnidirectional media content file is associated with a first presentation timeline. The visual overlay is associated with a second presentation timeline. The method, apparatus and computer program product construct an overlay behavior definition file associated with the visual overlay (32). The overlay behavior definition file indicates a behavior of the second presentation timeline with respect to the first presentation in an instance that a pre-defined user interaction switch occurs during a playback of the omnidirectional media content file.

    Storage of multiple atlases from one V-PCC elementary stream in ISOBMFF

    公开(公告)号:US11412267B2

    公开(公告)日:2022-08-09

    申请号:US17140580

    申请日:2021-01-04

    Abstract: An apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to perform: provide signal information to identify an atlas identifier on a sample of a volumetric media track, or on the volumetric media track in a multi-track container; wherein the signal information allows a file parser to link volumetric media tracks with different atlas identifiers that originate from a volumetric media elementary stream; and wherein the file parser is able to reconstruct the volumetric media elementary stream based on the signal information and data encapsulated in the multi-track container.

    Caching and Clearing Mechanism for Deep Convolutional Neural Networks

    公开(公告)号:US20220191524A1

    公开(公告)日:2022-06-16

    申请号:US17549039

    申请日:2021-12-13

    Abstract: An apparatus includes circuitry configured to: partition an input tensor into one or more block tensors; partition at least one of the block tensors into one or more continuation bands, the one or more continuation bands being associated with a caching counter having a value; store the one or more continuation bands in a cache managed using a cache manager; retrieve, prior to a convolution or pooling operation on a current block tensor, the one or more continuation bands of a previous block tensor from the cache that are adjacent to a current block tensor; concatenate the retrieved continuation bands with the current block tensor; apply the convolution or pooling operation on the current block tensor after the concatenation; decrease the respective caching counter value of the retrieved continuation bands; and clear the continuation bands from the cache when its respective caching counter reaches a value of zero.

    Coded Picture with Mixed VCL NAL Unit Type

    公开(公告)号:US20220109861A1

    公开(公告)日:2022-04-07

    申请号:US17496161

    申请日:2021-10-07

    Abstract: An apparatus includes at least one processor; and at least one memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to: indicate an extraction reference map entry used to assign a group identifier to at least one extraction reference, the extraction reference map entry indicating a subpicture layout; wherein the at least one extraction reference causes extraction of a network abstraction layer unit data by reference from another track; wherein the at least one extraction reference comprises an index of a track reference having a subpicture type within a subpicture order sample group description entry; and indicate, using the at least one extraction reference, subpictures or slices of a coded picture in decoding order.

Patent Agency Ranking