Apparatus, a method and a computer program for video encoding and decoding

    Publication No.: US12088847B2

    Publication date: 2024-09-10

    Application No.: US17616541

    Filing date: 2020-05-20

    Inventor: Miska Hannuksela

    Abstract: There is disclosed a method, an apparatus and a computer program product for video encoding and decoding. In accordance with an embodiment the method for encoding comprises concluding that a coded video sequence starts at a particular position in a bitstream, wherein the coded video sequence is a sequence of coded pictures in decoding order that is independently decodable and is followed by another coded video sequence or the end of the bitstream, and wherein the bitstream comprises access units, and an access unit comprises coded video data for a single time instance and associated other data, and an access unit comprises one or more network abstraction layer (NAL) units; and indicating in an access unit delimiter to treat the NAL unit(s) associated with the access unit delimiter as a start of a coded video sequence.
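
    Illustrative sketch (not part of the patent text): one way a reader of such a bitstream might group access units into coded video sequences using the indication in the access unit delimiter. The names `AccessUnit`, `aud_starts_cvs` and `split_into_cvs` are hypothetical; the abstract does not define a syntax element name or a data model.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class AccessUnit:
    """One access unit: its NAL units plus a hypothetical delimiter flag."""
    nal_units: List[str]
    # Assumed flag: the access unit delimiter says "treat this as a CVS start".
    aud_starts_cvs: bool = False


def split_into_cvs(access_units: List[AccessUnit]) -> List[List[AccessUnit]]:
    """Group access units (in decoding order) into coded video sequences."""
    sequences: List[List[AccessUnit]] = []
    for au in access_units:
        # Open a new sequence when flagged, or when nothing is open yet.
        if au.aud_starts_cvs or not sequences:
            sequences.append([])
        sequences[-1].append(au)
    return sequences


stream = [
    AccessUnit(["aud", "slice0"], aud_starts_cvs=True),
    AccessUnit(["aud", "slice1"]),
    AccessUnit(["aud", "slice2"], aud_starts_cvs=True),
]
groups = split_into_cvs(stream)
```

    In this sketch each group ends where the next flagged access unit begins or at the end of the list, mirroring the abstract's "followed by another coded video sequence or the end of the bitstream".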

    Method, an apparatus and a computer program product for video encoding and video decoding

    Publication No.: US11722751B2

    Publication date: 2023-08-08

    Application No.: US17789883

    Filing date: 2020-12-30

    IPC classification: H04N21/84 H04N21/854

    CPC classification: H04N21/84 H04N21/85406

    Abstract: The embodiments relate to a method, including writing a first and a second media entity in a container file; creating a media presentation description (MPD) with a first and a second Representation; the Representations belonging to Adaptation Sets; the Representations being associated with the media entities of the container file; when one of the Representations belongs to a media entity which is a thumbnail to a viewpoint or a thumbnail to an overlay, the method includes writing in the MPD file association/correspondence/grouping information of the said one Representation with another Representation belonging to a media entity which is a viewpoint or an overlay, correspondingly. The embodiments also relate to a method for parsing, and technical equipment for implementing the methods.

    Sharing of motion vector in 3D video coding

    Publication No.: US10715779B2

    Publication date: 2020-07-14

    Application No.: US16420729

    Filing date: 2019-05-23

    Abstract: Joint coding of depth map video and texture video is provided, where a motion vector for a texture video is predicted from a respective motion vector of a depth map video or vice versa. For scalable video coding, depth map video is coded as a base layer and texture video is coded as an enhancement layer(s). Inter-layer motion prediction predicts motion in texture video from motion in depth map video. With more than one view in a bitstream (for multiview coding), depth map videos are considered monochromatic camera views and are predicted from each other. If joint multiview video model coding tools are allowed, inter-view motion skip is used to predict motion vectors of texture images from depth map images. Furthermore, scalable multiview coding is utilized, where inter-view prediction is applied between views in the same dependency layer, and inter-layer (motion) prediction is applied between layers in the same view.

    An Apparatus, a Method and a Computer Program for Video Coding and Decoding

    Publication No.: US20190082184A1

    Publication date: 2019-03-14

    Application No.: US16084352

    Filing date: 2017-03-17

    Inventor: Miska Hannuksela

    Abstract: There is provided a method comprising encoding an uncompressed constituent frame into a first encoded picture, said encoding also resulting in a reconstructed first picture and said constituent frame having an effective picture area within the first reconstructed picture, performing either of the following as a part of said encoding: inserting at least one sample value outside the effective picture area to form a boundary extension for the constituent frame in the reconstructed first picture; or saturating or wrapping over sample locations outside the effective picture area to be within the effective picture area. There is also provided a method comprising receiving an encoded picture, decoding the encoded picture to form a reconstructed constituent frame of the picture having an effective picture area; and performing either of the following: filling an area outside the effective picture area to produce a padded reference picture, wherein the filled area forms a boundary extension; or determining that when referring to sample locations outside the effective picture area in decoding, said sample locations are saturated or wrapped over to be within the effective picture area.
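
    Illustrative sketch (not part of the patent text): the second alternative in the abstract, saturating or wrapping over a sample location so that it falls within the effective picture area, can be pictured as a coordinate mapping applied before a sample is fetched. The function names, the `(left, top, right, bottom)` area tuple with inclusive bounds, and the list-of-rows picture layout are assumptions made for this example only.

```python
def saturate(coord, lo, hi):
    """Clamp a coordinate to the inclusive range [lo, hi]."""
    return max(lo, min(coord, hi))


def wrap(coord, lo, hi):
    """Wrap a coordinate around so that it lands in [lo, hi]."""
    size = hi - lo + 1
    # Python's % yields a non-negative result for a positive modulus.
    return lo + (coord - lo) % size


def fetch_sample(picture, x, y, area, mode="saturate"):
    """Read a sample, first mapping (x, y) into the effective picture area.

    picture: list of rows of sample values
    area:    (left, top, right, bottom), inclusive bounds (assumed layout)
    mode:    "saturate" or "wrap"
    """
    left, top, right, bottom = area
    fix = saturate if mode == "saturate" else wrap
    return picture[fix(y, top, bottom)][fix(x, left, right)]


# A 2x4 picture whose effective area is columns 1..2, rows 0..1.
picture = [
    [0, 1, 2, 3],
    [4, 5, 6, 7],
]
area = (1, 0, 2, 1)
```

    With saturation, a reference to the right of the area reuses the nearest boundary sample; with wrapping, it continues from the opposite edge, which may suit content that is continuous across the boundary, such as 360-degree panoramas.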