Signaling of spatial resolution of depth views in multiview coding file format

    Publication No.: US10791315B2

    Publication date: 2020-09-29

    Application No.: US14137358

    Filing date: 2013-12-20

    Abstract: Techniques for encapsulating video streams containing multiple coded views in a media file are described herein. In one example, a method includes parsing a track of multiview video data, wherein the track includes at least one depth view. The method further includes parsing information to determine a spatial resolution associated with the depth view, wherein decoding the spatial resolution does not require parsing of a sequence parameter set of the depth view. Another example method includes composing a track of multiview video data, wherein the track includes the one or more views. The example method further includes composing information to indicate a spatial resolution associated with the depth view, wherein decoding the spatial resolution does not require parsing of a sequence parameter set of the depth view.
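
The idea of reading the depth view's resolution from file-format metadata, rather than from a sequence parameter set, can be sketched as below. This is only an illustration under stated assumptions: the box type `dres`, its 16-bit width and height fields, and the helper names are hypothetical and are not taken from the abstract. Only the generic size/type box header follows the usual ISO base media file layout.

```python
import struct

# Hypothetical box type and payload layout, chosen for illustration only:
# two big-endian 16-bit fields holding the depth view's width and height.
DEPTH_RES_BOX_TYPE = b"dres"


def make_box(box_type: bytes, payload: bytes) -> bytes:
    """Serialize one box: 32-bit big-endian size (header included), 4-byte type, payload."""
    return struct.pack(">I4s", 8 + len(payload), box_type) + payload


def iter_boxes(data: bytes):
    """Yield (type, payload) for each box stored back to back in `data`."""
    offset = 0
    while offset + 8 <= len(data):
        size, box_type = struct.unpack_from(">I4s", data, offset)
        if size < 8 or offset + size > len(data):
            raise ValueError("malformed box")
        yield box_type, data[offset + 8 : offset + size]
        offset += size


def depth_resolution(sample_entry_payload: bytes):
    """Return (width, height) of the depth view, or None if it is not signaled.

    No sequence parameter set is parsed: the values come from the box alone.
    """
    for box_type, payload in iter_boxes(sample_entry_payload):
        if box_type == DEPTH_RES_BOX_TYPE:
            return struct.unpack(">HH", payload[:4])
    return None


# Compose a payload with an unrelated box followed by the resolution box, then parse it.
entry = make_box(b"xxxx", b"\x00" * 5) + make_box(
    DEPTH_RES_BOX_TYPE, struct.pack(">HH", 960, 540)
)
print(depth_resolution(entry))  # prints (960, 540)
```

The composing side (`make_box`) and the parsing side (`depth_resolution`) mirror the two example methods in the abstract.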

    Overlapped motion compensation for video coding

    Publication No.: US10771811B2

    Publication date: 2020-09-08

    Application No.: US16253049

    Filing date: 2019-01-21

    Abstract: In an example, a method of decoding video data may include receiving a first block of video data. The first block of video data may be a sub-block of a prediction unit. The method may include receiving one or more blocks of video data that neighbor the first block of video data. The method may include determining motion information of at least one of the one or more blocks of video data that neighbor the first block of video data. The method may include decoding, using overlapped block motion compensation, the first block of video data based at least in part on the motion information of the at least one of the one or more blocks that neighbor the first block of video data.
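
A minimal sketch of overlapped block motion compensation for one sub-block is given below. It assumes integer motion vectors, a single neighbor (the block above), and illustrative blending weights of 0.25 and 0.125 for the two rows nearest the shared boundary. None of these specifics come from the abstract, and the function names are hypothetical.

```python
def fetch_block(ref, x, y, mv, size):
    """Copy a size x size block from `ref` at (x, y), displaced by integer mv = (dx, dy)."""
    dx, dy = mv
    return [[ref[y + dy + r][x + dx + c] for c in range(size)] for r in range(size)]


def obmc_predict(ref, x, y, size, own_mv, above_mv, neighbor_weights=(0.25, 0.125)):
    """Blend the sub-block's own prediction with one built from the above neighbor's motion.

    Only the rows closest to the shared (top) boundary are blended; the weights
    are assumed values for illustration.
    """
    own = fetch_block(ref, x, y, own_mv, size)
    if above_mv is None or above_mv == own_mv:
        return own  # neighbor motion unavailable or identical: nothing to blend
    nbr = fetch_block(ref, x, y, above_mv, size)
    out = [row[:] for row in own]
    for r, w in enumerate(neighbor_weights[:size]):
        out[r] = [(1 - w) * o + w * n for o, n in zip(own[r], nbr[r])]
    return out


# Toy 8x8 reference frame whose sample value encodes its position.
ref = [[10 * r + c for c in range(8)] for r in range(8)]
pred = obmc_predict(ref, x=2, y=2, size=4, own_mv=(0, 0), above_mv=(1, 0))
for row in pred:
    print(row)
```

In this toy case the first row is pulled a quarter of the way toward the neighbor's prediction, the second row an eighth, and the remaining rows keep the sub-block's own prediction.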

    Edge computing
    Type: Granted invention patent

    Publication No.: US10726302B2

    Publication date: 2020-07-28

    Application No.: US16204242

    Filing date: 2018-11-29

    Abstract: Methods, systems, and devices for object localization and classification are described. A device may configure a first unit of a detection layer associated with a learning framework when a quantity of output feature channels of input feature maps is less than or equal to a quantity of input feature channels of the input feature maps. The first set of layers may include a group convolution layer, a pointwise layer, a batch normalization layer, or a rectified linear layer, or a combination thereof. The device may also configure a second unit of the detection layer associated with the learning framework when a second quantity of output feature channels of the input feature maps is less than or equal to a second quantity of input feature channels of the input feature maps. The second set of layers may include a depthwise layer or a pointwise layer, or both.

    Constrained depth intra mode coding for 3D video coding

    Publication No.: US10687079B2

    Publication date: 2020-06-16

    Application No.: US15125549

    Filing date: 2014-03-13

    Abstract: Techniques include constraining depth intra mode coding in a three-dimensional (3D) video coding process, such as 3D-High Efficiency Video Coding (3D-HEVC). In some examples, the techniques for constraining depth intra mode coding may prevent transform tree nodes from being split into sub-transform tree nodes when a depth prediction unit that corresponds to the transform tree node is predicted according to a depth modeling mode (DMM). In further examples, the techniques for constraining depth intra mode coding may prevent the DMM mode from being used when the maximum transform unit size that corresponds to a depth prediction unit is greater than the size of the depth prediction unit. The techniques for constraining depth intra mode coding may prevent characteristics of the DMM prediction modes used in 3D-HEVC and characteristics of the transform tree subdivision used in 3D-HEVC from interfering with each other.
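
The two constraints described in this abstract can be written as simple predicates. The sketch below follows the abstract's wording literally. The function and parameter names are hypothetical, and block sizes are assumed to be given as a single side length in samples.

```python
def may_split_transform_node(pu_uses_dmm: bool) -> bool:
    """First constraint: a transform tree node is not split into sub-transform
    tree nodes when the corresponding depth prediction unit is predicted with a
    depth modeling mode (DMM)."""
    return not pu_uses_dmm


def dmm_allowed(pu_size: int, max_tu_size: int) -> bool:
    """Second constraint, as worded in the abstract: DMM is not used when the
    maximum transform unit size corresponding to the depth prediction unit is
    greater than the size of that prediction unit."""
    return not (max_tu_size > pu_size)


print(may_split_transform_node(pu_uses_dmm=True))
print(dmm_allowed(pu_size=32, max_tu_size=32))
print(dmm_allowed(pu_size=16, max_tu_size=32))
```

An encoder or decoder would apply either check on its own. The abstract presents them as alternative ways to keep DMM prediction and transform tree subdivision from interfering with each other.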

    EDGE COMPUTING
    Type: Invention application
    Status: Under examination (published)

    Publication No.: US20200175334A1

    Publication date: 2020-06-04

    Application No.: US16204242

    Filing date: 2018-11-29

    Abstract: Methods, systems, and devices for object localization and classification are described. A device may configure a first unit of a detection layer associated with a learning framework when a quantity of output feature channels of input feature maps is less than or equal to a quantity of input feature channels of the input feature maps. The first set of layers may include a group convolution layer, a pointwise layer, a batch normalization layer, or a rectified linear layer, or a combination thereof. The device may also configure a second unit of the detection layer associated with the learning framework when a second quantity of output feature channels of the input feature maps is less than or equal to a second quantity of input feature channels of the input feature maps. The second set of layers may include a depthwise layer or a pointwise layer, or both.

    USING OBJECT RE-IDENTIFICATION IN VIDEO SURVEILLANCE

    Publication No.: US20180374233A1

    Publication date: 2018-12-27

    Application No.: US15635059

    Filing date: 2017-06-27

    Abstract: In various implementations, object tracking in a video content analysis system can be augmented with an image-based object re-identification system (e.g., for person re-identification or re-identification of other objects) to improve object tracking results for objects moving in a scene. The object re-identification system can use image recognition principles, which can be enhanced by considering data provided by object trackers that can be output by an object traffic system. In a testing stage, the object re-identification system can selectively test object trackers against object models. For most input video frames, not all object trackers need be tested against all object models. Additionally, different types of object trackers can be tested differently, so that a context provided by each object tracker can be considered. In a training stage, object models can also be selectively updated.

    View synthesis in 3D video
    Type: Granted invention patent

    Publication No.: US10136119B2

    Publication date: 2018-11-20

    Application No.: US14151586

    Filing date: 2014-01-09

    Abstract: In an example, a method of decoding video data includes determining whether a reference index for a current block corresponds to an inter-view reference picture, and when the reference index for the current block corresponds to the inter-view reference picture, obtaining, from an encoded bitstream, data indicating a view synthesis prediction (VSP) mode of the current block, where the VSP mode for the reference index indicates whether the current block is predicted with view synthesis prediction from the inter-view reference picture.
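
The conditional parsing described in this abstract might look like the sketch below. Several details are assumptions made for illustration: an inter-view reference picture is identified by a view identifier that differs from the current view, the VSP mode is carried as a single raw bit, and the `BitReader`, `RefPicture`, and `parse_vsp_flag` names are hypothetical. Real codecs entropy-code such syntax elements, which is not modeled here.

```python
from dataclasses import dataclass


@dataclass
class RefPicture:
    view_id: int  # view the reference picture belongs to


class BitReader:
    """Reads single bits, most significant bit first, from a bytes object."""

    def __init__(self, data: bytes):
        self.data = data
        self.pos = 0  # current position, in bits

    def read_bit(self) -> int:
        byte = self.data[self.pos // 8]
        bit = (byte >> (7 - self.pos % 8)) & 1
        self.pos += 1
        return bit


def parse_vsp_flag(reader, ref_idx, ref_list, current_view_id) -> bool:
    """Read the VSP mode only when ref_idx points at an inter-view reference picture."""
    ref = ref_list[ref_idx]
    if ref.view_id != current_view_id:  # assumed test for "inter-view reference"
        return bool(reader.read_bit())
    return False  # flag absent from the bitstream; no bits are consumed


reader = BitReader(bytes([0b10000000]))
refs = [RefPicture(view_id=0), RefPicture(view_id=1)]
print(parse_vsp_flag(reader, 0, refs, current_view_id=1))  # inter-view reference: flag is read
print(parse_vsp_flag(reader, 1, refs, current_view_id=1))  # same-view reference: flag is not read
```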
