METHODS AND SYSTEMS FOR APPLYING COMPLEX OBJECT DETECTION IN A VIDEO ANALYTICS SYSTEM

    公开(公告)号:US20190130580A1

    公开(公告)日:2019-05-02

    申请号:US16158079

    申请日:2018-10-11

    Abstract: Techniques and systems are provided for tracking objects in one or more video frames. For example, a first set of one or more bounding regions are determined for a video frame based on a trained classification network applied to the video frame. The first set of one or more bounding regions are associated with one or more objects in the video frame. One or more blobs can be detected for the video frame. A blob includes pixels of at least a portion of an object in the video frame. A second set of one or more bounding regions are determined for the video frame that are associated with the one or more blobs. A final set of one or more bounding regions is determined for the video frame using the first set of one or more bounding regions and the second set of one or more bounding regions. Object tracking can then be performed for the video frame using the final set of one or more bounding regions.

    OBJECT CLASSIFICATION IN A VIDEO ANALYTICS SYSTEM

    公开(公告)号:US20190130188A1

    公开(公告)日:2019-05-02

    申请号:US16147361

    申请日:2018-09-28

    Abstract: Techniques and systems are provided for classifying objects in one or more video frames. For example, a plurality of object trackers maintained for a current video frame can be obtained. A plurality of classification requests can also be obtained. The classification requests are associated with a subset of object trackers from the plurality of object trackers, and are generated based on one or more characteristics associated with the subset of object trackers. Based on the obtained plurality of classification requests, an object tracker is selected from the subset of object trackers for object classification. For example, the object tracker can be selected from the subset of object trackers based on priorities assigned to the subset of object trackers. The object classification can then be performed for the selected at least one object tracker.

    PRIORITIZING OBJECTS FOR OBJECT RECOGNITION
    13.
    发明申请

    公开(公告)号:US20190065895A1

    公开(公告)日:2019-02-28

    申请号:US16107879

    申请日:2018-08-21

    Abstract: Techniques and systems are provided for prioritizing objects for object recognition in one or more video frames. For example, a current video frame is obtained, and a objects are detected in the current video frame. State information associated with the objects is determined. Priorities for the objects can also be determined. For example, a priority can be determined for an object based on state information associated with the object. Object recognition is performed for at least one object from the objects based on priorities determined for the at least one object. For instance, object recognition can be performed for objects having higher priorities before objects having lower priorities.

    INFERENCE OF NOOUTPUTOFPRIORPICSFLAG IN VIDEO CODING
    14.
    发明申请
    INFERENCE OF NOOUTPUTOFPRIORPICSFLAG IN VIDEO CODING 有权
    NOVTPUTOFPRIORPICSFLAG在视频编码中的应用

    公开(公告)号:US20150195545A1

    公开(公告)日:2015-07-09

    申请号:US14584351

    申请日:2014-12-29

    Abstract: An apparatus for coding video information according to certain aspects includes a processor configured to determine a value of a flag associated with a current picture of a current layer to be decoded, the flag indicating whether pictures in a decoded picture buffer (DPB) should be output, wherein the current picture is an intra random access point (TRAP) picture that starts a new coded video sequence (CVS) and wherein the determination of the value of the flag is based on at least one of: (1) the chroma format of the current picture and the chroma format of the preceding picture, (2) the bit depth of the luma samples of the current picture and the bit depth of the luma samples of the preceding picture, or (3) the bit depth of the chroma samples of the current picture and the bit depth of the chroma samples of the preceding picture.

    Abstract translation: 根据某些方面的用于对视频信息进行编码的装置包括处理器,被配置为确定与要解码的当前图像的当前图像相关联的标志的值,指示是否应当输出解码图像缓冲器(DPB)中的图像的标志 ,其中当前图像是开始新的编码视频序列(CVS)的帧内随机接入点(TRAP)图像,并且其中所述标志的值的确定基于以下中的至少一个:(1) 前一图像的当前图像和色度格式,(2)当前图像的亮度样本的比特深度和前一图像的亮度样本的比特深度,或者(3)色度样本的比特深度 的当前图像和前一图像的色度采样的位深度。

    DEVICE AND METHOD FOR SCALABLE CODING OF VIDEO INFORMATION
    15.
    发明申请
    DEVICE AND METHOD FOR SCALABLE CODING OF VIDEO INFORMATION 有权
    视频信息可扩展编码的设备和方法

    公开(公告)号:US20150103922A1

    公开(公告)日:2015-04-16

    申请号:US14512962

    申请日:2014-10-13

    Abstract: An apparatus configured to code video information includes a memory unit and a processor in communication with the memory unit. The memory unit is configured to store video information associated with a first video layer having a first picture. The processor is configured to process picture order count (POC) derivation information associated with the first picture, and determine, based on the POC derivation information associated with the first picture, a POC value of at least one other picture in the first video layer that precedes the first picture in decoding order. The processor may encode or decode the video information.

    Abstract translation: 被配置为对视频信息进行编码的装置包括与存储器单元通信的存储器单元和处理器。 存储单元被配置为存储与具有第一图像的第一视频层相关联的视频信息。 处理器被配置为处理与第一图像相关联的图像顺序计数(POC)导出信息,并且基于与第一图像相关联的POC推导信息,确定第一视频层中的至少一个其他图像的POC值, 先于解码顺序的第一张图片。 处理器可以对视频信息进行编码或解码。

    OPTIMIZATIONS ON INTER-LAYER PREDICTION SIGNALLING FOR MULTI-LAYER VIDEO CODING
    16.
    发明申请
    OPTIMIZATIONS ON INTER-LAYER PREDICTION SIGNALLING FOR MULTI-LAYER VIDEO CODING 有权
    用于多层视频编码的层间预测信号优化

    公开(公告)号:US20150010051A1

    公开(公告)日:2015-01-08

    申请号:US14318230

    申请日:2014-06-27

    Abstract: A method of coding video data includes receiving one or more layers of video information. Each layer may include at least one picture. The method can include determining a number of active reference layer pictures associated with at least one picture of the one or more layers. The method can further include determining a number of direct reference layers associated with the at least one of the one or more layers. Based on the number of direct reference layers equaling the number of active reference layer pictures, the method can further include refraining from further signaling inter-layer reference picture information in any video slice associated with at least one of a video parameter set (VPS), a sequence parameter set (SPS), or a picture parameter set (PPS). Additionally or alternatively, based on the number of direct reference layers equaling the number of active reference layer pictures, the method can include adding to the inter-layer reference picture set all direct reference layer pictures for any video slice associated with at least one of a video parameter set (VPS), a sequence parameter set (SPS), or a picture parameter set (PPS).

    Abstract translation: 一种编码视频数据的方法包括接收一层或多层视频信息。 每个层可以包括至少一个图片。 该方法可以包括确定与一个或多个层的至少一个图片相关联的多个活动参考图层图片。 该方法还可以包括确定与一个或多个层中的至少一个层相关联的多个直接参照层。 基于等于有效参考层图像数量的直接参考层的数量,该方法还可以包括在与视频参数集(VPS)中的至少一个相关联的任何视频片段中避免进一步的信令层间参考图像信息, 序列参数集(SPS)或图像参数集(PPS)。 附加地或替代地,基于等于有效参考层图像的数量的直接参考层的数量,该方法可以包括向层间参考图像集合添加与以下各项中的至少一个相关联的任何视频片段的所有直接参考图像图像: 视频参数集(VPS),序列参数集(SPS)或图像参数集(PPS)。

    PARAMETER SETS IN VIDEO CODING
    17.
    发明申请
    PARAMETER SETS IN VIDEO CODING 有权
    参数设置在视频编码

    公开(公告)号:US20140022343A1

    公开(公告)日:2014-01-23

    申请号:US13945618

    申请日:2013-07-18

    Inventor: Ying CHEN

    CPC classification number: H04N13/161 H04N19/30 H04N19/463 H04N19/597 H04N19/70

    Abstract: A video parameter set (VPS) is associated with one or more coded video sequences (CVSs). The VPS includes a VPS extension for a video coding extension. The VPS extension includes a syntax element that indicates whether a video coding tool associated with the video coding extension is enabled for a set of applicable layers of a bitstream. When the syntax element indicates that the coding tool is enabled for the applicable layers, at least a portion of the video data that is associated with the CVSs and that is associated with the applicable layers is coded using the coding tool. When the syntax element indicates that the coding tool is not enabled for the applicable layers, the video data that is associated with the CVSs and that is associated with the applicable layers is not coded using the coding tool.

    Abstract translation: 视频参数集(VPS)与一个或多个编码视频序列(CVS)相关联。 VPS包括用于视频编码扩展的VPS扩展。 VPS扩展包括语法元素,其指示与视频编码扩展相关联的视频编码工具是否对于比特流的一组可应用层启用。 当语法元素指示对适用层启用编码工具时,使用编码工具编码与CVS相关联并且与适用层相关联的视频数据的至少一部分。 当语法元素指示编码工具没有为适用的层启用时,与CVS相关联且与适用层相关联的视频数据不使用编码工具进行编码。

    VIEW DEPENDENCY IN MULTI-VIEW CODING AND 3D CODING
    18.
    发明申请
    VIEW DEPENDENCY IN MULTI-VIEW CODING AND 3D CODING 审中-公开
    查看多视图编码和3D编码中的依赖关系

    公开(公告)号:US20130279576A1

    公开(公告)日:2013-10-24

    申请号:US13867924

    申请日:2013-04-22

    Abstract: This disclosure described techniques for coding layer dependencies for a block of video data. According to these techniques, a video encoder generates layer dependencies associated with a given layer. The video encoder also generates a type of prediction associated with one or more of the layer dependencies. In some examples, the video encoder generates a first syntax element to signal layer dependencies and a second syntax element to signal a type of prediction associated with one or more of the layer dependencies. A video decoder may obtain the layer dependencies associated with a given layer and the type of prediction associated with one or more of the layer dependencies.

    Abstract translation: 本公开描述了用于编码视频数据块的层依赖性的技术。 根据这些技术,视频编码器生成与给定层相关联的层依赖性。 视频编码器还生成与一个或多个层依赖关联的预测类型。 在一些示例中,视频编码器生成用于信号层依赖性的第一语法元素和第二语法元素来发送与一个或多个层依赖性相关联的预测类型。 视频解码器可以获得与给定层相关联的层依赖性以及与一个或多个层依赖性相关联的预测类型。

    MOTION VECTOR PREDICTION IN VIDEO CODING
    19.
    发明申请
    MOTION VECTOR PREDICTION IN VIDEO CODING 有权
    运动矢量预测在视频编码

    公开(公告)号:US20130272408A1

    公开(公告)日:2013-10-17

    申请号:US13853580

    申请日:2013-03-29

    Abstract: Aspects of this disclosure relate to, in an example, a method that includes identifying a first block of video data in a first temporal location from a first view, wherein the first block is associated with a first disparity motion vector. The method also includes determining a motion vector predictor for a second motion vector associated with a second block of video data, wherein the motion vector predictor is based on the first disparity motion vector. When the second motion vector comprises a disparity motion vector, the method includes determining the motion vector predictor comprises scaling the first disparity motion vector to generate a scaled motion vector predictor, wherein scaling the first disparity motion vector comprises applying a scaling factor comprising a view distance of the second disparity motion vector divided by a view distance of the first motion vector to the first disparity motion vector.

    Abstract translation: 本公开的方面在一个示例中涉及一种包括从第一视图识别第一时间位置中的第一视频数据块的方法,其中第一块与第一视差运动矢量相关联。 该方法还包括确定与第二视频数据块相关联的第二运动矢量的运动矢量预测器,其中运动矢量预测器基于第一视差运动矢量。 当第二运动矢量包括视差运动矢量时,该方法包括确定运动矢量预测器包括缩放第一视差运动矢量以产生缩放的运动矢量预测器,其中缩放第一视差运动矢量包括应用包括视距 的第二视差运动矢量除以第一运动矢量与第一视差运动矢量的视距。

    VIEW SYNTHESIS BASED ON ASYMMETRIC TEXTURE AND DEPTH RESOLUTIONS
    20.
    发明申请
    VIEW SYNTHESIS BASED ON ASYMMETRIC TEXTURE AND DEPTH RESOLUTIONS 审中-公开
    查看基于不对称纹理和深度分辨率的合成

    公开(公告)号:US20130271565A1

    公开(公告)日:2013-10-17

    申请号:US13774430

    申请日:2013-02-22

    CPC classification number: H04N13/161 H04N13/111 H04N19/597 H04N2213/003

    Abstract: An apparatus for processing video data includes a processor configured to associate, in a minimum processing unit (MPU), one pixel of a depth image of a reference picture with one or more pixels of a first chroma component of a texture image of the reference picture, associate, in the MPU, the one pixel of the depth image with one or more pixels of a second chroma component of the texture image, and associate, in the MPU, the one pixel of the depth image with a plurality of pixels of a luma component of the texture image. The number of the pixels of the luma component is different than the number of the one or more pixels of the first chroma component and the number of the one or more pixels of the second chroma component.

    Abstract translation: 一种用于处理视频数据的设备包括:处理器,被配置为在最小处理单元(MPU)中将参考图像的深度图像的一个像素与参考图像的纹理图像的第一色度分量的一个或多个像素相关联 在MPU中与纹理图像的第二色度分量的一个或多个像素相关联的深度图像的一个像素,并且在MPU中将深度图像的一个像素与多个像素 纹理图像的亮度分量。 亮度分量的像素的数量不同于第一色度分量的一个或多个像素的数量和第二色度分量的一个或多个像素的数量。

Patent Agency Ranking