Tracking semantic objects in vector image sequences
    13.
    发明申请
    Tracking semantic objects in vector image sequences 有权
    跟踪矢量图像序列中的语义对象

    公开(公告)号:US20050240629A1

    公开(公告)日:2005-10-27

    申请号:US11171448

    申请日:2005-06-29

    Abstract: A semantic object tracking method tracks general semantic objects with multiple non-rigid motion, disconnected components and multiple colors throughout a vector image sequence. The method accurately tracks these general semantic objects by spatially segmenting image regions from a current frame and then classifying these regions as to which semantic object they originated from in the previous frame. To classify each region, the method perform a region based motion estimation between each spatially segmented region and the previous frame to computed the position of a predicted region in the previous frame. The method then classifies each region in the current frame as being part of a semantic object based on which semantic object in the previous frame contains the most overlapping points of the predicted region. Using this method, each region in the current image is tracked to one semantic object from the previous frame, with no gaps or overlaps. The method propagates few or no errors because it projects regions into a frame where the semantic object boundaries are previously computed rather than trying to project and adjust a boundary in a frame where the object's boundary is unknown.

    Abstract translation: 语义对象跟踪方法在整个矢量图像序列中跟踪具有多个非刚性运动,断开组件和多种颜色的一般语义对象。 该方法通过对来自当前帧的图像区域进行空间分割,然后对这些区域进行分类,从而准确地跟踪这些一般语义对象,以便它们在前一帧中源自哪个语义对象。 为了对每个区域进行分类,该方法在每个空间分段区域和前一帧之间执行基于区域的运动估计,以计算先前帧中的预测区域的位置。 该方法然后根据前一帧中的哪个语义对象包含预测区域的最重叠点将当前帧中的每个区域分类为语义对象的一部分。 使用该方法,当前图像中的每个区域被跟踪到来自前一帧的一个语义对象,没有间隙或重叠。 该方法传播很少或没有错误,因为它将区域投影到预先计算语义对象边界的框架中,而不是尝试在对象边界未知的框架中投影和调整边界。

    Segmentation of image features using hierarchical analysis of
multi-valued image data and weighted averaging of segmentation results
    14.
    发明授权
    Segmentation of image features using hierarchical analysis of multi-valued image data and weighted averaging of segmentation results 失效
    使用多值图像数据的分层分析和分割结果的加权平均来分割图像特征

    公开(公告)号:US6075875A

    公开(公告)日:2000-06-13

    申请号:US722981

    申请日:1996-09-30

    Applicant: Chuang Gu

    Inventor: Chuang Gu

    Abstract: Homogeneous moving objects of arbitrary shapes are segmented and tracked with respect to the motion of the objects. In an intraframe mode of operation, a segmentation method includes obtaining a motion representation of corresponding pixels in the selected video image frame and a preceding video image frame to form motion-segmented video image features. Video image features are also segmented according to their spatial image characteristics (e.g., color) to form spatially-segmented video image features. Finally, the video image features are jointly segmented as a weighted combination of the motion-segmented video image features and the spatially-segmented video image features. The joint motion and spatial segmentation of image features provides enhanced accuracy in representing moving image features. This enhanced accuracy is particularly beneficial because the motion of image features is a significant display characteristic for human observers.

    Abstract translation: 任意形状的均匀移动物体相对于物体的运动被分段和跟踪。 在帧内操作模式中,分割方法包括获得所选择的视频图像帧中的相应像素的运动表示和先前的视频图像帧,以形成运动分割的视频图像特征。 视频图像特征也根据其空间图像特征(例如,颜色)被分割以形成空间分割的视频图像特征。 最后,视频图像特征被联合分割为运动分割的视频图像特征和空间分割的视频图像特征的加权组合。 图像特征的联合运动和空间分割提供了表示运动图像特征的增强的准确度。 这种增强的精度是特别有益的,因为图像特征的运动是人类观察者的显着特征。

    Parallel multiple bitrate video encoding to reduce latency and dependences between groups of pictures
    15.
    发明授权
    Parallel multiple bitrate video encoding to reduce latency and dependences between groups of pictures 有权
    并行多位码率视频编码,以减少图像组之间的延迟和依赖关系

    公开(公告)号:US08705616B2

    公开(公告)日:2014-04-22

    申请号:US12814060

    申请日:2010-06-11

    CPC classification number: H04N19/436 H04N19/30

    Abstract: A multiple bitrate (MBR) video encoding management tool utilizes available processing units for parallel MBR video encoding. Instead of focusing only on multi-threading of encoding tasks for a single picture or group of pictures (GOP), the management tool parallelizes the encoding of multiple GOPs between different processing units and/or different computing systems. With this parallel MBR video encoding architecture, different GOPs can be encoded in parallel. To facilitate such parallel encoding, data dependencies between GOPs are removed. The management tool can adjust the number of GOPs to encode in parallel on a computing system so as to favor parallelism of encoding for different GOPs at the expense of parallelism of encoding inside a GOP, or vice versa, and thereby set a suitable balance between encoding latency and throughput.

    Abstract translation: 多位比特率(MBR)视频编码管理工具利用可用的处理单元进行并行MBR视频编码。 管理工具不是专注于单个图像或图像组(GOP)的编码任务的多线程,而是在不同的处理单元和/或不同的计算系统之间并行化多个GOP的编码。 利用这种并行MBR视频编码架构,可以并行编码不同的GOP。 为了促进这种并行编码,GOP之间的数据依赖性被去除。 管理工具可以在计算系统上调节GOP并行编码的数量,从而以牺牲GOP内编码并行性为代价的优点,对不同的GOP进行编码的平行化,反之亦然,从而在编码之间设置适当的平衡 延迟和吞吐量。

    Face Recognition in Video Content
    16.
    发明申请
    Face Recognition in Video Content 有权
    视频内容中的人脸识别

    公开(公告)号:US20120106806A1

    公开(公告)日:2012-05-03

    申请号:US12916895

    申请日:2010-11-01

    CPC classification number: G06K9/00295 G06K2009/00328

    Abstract: The subject disclosure relates to face recognition in video. Face detection data in frames of input data are used to generate face galleries, which are labeled and used in recognizing faces throughout the video. Metadata that associates the video frame and the face are generated and maintained for subsequent identification. Faces other than those found by face detection may be found by face tracking, in which facial landmarks found by the face detection are used to track a face over previous and/or subsequent video frames. Once generated, the maintained metadata may be accessed to efficiently determine the identity of a person corresponding to a viewer-selected face.

    Abstract translation: 本公开涉及视频中的面部识别。 输入数据帧中的脸部检测数据用于生成面部画廊,这些画廊被标记并用于识别整个视频中的脸部。 生成并维护与视频帧和脸部相关联的元数据,以便后续识别。 面部检测以外的脸部可以通过脸部跟踪来发现,其中通过面部检测发现的面部地标用于跟踪先前和/或后续视频帧的面部。 一旦生成,可以访问维护的元数据以有效地确定与观众选择的面对应的人的身份。

    EFFICIENT EXTRACTION AND COMPRESSION OF DATA
    17.
    发明申请
    EFFICIENT EXTRACTION AND COMPRESSION OF DATA 审中-公开
    有效提取和压缩数据

    公开(公告)号:US20110202509A1

    公开(公告)日:2011-08-18

    申请号:US12706582

    申请日:2010-02-16

    CPC classification number: H03M7/30

    Abstract: A device for dynamically extracting and compressing information for a streaming media asset is provided. One embodiment of the device provides a computing device comprising a processor and memory comprising instructions stored therein that are executable by the processor. The instructions stored in the memory are executable to provide to a requesting computing device dynamically compressed information for a streaming media asset, the dynamically compressed information derived from an information file comprising variable data elements arranged in one or more data fields according to a well-known structure. For example, the instructions are executable to receive from the requesting computing device a request for the compressed information, extract the variable data elements from the information file, compress the variable data elements to form compressed data elements, and send to the requesting computing device a compressed file comprising the compressed data elements.

    Abstract translation: 提供了用于动态提取和压缩用于流媒体资产的信息的设备。 该设备的一个实施例提供了一种计算设备,其包括处理器和存储器,其包含可由处理器执行的存储在其中的指令。 存储在存储器中的指令是可执行的,以向请求计算设备提供用于流媒体资产的动态压缩信息,该动态压缩信息从包括根据众所周知的一个或多个数据字段中布置的可变数据元素的信息文件导出 结构体。 例如,指令可执行以从请求的计算设备接收对压缩信息的请求,从信息文件中提取可变数据元素,压缩可变数据元素以形成压缩数据元素,并且向请求计算设备发送 包括压缩数据元素的压缩文件。

    Region extraction in vector images
    18.
    发明授权
    Region extraction in vector images 有权
    矢量图像中的区域提取

    公开(公告)号:US07088845B2

    公开(公告)日:2006-08-08

    申请号:US10767135

    申请日:2004-01-28

    Abstract: A semantic object tracking method tracks general semantic objects with multiple non-rigid motion, disconnected components and multiple colors throughout a vector image sequence. The method accurately tracks these general semantic objects by spatially segmenting image regions from a current frame and then classifying these regions as to which semantic object they originated from in the previous frame. To classify each region, the method perform a region based motion estimation between each spatially segmented region and the previous frame to computed the position of a predicted region in the previous frame. The method then classifies each region in the current frame as being part of a semantic object based on which semantic object in the previous frame contains the most overlapping points of the predicted region. Using this method, each region in the current image is tracked to one semantic object from the previous frame, with no gaps or overlaps. The method propagates few or no errors because it projects regions into a frame where the semantic object boundaries are previously computed rather than trying to project and adjust a boundary in a frame where the object's boundary is unknown.

    Abstract translation: 语义对象跟踪方法在整个矢量图像序列中跟踪具有多个非刚性运动,断开组件和多种颜色的一般语义对象。 该方法通过对来自当前帧的图像区域进行空间分割,然后对这些区域进行分类,从而准确地跟踪这些一般语义对象,以便它们在前一帧中源自哪个语义对象。 为了对每个区域进行分类,该方法在每个空间分段区域和前一帧之间执行基于区域的运动估计,以计算先前帧中的预测区域的位置。 该方法然后根据前一帧中的哪个语义对象包含预测区域的最重叠点将当前帧中的每个区域分类为语义对象的一部分。 使用该方法,当前图像中的每个区域被跟踪到来自前一帧的一个语义对象,没有间隙或重叠。 该方法传播很少或没有错误,因为它将区域投影到预先计算语义对象边界的框架中,而不是尝试在对象边界未知的框架中投影和调整边界。

    Video encoding using previously calculated motion information
    20.
    发明授权
    Video encoding using previously calculated motion information 有权
    使用先前计算的运动信息的视频编码

    公开(公告)号:US08311115B2

    公开(公告)日:2012-11-13

    申请号:US12362427

    申请日:2009-01-29

    Abstract: A video encoder uses previously calculated motion information for inter frame coding to achieve faster computation speed for video compression. In a multi bit rate application, motion information produced by motion estimation for inter frame coding of a compressed video bit stream at one bit rate is passed on to a subsequent encoding of the video at a lower bit rate. The video encoder chooses to use the previously calculated motion information for inter frame coding at the lower bit rate if the video resolution is unchanged. A multi core motion information pre-calculation produces motion information prior to encoding by dividing motion estimation of each inter frame to separate CPU cores.

    Abstract translation: 视频编码器使用先前计算的帧间编码运动信息来实现视频压缩的更快的计算速度。 在多比特率应用中,通过用于以一个比特率的压缩视频比特流的帧间编码的运动估计产生的运动信息被传递到较低比特率的视频的后续编码。 如果视频分辨率不变,则视频编码器选择使用先前计算的运动信息用于较低比特率的帧间编码。 多核心运动信息预计算在编码之前通过将每个帧间的运动估计划分成分离的CPU核心来产生运动信息。

Patent Agency Ranking