Generation and provision of media metadata
    1.
    发明授权
    Generation and provision of media metadata 有权
    生成和提供媒体元数据

    公开(公告)号:US08763068B2

    公开(公告)日:2014-06-24

    申请号:US12964597

    申请日:2010-12-09

    Abstract: Various embodiments related to the generation and provision of media metadata are disclosed. For example, one disclosed embodiment provides a computing device having a logic subsystem configured to execute instructions, and a data holding subsystem comprising instructions stored thereon that are executable by the processor to receive an input of a video and/or audio content item, and to compare the content item to one or more object descriptors each representing an object for locating within the content item to locate instances of one or more of the objects in the content item. The instructions are further executable to generate metadata for each object located in the video content item, and to receive a validating user input related to whether the metadata generated for a selected object is correct.

    Abstract translation: 公开了与生成和提供媒体元数据相关的各种实施例。 例如,一个公开的实施例提供了具有被配置为执行指令的逻辑子系统的计算设备,以及包括存储在其上的指令的数据保持子系统,其可由处理器执行以接收视频和/或音频内容项目的输入,并且 将内容项目与一个或多个对象描述符进行比较,每个对象描述符表示用于在内容项目中定位的对象以定位内容项目中的一个或多个对象的实例。 指令还可执行以为位于视频内容项目中的每个对象生成元数据,并且接收与为所选对象生成的元数据是否正确相关的验证用户输入。

    Rotation and scaling optimization for mobile devices
    2.
    发明授权
    Rotation and scaling optimization for mobile devices 失效
    移动设备的旋转和缩放优化

    公开(公告)号:US07710434B2

    公开(公告)日:2010-05-04

    申请号:US11755082

    申请日:2007-05-30

    Applicant: Chuang Gu

    Inventor: Chuang Gu

    Abstract: Image processing in mobile devices is optimized by combining at least two of the color conversion, rotation, and scaling operations. Received images, such as still images or frames of video stream, are subjected to a combined transformation after decoding, where each pixel is color converted (e.g. from YUV to RGB), rotated, and scaled as needed. By combining two or three of the processes into one, read/write operations consuming significant processing and memory resources are reduced enabling processing of higher resolution images and/or power and processing resource savings.

    Abstract translation: 通过组合至少两个颜色转换,旋转和缩放操作来优化移动设备中的图像处理。 接收到的图像,例如静止图像或视频流的帧,在解码之后进行组合变换,其中每个像素被颜色转换(例如从YUV到RGB),旋转和根据需要进行缩放。 通过将两个或三个进程组合​​成一个,消耗重要处理和存储器资源的读/写操作被减少,使得能够处理更高分辨率图像和/或功率并且处理资源节省。

    Parallel multiple bitrate video encoding to reduce latency and dependences between groups of pictures
    3.
    发明授权
    Parallel multiple bitrate video encoding to reduce latency and dependences between groups of pictures 有权
    并行多位码率视频编码,以减少图像组之间的延迟和依赖关系

    公开(公告)号:US08705616B2

    公开(公告)日:2014-04-22

    申请号:US12814060

    申请日:2010-06-11

    CPC classification number: H04N19/436 H04N19/30

    Abstract: A multiple bitrate (MBR) video encoding management tool utilizes available processing units for parallel MBR video encoding. Instead of focusing only on multi-threading of encoding tasks for a single picture or group of pictures (GOP), the management tool parallelizes the encoding of multiple GOPs between different processing units and/or different computing systems. With this parallel MBR video encoding architecture, different GOPs can be encoded in parallel. To facilitate such parallel encoding, data dependencies between GOPs are removed. The management tool can adjust the number of GOPs to encode in parallel on a computing system so as to favor parallelism of encoding for different GOPs at the expense of parallelism of encoding inside a GOP, or vice versa, and thereby set a suitable balance between encoding latency and throughput.

    Abstract translation: 多位比特率(MBR)视频编码管理工具利用可用的处理单元进行并行MBR视频编码。 管理工具不是专注于单个图像或图像组(GOP)的编码任务的多线程,而是在不同的处理单元和/或不同的计算系统之间并行化多个GOP的编码。 利用这种并行MBR视频编码架构,可以并行编码不同的GOP。 为了促进这种并行编码,GOP之间的数据依赖性被去除。 管理工具可以在计算系统上调节GOP并行编码的数量,从而以牺牲GOP内编码并行性为代价的优点,对不同的GOP进行编码的平行化,反之亦然,从而在编码之间设置适当的平衡 延迟和吞吐量。

    Face Recognition in Video Content
    4.
    发明申请
    Face Recognition in Video Content 有权
    视频内容中的人脸识别

    公开(公告)号:US20120106806A1

    公开(公告)日:2012-05-03

    申请号:US12916895

    申请日:2010-11-01

    CPC classification number: G06K9/00295 G06K2009/00328

    Abstract: The subject disclosure relates to face recognition in video. Face detection data in frames of input data are used to generate face galleries, which are labeled and used in recognizing faces throughout the video. Metadata that associates the video frame and the face are generated and maintained for subsequent identification. Faces other than those found by face detection may be found by face tracking, in which facial landmarks found by the face detection are used to track a face over previous and/or subsequent video frames. Once generated, the maintained metadata may be accessed to efficiently determine the identity of a person corresponding to a viewer-selected face.

    Abstract translation: 本公开涉及视频中的面部识别。 输入数据帧中的脸部检测数据用于生成面部画廊,这些画廊被标记并用于识别整个视频中的脸部。 生成并维护与视频帧和脸部相关联的元数据,以便后续识别。 面部检测以外的脸部可以通过脸部跟踪来发现,其中通过面部检测发现的面部地标用于跟踪先前和/或后续视频帧的面部。 一旦生成,可以访问维护的元数据以有效地确定与观众选择的面对应的人的身份。

    EFFICIENT EXTRACTION AND COMPRESSION OF DATA
    5.
    发明申请
    EFFICIENT EXTRACTION AND COMPRESSION OF DATA 审中-公开
    有效提取和压缩数据

    公开(公告)号:US20110202509A1

    公开(公告)日:2011-08-18

    申请号:US12706582

    申请日:2010-02-16

    CPC classification number: H03M7/30

    Abstract: A device for dynamically extracting and compressing information for a streaming media asset is provided. One embodiment of the device provides a computing device comprising a processor and memory comprising instructions stored therein that are executable by the processor. The instructions stored in the memory are executable to provide to a requesting computing device dynamically compressed information for a streaming media asset, the dynamically compressed information derived from an information file comprising variable data elements arranged in one or more data fields according to a well-known structure. For example, the instructions are executable to receive from the requesting computing device a request for the compressed information, extract the variable data elements from the information file, compress the variable data elements to form compressed data elements, and send to the requesting computing device a compressed file comprising the compressed data elements.

    Abstract translation: 提供了用于动态提取和压缩用于流媒体资产的信息的设备。 该设备的一个实施例提供了一种计算设备,其包括处理器和存储器,其包含可由处理器执行的存储在其中的指令。 存储在存储器中的指令是可执行的,以向请求计算设备提供用于流媒体资产的动态压缩信息,该动态压缩信息从包括根据众所周知的一个或多个数据字段中布置的可变数据元素的信息文件导出 结构体。 例如,指令可执行以从请求的计算设备接收对压缩信息的请求,从信息文件中提取可变数据元素,压缩可变数据元素以形成压缩数据元素,并且向请求计算设备发送 包括压缩数据元素的压缩文件。

    Region extraction in vector images
    6.
    发明授权
    Region extraction in vector images 有权
    矢量图像中的区域提取

    公开(公告)号:US07088845B2

    公开(公告)日:2006-08-08

    申请号:US10767135

    申请日:2004-01-28

    Abstract: A semantic object tracking method tracks general semantic objects with multiple non-rigid motion, disconnected components and multiple colors throughout a vector image sequence. The method accurately tracks these general semantic objects by spatially segmenting image regions from a current frame and then classifying these regions as to which semantic object they originated from in the previous frame. To classify each region, the method perform a region based motion estimation between each spatially segmented region and the previous frame to computed the position of a predicted region in the previous frame. The method then classifies each region in the current frame as being part of a semantic object based on which semantic object in the previous frame contains the most overlapping points of the predicted region. Using this method, each region in the current image is tracked to one semantic object from the previous frame, with no gaps or overlaps. The method propagates few or no errors because it projects regions into a frame where the semantic object boundaries are previously computed rather than trying to project and adjust a boundary in a frame where the object's boundary is unknown.

    Abstract translation: 语义对象跟踪方法在整个矢量图像序列中跟踪具有多个非刚性运动,断开组件和多种颜色的一般语义对象。 该方法通过对来自当前帧的图像区域进行空间分割,然后对这些区域进行分类,从而准确地跟踪这些一般语义对象,以便它们在前一帧中源自哪个语义对象。 为了对每个区域进行分类,该方法在每个空间分段区域和前一帧之间执行基于区域的运动估计,以计算先前帧中的预测区域的位置。 该方法然后根据前一帧中的哪个语义对象包含预测区域的最重叠点将当前帧中的每个区域分类为语义对象的一部分。 使用该方法,当前图像中的每个区域被跟踪到来自前一帧的一个语义对象,没有间隙或重叠。 该方法传播很少或没有错误,因为它将区域投影到预先计算语义对象边界的框架中,而不是尝试在对象边界未知的框架中投影和调整边界。

    Face recognition in video content
    7.
    发明授权
    Face recognition in video content 有权
    视频内容中的人脸识别

    公开(公告)号:US08494231B2

    公开(公告)日:2013-07-23

    申请号:US12916895

    申请日:2010-11-01

    CPC classification number: G06K9/00295 G06K2009/00328

    Abstract: The subject disclosure relates to face recognition in video. Face detection data in frames of input data are used to generate face galleries, which are labeled and used in recognizing faces throughout the video. Metadata that associates the video frame and the face are generated and maintained for subsequent identification. Faces other than those found by face detection may be found by face tracking, in which facial landmarks found by the face detection are used to track a face over previous and/or subsequent video frames. Once generated, the maintained metadata may be accessed to efficiently determine the identity of a person corresponding to a viewer-selected face.

    Abstract translation: 本公开涉及视频中的面部识别。 输入数据帧中的脸部检测数据用于生成面部画廊,这些画廊被标记并用于识别整个视频中的脸部。 生成并维护与视频帧和脸部相关联的元数据,以便后续识别。 面部检测以外的脸部可以通过脸部跟踪来发现,其中通过面部检测发现的面部地标用于跟踪先前和/或后续视频帧的面部。 一旦生成,可以访问维护的元数据以有效地确定与观众选择的面对应的人的身份。

    STAGED ELEMENT CLASSIFICATION
    8.
    发明申请
    STAGED ELEMENT CLASSIFICATION 有权
    标准元素分类

    公开(公告)号:US20120281886A1

    公开(公告)日:2012-11-08

    申请号:US13102740

    申请日:2011-05-06

    Inventor: Yaming He Chuang Gu

    CPC classification number: G06K9/00288 G06K9/00677 G06K9/6807

    Abstract: Various examples are disclosed herein that relate to staged element classification. For example, one disclosed example provides a method of classifying elements by forming elements for classification into a plurality of first-level sets in a first stage, generating primary groups within the first-level sets based on element similarity, forming a plurality of second-level sets from the first-level sets in a second stage, generating secondary groups within the second-level sets based on element similarity, and merging a plurality of the primary and/or secondary groups based on element similarity.

    Abstract translation: 本文公开了与分段元素分类相关的各种示例。 例如,一个公开的示例提供了一种通过在第一阶段中形成用于分类为多个第一级集合的元素来对元素进行分类的方法,基于元素相似性在第一级集合内生成主组,形成多个第二级组, 基于元素相似度在第二级集合内生成二级组,并且基于元素相似度合并多个主组和/或辅助组。

    GENERATION AND PROVISION OF MEDIA METADATA
    9.
    发明申请
    GENERATION AND PROVISION OF MEDIA METADATA 有权
    媒体元数据的生成和提供

    公开(公告)号:US20120147265A1

    公开(公告)日:2012-06-14

    申请号:US12964597

    申请日:2010-12-09

    Abstract: Various embodiments related to the generation and provision of media metadata are disclosed. For example, one disclosed embodiment provides a computing device having a logic subsystem configured to execute instructions, and a data holding subsystem comprising instructions stored thereon that are executable by the processor to receive an input of a video and/or audio content item, and to compare the content item to one or more object descriptors each representing an object for locating within the content item to locate instances of one or more of the objects in the content item. The instructions are further executable to generate metadata for each object located in the video content item, and to receive a validating user input related to whether the metadata generated for a selected object is correct.

    Abstract translation: 公开了与生成和提供媒体元数据相关的各种实施例。 例如,一个公开的实施例提供了具有被配置为执行指令的逻辑子系统的计算设备,以及包括存储在其上的指令的数据保持子系统,其可由处理器执行以接收视频和/或音频内容项目的输入,并且 将内容项目与一个或多个对象描述符进行比较,每个对象描述符表示用于在内容项目中定位的对象以定位内容项目中的一个或多个对象的实例。 指令还可执行以为位于视频内容项目中的每个对象生成元数据,并且接收与为所选对象生成的元数据是否正确相关的验证用户输入。

    VIDEO ENCODING USING PREVIOUSLY CALCULATED MOTION INFORMATION
    10.
    发明申请
    VIDEO ENCODING USING PREVIOUSLY CALCULATED MOTION INFORMATION 有权
    使用先前计算的运动信息进行视频编码

    公开(公告)号:US20100189179A1

    公开(公告)日:2010-07-29

    申请号:US12362427

    申请日:2009-01-29

    Abstract: A video encoder uses previously calculated motion information for inter frame coding to achieve faster computation speed for video compression. In a multi bit rate application, motion information produced by motion estimation for inter frame coding of a compressed video bit stream at one bit rate is passed on to a subsequent encoding of the video at a lower bit rate. The video encoder chooses to use the previously calculated motion information for inter frame coding at the lower bit rate if the video resolution is unchanged. A multi core motion information pre-calculation produces motion information prior to encoding by dividing motion estimation of each inter frame to separate CPU cores.

    Abstract translation: 视频编码器使用先前计算的帧间编码运动信息来实现视频压缩的更快的计算速度。 在多比特率应用中,通过用于以一个比特率的压缩视频比特流的帧间编码的运动估计产生的运动信息被传递到较低比特率的视频的后续编码。 如果视频分辨率不变,则视频编码器选择使用先前计算的运动信息用于较低比特率的帧间编码。 多核心运动信息预计算在编码之前通过将每个帧间的运动估计划分成分离的CPU核心来产生运动信息。

Patent Agency Ranking