VIDEO SUMMARIZATION USING AUDIO AND VISUAL CUES
    1.
    Type: Patent application
    Status: Under examination (published)

    Publication number: US20120281969A1

    Publication date: 2012-11-08

    Application number: US13099391

    Filing date: 2011-05-03

    IPC classification: G11B27/00

    CPC classification: G11B27/034 G11B27/11

    Abstract: A method for producing an audio-visual slideshow for a video sequence having an audio soundtrack and a corresponding video track including a time sequence of image frames, comprising: segmenting the audio soundtrack into a plurality of audio segments; subdividing the audio segments into a sequence of audio frames; determining a corresponding audio classification for each audio frame; automatically selecting a subset of the audio segments responsive to the audio classification for the corresponding audio frames; for each of the selected audio segments automatically analyzing the corresponding image frames to select one or more key image frames; merging the selected audio segments to form an audio summary; forming an audio-visual slideshow by combining the selected key frames with the audio summary, wherein the selected key frames are displayed synchronously with their corresponding audio segment; and storing the audio-visual slideshow in a processor-accessible storage memory.
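The steps in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented method: the `AudioSegment` type, the choice of "speech" as the wanted class, the 0.5 threshold, and the midpoint key-frame rule are all assumptions, since the abstract does not say how segments or key frames are chosen.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class AudioSegment:
    """Hypothetical container: one audio segment and its per-frame labels."""
    start: float             # segment start time in seconds
    end: float               # segment end time in seconds
    frame_labels: List[str]  # one classification per audio frame


def select_segments(segments: List[AudioSegment],
                    wanted: str = "speech",
                    min_fraction: float = 0.5) -> List[AudioSegment]:
    """Keep segments in which enough audio frames carry the wanted class."""
    selected = []
    for seg in segments:
        if not seg.frame_labels:
            continue
        fraction = seg.frame_labels.count(wanted) / len(seg.frame_labels)
        if fraction >= min_fraction:
            selected.append(seg)
    return selected


def build_slideshow(segments: List[AudioSegment],
                    fps: float = 30.0) -> List[Tuple[int, float, float]]:
    """Return (key_frame_index, summary_start, summary_end) per kept segment.

    The kept segments are laid end to end to form the audio summary, and each
    key frame is shown for the duration of its own segment.
    """
    slides = []
    cursor = 0.0  # running position inside the merged audio summary
    for seg in select_segments(segments):
        duration = seg.end - seg.start
        # Placeholder key-frame rule (assumption): the frame at the midpoint.
        key_frame = int(((seg.start + seg.end) / 2) * fps)
        slides.append((key_frame, cursor, cursor + duration))
        cursor += duration
    return slides
```

A real system would replace the midpoint rule with an image-quality or content analysis of the frames inside each segment, as the abstract requires.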


    ADAPTIVE MULTIMEDIA SEMANTIC CONCEPT CLASSIFIER
    2.
    Type: Patent application
    Status: In force

    Publication number: US20120109964A1

    Publication date: 2012-05-03

    Application number: US12912820

    Filing date: 2010-10-27

    IPC classification: G06F17/30

    CPC classification: G06F17/30038

    Abstract: A method of classifying a set of semantic concepts on a second multimedia collection based upon adapting a set of semantic concept classifiers and updating concept affinity relations that were developed to classify the set of semantic concepts for a first multimedia collection. The method comprises providing the second multimedia collection from a different domain and a processor automatically classifying the semantic concepts from the second multimedia collection by adapting the semantic concept classifiers and updating the concept affinity relations to the second multimedia collection based upon the local smoothness over the concept affinity relations and the local smoothness over data affinity relations.


    Video concept classification using video similarity scores
    3.
    Type: Granted patent
    Status: Lapsed

    Publication number: US08699852B2

    Publication date: 2014-04-15

    Application number: US13269753

    Filing date: 2011-10-10

    IPC classification: H04N5/92

    Abstract: A method for determining a semantic concept classification for a digital video clip, comprising: receiving an audio-visual dictionary including a plurality of audio-visual grouplets, the audio-visual grouplets including visual background and foreground codewords, audio background and foreground codewords, wherein the codewords in a particular audio-visual grouplet were determined to be correlated with each other; determining reference video codeword similarity scores for a set of reference video clips; determining codeword similarity scores for the digital video clip; determining a reference video similarity score for each reference video clip representing a similarity between the digital video clip and the reference video clip responsive to the audio-visual grouplets, the codeword similarity scores and the reference video codeword similarity scores; and determining one or more semantic concept classifications using trained semantic classifiers responsive to the determined reference video similarity scores.


    VIDEO CONCEPT CLASSIFICATION USING TEMPORALLY-CORRELATED GROUPLETS
    4.
    Type: Patent application
    Status: Under examination (published)

    Publication number: US20130251340A1

    Publication date: 2013-09-26

    Application number: US13425455

    Filing date: 2012-03-21

    IPC classification: H04N5/91

    Abstract: A method for determining a semantic concept classification for a digital video clip based on a grouplet dictionary that includes a plurality of temporally-correlated grouplets. The temporally-correlated grouplets include textual codewords and either visual codewords or audio codewords, wherein the codewords in a particular temporally-correlated grouplet were determined to be correlated with each other based on analysis of a set of training videos. Reference video codeword similarity scores are determined for a set of reference video clips, and codeword similarity scores are determined for the digital video clip. A reference video similarity score is determined for each reference video clip representing a similarity between the digital video clip and the reference video clip based on the reference video codeword similarity scores, the codeword similarity scores, and the temporally-correlated grouplets. One or more semantic concept classifications are determined using trained semantic classifiers responsive to the determined reference video similarity scores.


    VIDEO CONCEPT CLASSIFICATION USING AUDIO-VISUAL GROUPLETS
    5.
    Type: Patent application
    Status: In force

    Publication number: US20130089303A1

    Publication date: 2013-04-11

    Application number: US13269742

    Filing date: 2011-10-10

    IPC classification: H04N5/93

    Abstract: A method for determining a semantic concept classification for a digital video clip, comprising: receiving an audio-visual dictionary including a plurality of audio-visual grouplets, the audio-visual grouplets including visual background and foreground codewords, audio background and foreground codewords, wherein the codewords in a particular audio-visual grouplet were determined to be correlated with each other; analyzing the digital video clip to determine a set of visual features and a set of audio features; determining similarity scores between the digital video clip and each of the audio-visual grouplets by comparing the set of visual features to any visual background and foreground codewords associated with a particular audio-visual grouplet, and comparing the set of audio features to any audio background and foreground codewords associated with the particular audio-visual grouplet; and determining one or more semantic concept classifications using trained semantic classifiers.
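The grouplet comparison step in this abstract might look something like the sketch below. It is only an assumed scoring rule: the abstract says features are compared to codewords but does not give the measure, so the `1 / (1 + distance)` similarity, the best-match rule, the averaging, and the dictionary layout with `"visual"` and `"audio"` keys are all hypothetical.

```python
import math
from typing import Dict, List, Sequence

Vector = Sequence[float]


def _best_match(codeword: Vector, features: List[Vector]) -> float:
    """Similarity of a codeword to its closest clip feature, in (0, 1]."""
    return max(1.0 / (1.0 + math.dist(codeword, f)) for f in features)


def grouplet_similarity(grouplet: Dict[str, List[Vector]],
                        clip: Dict[str, List[Vector]]) -> float:
    """Score one clip against one grouplet.

    Visual codewords are compared only with visual features, and audio
    codewords only with audio features. The score is the mean best-match
    similarity over all codewords in the grouplet (an assumed rule).
    """
    scores = []
    for modality in ("visual", "audio"):
        features = clip.get(modality, [])
        for codeword in grouplet.get(modality, []):
            # A codeword with no features to match contributes zero.
            scores.append(_best_match(codeword, features) if features else 0.0)
    return sum(scores) / len(scores) if scores else 0.0
```

The resulting per-grouplet scores would then form the input vector for the trained semantic classifiers mentioned in the final step.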


    Semantic event detection using cross-domain knowledge
    6.
    Type: Granted patent
    Status: In force

    Publication number: US08213725B2

    Publication date: 2012-07-03

    Application number: US12408140

    Filing date: 2009-03-20

    IPC classification: G06K9/62

    Abstract: A method for facilitating semantic event classification of a group of image records related to an event. The method uses an event detector system for: extracting a plurality of visual features from each of the image records, wherein extracting the visual features includes segmenting an image record into a number of regions from which the visual features are extracted; generating a plurality of concept scores for each of the image records using the visual features, wherein each concept score corresponds to a visual concept and each concept score is indicative of a probability that the image record includes the visual concept; generating a feature vector corresponding to the event based on the concept scores of the image records; and supplying the feature vector to an event classifier that identifies at least one semantic event classifier that corresponds to the event.
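One simple way to turn per-image concept scores into a single event-level feature vector is to average them, as sketched below. The averaging rule and the function name `event_feature_vector` are assumptions for illustration; the abstract only states that the vector is based on the concept scores.

```python
from typing import List, Sequence


def event_feature_vector(concept_scores: Sequence[Sequence[float]]) -> List[float]:
    """Average per-image concept scores into one vector for the whole event.

    concept_scores[i][k] is the probability that image record i contains
    visual concept k. Every image must be scored on the same concepts.
    """
    if not concept_scores:
        raise ValueError("need at least one image record")
    n_concepts = len(concept_scores[0])
    if any(len(row) != n_concepts for row in concept_scores):
        raise ValueError("all image records must have the same number of scores")
    n_images = len(concept_scores)
    return [sum(row[k] for row in concept_scores) / n_images
            for k in range(n_concepts)]
```

For example, two images scored on two concepts as `[0.2, 0.8]` and `[0.4, 0.6]` give an event vector close to `[0.3, 0.7]`, which would then be passed to the event classifier.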


    VIDEO CONCEPT CLASSIFICATION USING AUDIO-VISUAL ATOMS
    7.
    Type: Patent application
    Status: In force

    Publication number: US20110081082A1

    Publication date: 2011-04-07

    Application number: US12574716

    Filing date: 2009-10-07

    IPC classification: G06K9/00 G06K9/62 G10L11/00

    CPC classification: G06K9/00765 G10L25/00

    Abstract: A method for determining a classification for a video segment, comprising the steps of: breaking the video segment into a plurality of short-term video slices, each including a plurality of video frames and an audio signal; analyzing the video frames for each short-term video slice to form a plurality of region tracks; analyzing each region track to form a visual feature vector and a motion feature vector; analyzing the audio signal for each short-term video slice to determine an audio feature vector; forming a plurality of short-term audio-visual atoms for each short-term video slice by combining the visual feature vector and the motion feature vector for a particular region track with the corresponding audio feature vector; and using a classifier to determine a classification for the video segment responsive to the short-term audio-visual atoms.
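The atom-forming step can be illustrated with plain concatenation, as in the sketch below. Concatenation is an assumption: the abstract says the vectors are combined but not how, and the function name `form_atoms` is hypothetical.

```python
from typing import List, Sequence, Tuple

Vector = List[float]


def form_atoms(region_tracks: Sequence[Tuple[Vector, Vector]],
               audio_feature: Vector) -> List[Vector]:
    """Build one audio-visual atom per region track in a short-term slice.

    Each region track supplies a (visual, motion) pair of feature vectors.
    The slice has a single audio feature vector, which is shared by every
    atom formed from that slice.
    """
    return [list(visual) + list(motion) + list(audio_feature)
            for visual, motion in region_tracks]
```

A slice with two region tracks therefore yields two atoms that differ in their visual and motion parts but carry the same audio part.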


    Audio signal semantic concept classification method
    8.
    Type: Granted patent
    Status: In force

    Publication number: US09111547B2

    Publication date: 2015-08-18

    Application number: US13591489

    Filing date: 2012-08-22

    IPC classification: G06F15/18 G10L25/51

    CPC classification: G10L25/51

    Abstract: A method for determining a semantic concept associated with an audio signal captured using an audio sensor. A data processor is used to automatically analyze the audio signal using a plurality of semantic concept detectors to determine corresponding preliminary semantic concept detection values, each semantic concept detector being adapted to detect a particular semantic concept. The preliminary semantic concept detection values are analyzed using a joint likelihood model based on predetermined pair-wise likelihoods that particular pairs of semantic concepts co-occur to determine updated semantic concept detection values. One or more semantic concepts are determined based on the updated semantic concept detection values. The semantic concept detectors and the joint likelihood model are trained together with a joint training process using training audio signals, at least some of which are known to be associated with a plurality of semantic concepts.
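The idea of updating detection values with pair-wise co-occurrence information can be illustrated with the toy rule below. It is a simple heuristic stand-in and not the joint likelihood model described in the abstract: the boost formula, the `alpha` parameter, and the `frozenset`-keyed co-occurrence table are all assumptions, and no joint training is shown.

```python
from typing import Dict, FrozenSet


def refine_scores(preliminary: Dict[str, float],
                  cooccurrence: Dict[FrozenSet[str], float],
                  alpha: float = 0.5) -> Dict[str, float]:
    """Raise each concept's score when a likely co-occurring concept scores high.

    preliminary maps concept name to a detection value in [0, 1].
    cooccurrence maps an unordered pair of concepts to a co-occurrence
    likelihood in [0, 1]. Missing pairs count as 0.
    """
    updated = {}
    for concept, score in preliminary.items():
        # Strongest supporting evidence from any other concept.
        support = max(
            (cooccurrence.get(frozenset((concept, other)), 0.0) * other_score
             for other, other_score in preliminary.items() if other != concept),
            default=0.0,
        )
        # Move the score part of the way toward 1, in proportion to support.
        updated[concept] = score + alpha * (1.0 - score) * support
    return updated
```

With this rule, a weak "cheering" detection is raised when "crowd" is detected strongly and the two are known to co-occur often, while an unrelated concept is left almost unchanged.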


    Video concept classification using audio-visual grouplets
    9.
    Type: Granted patent
    Status: In force

    Publication number: US08867891B2

    Publication date: 2014-10-21

    Application number: US13269742

    Filing date: 2011-10-10

    Abstract: A method for determining a semantic concept classification for a digital video clip, comprising: receiving an audio-visual dictionary including a plurality of audio-visual grouplets, the audio-visual grouplets including visual background and foreground codewords, audio background and foreground codewords, wherein the codewords in a particular audio-visual grouplet were determined to be correlated with each other; analyzing the digital video clip to determine a set of visual features and a set of audio features; determining similarity scores between the digital video clip and each of the audio-visual grouplets by comparing the set of visual features to any visual background and foreground codewords associated with a particular audio-visual grouplet, and comparing the set of audio features to any audio background and foreground codewords associated with the particular audio-visual grouplet; and determining one or more semantic concept classifications using trained semantic classifiers.


    AUDIO SIGNAL SEMANTIC CONCEPT CLASSIFICATION METHOD
    10.
    Type: Patent application
    Status: In force

    Publication number: US20140056432A1

    Publication date: 2014-02-27

    Application number: US13591489

    Filing date: 2012-08-22

    IPC classification: H04R29/00

    CPC classification: G10L25/51

    Abstract: A method for determining a semantic concept associated with an audio signal captured using an audio sensor. A data processor is used to automatically analyze the audio signal using a plurality of semantic concept detectors to determine corresponding preliminary semantic concept detection values, each semantic concept detector being adapted to detect a particular semantic concept. The preliminary semantic concept detection values are analyzed using a joint likelihood model based on predetermined pair-wise likelihoods that particular pairs of semantic concepts co-occur to determine updated semantic concept detection values. One or more semantic concepts are determined based on the updated semantic concept detection values. The semantic concept detectors and the joint likelihood model are trained together with a joint training process using training audio signals, at least some of which are known to be associated with a plurality of semantic concepts.
