System and method for video summarization
    1.
    Granted invention patent
    System and method for video summarization (status: active)

    Publication No.: US08200063B2

    Publication date: 2012-06-12

    Application No.: US11860436

    Filing date: 2007-09-24

    IPC classification: H04N9/80

    Abstract: The subject invention relates to a system and method for video summarization, and more specifically to a system for segmenting and classifying data from a video in order to create a summary video that preserves and summarizes relevant content. In one embodiment, the system first extracts appearance, motion, and audio features from a video in order to create video segments corresponding to the extracted features. The video segments are then classified as dynamic or static depending on the appearance-based and motion-based features extracted from each video segment. The classified video segments are then grouped into clusters to eliminate redundant content. Video segments from each cluster are then selected as summary segments, and the summary segments are compiled to form a summary video. The parameters for any of the steps in the summarization of the video can be altered so that a user can adapt the system to any type of video, although the system is designed to summarize unstructured videos where the content is unknown. In another aspect, audio features can also be used to further summarize video with certain audio properties.

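The classify, cluster, and select steps named in the abstract can be sketched as follows. This is only an illustration under stated assumptions, not the patented method: the `Segment` fields, the motion threshold, the greedy clustering rule, and the "keep the longest segment per cluster" choice are all hypothetical, since the abstract does not specify them.

```python
from dataclasses import dataclass
from math import dist


@dataclass
class Segment:
    start: float        # seconds
    end: float          # seconds
    appearance: tuple   # hypothetical appearance feature vector
    motion: float       # hypothetical scalar motion magnitude


def classify(seg, motion_threshold=0.5):
    """Label a segment as dynamic or static (threshold is an assumption)."""
    return "dynamic" if seg.motion >= motion_threshold else "static"


def cluster(segments, radius=0.3):
    """Greedy clustering: a segment joins the first cluster whose seed has the
    same class and lies within `radius` in appearance space."""
    clusters = []
    for seg in segments:
        for members in clusters:
            seed = members[0]
            same_class = classify(seed) == classify(seg)
            if same_class and dist(seed.appearance, seg.appearance) <= radius:
                members.append(seg)
                break
        else:
            clusters.append([seg])
    return clusters


def summarize(segments, radius=0.3):
    """Keep one segment per cluster (the longest) in chronological order."""
    picks = [max(c, key=lambda s: s.end - s.start) for c in cluster(segments, radius)]
    return sorted(picks, key=lambda s: s.start)
```

Both `motion_threshold` and `radius` are exposed as parameters, mirroring the abstract's remark that the parameters of each step can be altered to suit the type of video.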

    Systems and methods for reducing speech intelligibility while preserving environmental sounds
    2.
    Granted invention patent
    Systems and methods for reducing speech intelligibility while preserving environmental sounds (status: active)

    Publication No.: US08140326B2

    Publication date: 2012-03-20

    Application No.: US12135131

    Filing date: 2008-06-06

    IPC classification: G10L21/02 G10L19/00 G10L13/06

    Abstract: An audio privacy system reduces the intelligibility of speech in an audio signal while preserving prosodic information, such as pitch, relative energy and intonation, so that a listener has the ability to recognize environmental sounds but not the speech itself. An audio signal is processed to separate non-vocalic information, such as pitch and relative energy of speech, from vocalic regions, after which syllables are identified within the vocalic regions. Representations of the vocalic regions are computed to produce a vocal tract transfer function and an excitation. The vocal tract transfer function for each syllable is then replaced with the vocal tract transfer function from another prerecorded vocalic sound. In one aspect, the identity of the replacement vocalic sound is independent of the identity of the syllable being replaced. A modified audio signal is then synthesized with the original prosodic information and the modified vocal tract transfer function to produce unintelligible speech that preserves the pitch and energy of the speech as well as environmental sounds.

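The replacement step in the abstract can be pictured at the data-structure level as below. This is a rough sketch under assumptions: the `Syllable` record, the tuple representation of a vocal tract transfer function, and the random choice from a `replacement_bank` are hypothetical, and the signal analysis and synthesis stages are not shown.

```python
import random
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Syllable:
    vocal_tract: tuple   # hypothetical filter coefficients for the syllable
    pitch_hz: float      # prosodic information, kept unchanged
    energy: float        # prosodic information, kept unchanged


def obscure(syllables, replacement_bank, rng):
    """Swap each syllable's vocal tract for one from a prerecorded bank.

    The replacement is drawn without inspecting the syllable being replaced,
    so its identity is independent of the original; pitch and energy survive.
    """
    obscured = []
    for syl in syllables:
        new_tract = rng.choice(replacement_bank)
        obscured.append(replace(syl, vocal_tract=new_tract))
    return obscured
```

Passing an explicit `random.Random` instance as `rng` keeps the sketch reproducible in tests.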

    SYSTEMS AND METHODS FOR INTERACTIVE FORM FILLING
    3.
    Patent application
    SYSTEMS AND METHODS FOR INTERACTIVE FORM FILLING (status: under examination, published)

    Publication No.: US20120063684A1

    Publication date: 2012-03-15

    Application No.: US12878972

    Filing date: 2010-09-09

    IPC classification: G06K9/34

    Abstract: Systems and methods for interactive, user-driven detection, creation and completion of form fields in a digital document are provided. A document with form fields that require completion by a user is received, after which form fields are detected at the direction of the user. Once the user selects a possible form field, the system creates the appropriate fillable form field based on size, type, location, related text and other parameters of the form field and surrounding document. Additional levels of interaction include predictive text, pattern development and automatic completion of previously completed fields.


    SYSTEMS AND METHODS FOR REDUCING SPEECH INTELLIGIBILITY WHILE PRESERVING ENVIRONMENTAL SOUNDS
    4.
    Patent application
    SYSTEMS AND METHODS FOR REDUCING SPEECH INTELLIGIBILITY WHILE PRESERVING ENVIRONMENTAL SOUNDS (status: active)

    Publication No.: US20090306988A1

    Publication date: 2009-12-10

    Application No.: US12135131

    Filing date: 2008-06-06

    IPC classification: G10L13/00

    Abstract: An audio privacy system reduces the intelligibility of speech in an audio signal while preserving prosodic information, such as pitch, relative energy and intonation, so that a listener has the ability to recognize environmental sounds but not the speech itself. An audio signal is processed to separate non-vocalic information, such as pitch and relative energy of speech, from vocalic regions, after which syllables are identified within the vocalic regions. Representations of the vocalic regions are computed to produce a vocal tract transfer function and an excitation. The vocal tract transfer function for each syllable is then replaced with the vocal tract transfer function from another prerecorded vocalic sound. In one aspect, the identity of the replacement vocalic sound is independent of the identity of the syllable being replaced. A modified audio signal is then synthesized with the original prosodic information and the modified vocal tract transfer function to produce unintelligible speech that preserves the pitch and energy of the speech as well as environmental sounds.


    System and method for supporting document navigation on mobile devices using segmentation and keyphrase summarization
    5.
    Granted invention patent
    System and method for supporting document navigation on mobile devices using segmentation and keyphrase summarization (status: active)

    Publication No.: US08601393B2

    Publication date: 2013-12-03

    Application No.: US12242757

    Filing date: 2008-09-30

    IPC classification: G06F17/00 G06F3/00

    CPC classification: G06F3/0481

    Abstract: Described is a system that characterizes segments of a document with one or more keyphrases and then uses the keyphrases to help users find interesting parts of a document. The keyphrases are displayed with information about the location of the phrase in the document and are used as pointers to quickly move from an overview to a section of potential interest.

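One way to picture keyphrases acting as pointers into document segments is sketched below. The abstract does not say how keyphrases are extracted, so this sketch substitutes a simple TF-IDF ranking over single words as a stand-in; the stopword list, the tokenizer, and the function names are all assumptions.

```python
import math
import re
from collections import Counter

# Small illustrative stopword list (an assumption, not from the source).
STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "are", "for", "on", "with"}


def tokenize(text):
    """Lowercase the text and keep alphabetic words that are not stopwords."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]


def keyphrases(segments, top_k=2):
    """Return (segment_index, top words) pairs, ranking words by TF-IDF.

    The segment index plays the role of the location pointer: selecting a
    keyphrase in an overview would jump to that segment.
    """
    counts = [Counter(tokenize(s)) for s in segments]
    doc_freq = Counter(w for c in counts for w in c)
    n = len(segments)

    def score(counter, word):
        idf = math.log((1 + n) / (1 + doc_freq[word]))
        return counter[word] * idf

    result = []
    for index, counter in enumerate(counts):
        # Sort by descending score, breaking ties alphabetically.
        ranked = sorted(counter, key=lambda w: (-score(counter, w), w))
        result.append((index, ranked[:top_k]))
    return result
```

A real system would likely rank multi-word phrases rather than single words; single words keep the sketch short.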

    SYSTEMS AND METHODS OF GENERATING USE-BASED PRODUCT SEARCHING
    6.
    Patent application
    SYSTEMS AND METHODS OF GENERATING USE-BASED PRODUCT SEARCHING (status: under examination, published)

    Publication No.: US20120209751A1

    Publication date: 2012-08-16

    Application No.: US13025960

    Filing date: 2011-02-11

    IPC classification: G06Q30/00

    CPC classification: G06Q30/06

    Abstract: Systems and methods are directed to use-based product searching. Raw product information data, such as product features, specifications, and user reviews, are processed and analyzed using pattern-based text analysis to extract relevant product aspects and uses. The aspects are weighted in relation to their importance for various uses, and the corresponding aspects and their weights are linked to the uses. A user selects uses for a product, which correspond to weighted aspects, and products are ranked using the weights of the aspects linked to the selected uses. The ranked products are presented to the user in a customizable interface. The user may directly specify weights for the extracted aspects to further customize the ranked list of products. The interface provides additional options for viewing product details, opinions and comparisons.

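The ranking step described in the abstract can be sketched as a weighted sum. This is a minimal illustration under assumptions: the dictionary layouts, the additive combination of weights across selected uses, and the rule that user-specified weights override extracted ones are hypothetical choices that the abstract does not spell out.

```python
def rank_products(products, use_weights, selected_uses, user_weights=None):
    """Rank product names by the weighted sum of their aspect scores.

    products:      {product_name: {aspect: score}}
    use_weights:   {use: {aspect: weight}}
    selected_uses: uses chosen by the user
    user_weights:  optional {aspect: weight} overrides set directly by the user
    """
    # Combine the aspect weights linked to every selected use.
    combined = {}
    for use in selected_uses:
        for aspect, weight in use_weights[use].items():
            combined[aspect] = combined.get(aspect, 0.0) + weight

    # Direct user weights take precedence over the extracted ones.
    if user_weights:
        combined.update(user_weights)

    def score(name):
        aspects = products[name]
        return sum(w * aspects.get(a, 0.0) for a, w in combined.items())

    # Highest score first; ties broken by name for a stable order.
    return sorted(products, key=lambda name: (-score(name), name))
```

For example, with hypothetical uses "travel" and "gaming", selecting "travel" would favour products that score well on whichever aspects carry the most weight for travel.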

    EFFICIENT TRACKING MULTIPLE OBJECTS THROUGH OCCLUSION
    8.
    Patent application
    EFFICIENT TRACKING MULTIPLE OBJECTS THROUGH OCCLUSION (status: under examination, published)

    Publication No.: US20090002489A1

    Publication date: 2009-01-01

    Application No.: US11771626

    Filing date: 2007-06-29

    IPC classification: H04N7/18 G06K9/62

    Abstract: Visual tracking of multiple objects in a crowded scene is critical for many applications, including surveillance, video conferencing and human-computer interaction. Complex interactions between objects result in partial or significant occlusions, making tracking a highly challenging problem. Presented is a novel, efficient approach to tracking a varying number of objects through occlusion. Object tracking during occlusion is posed as a track-based segmentation problem in the joint-object space. Appearance models are used to interpret the foreground into multiple-layer probabilistic masks in a Bayesian framework. The search for the optimal segmentation solution is performed with a greedy search algorithm and integral images for real-time computation. Promising results on several challenging video surveillance sequences have been demonstrated.

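The abstract mentions integral images as the device that makes the search fast enough for real time. The sketch below shows only that generic technique (a summed-area table with constant-time box sums), not the tracker itself; how the patent applies it to the probabilistic masks is not stated, and the function names are assumptions.

```python
def integral_image(img):
    """Build a summed-area table with one extra row and column of zeros.

    ii[y][x] holds the sum of img[0:y][0:x], so any box sum needs four lookups.
    """
    height, width = len(img), len(img[0])
    ii = [[0] * (width + 1) for _ in range(height + 1)]
    for y in range(height):
        row_sum = 0
        for x in range(width):
            row_sum += img[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def box_sum(ii, top, left, bottom, right):
    """Sum of the pixels in rows top..bottom-1 and columns left..right-1."""
    return ii[bottom][right] - ii[top][right] - ii[bottom][left] + ii[top][left]
```

Once the table is built, scoring a candidate bounding box against a mask costs the same regardless of box size, which is what makes evaluating many candidates in a greedy search cheap.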