SYSTEMS AND METHODS FOR REDUCING SPEECH INTELLIGIBILITY WHILE PRESERVING ENVIRONMENTAL SOUNDS
    1.
    Patent application
    SYSTEMS AND METHODS FOR REDUCING SPEECH INTELLIGIBILITY WHILE PRESERVING ENVIRONMENTAL SOUNDS [Status: in force]

    Publication No.: US20090306988A1

    Publication date: 2009-12-10

    Application No.: US12135131

    Filing date: 2008-06-06

    IPC classification: G10L13/00

    Abstract: An audio privacy system reduces the intelligibility of speech in an audio signal while preserving prosodic information, such as pitch, relative energy and intonation, so that a listener can recognize environmental sounds but not the speech itself. An audio signal is processed to separate non-vocalic information, such as pitch and relative energy of speech, from vocalic regions, after which syllables are identified within the vocalic regions. Representations of the vocalic regions are computed to produce a vocal tract transfer function and an excitation. The vocal tract transfer function for each syllable is then replaced with the vocal tract transfer function from another prerecorded vocalic sound. In one aspect, the identity of the replacement vocalic sound is independent of the identity of the syllable being replaced. A modified audio signal is then synthesized with the original prosodic information and the modified vocal tract transfer function to produce unintelligible speech that preserves the pitch and energy of the speech as well as environmental sounds.

    Systems and methods for reducing speech intelligibility while preserving environmental sounds
    2.
    Granted patent
    Systems and methods for reducing speech intelligibility while preserving environmental sounds [Status: in force]

    Publication No.: US08140326B2

    Publication date: 2012-03-20

    Application No.: US12135131

    Filing date: 2008-06-06

    IPC classification: G10L21/02 G10L19/00 G10L13/06

    Abstract: An audio privacy system reduces the intelligibility of speech in an audio signal while preserving prosodic information, such as pitch, relative energy and intonation, so that a listener can recognize environmental sounds but not the speech itself. An audio signal is processed to separate non-vocalic information, such as pitch and relative energy of speech, from vocalic regions, after which syllables are identified within the vocalic regions. Representations of the vocalic regions are computed to produce a vocal tract transfer function and an excitation. The vocal tract transfer function for each syllable is then replaced with the vocal tract transfer function from another prerecorded vocalic sound. In one aspect, the identity of the replacement vocalic sound is independent of the identity of the syllable being replaced. A modified audio signal is then synthesized with the original prosodic information and the modified vocal tract transfer function to produce unintelligible speech that preserves the pitch and energy of the speech as well as environmental sounds.

    SYSTEMS AND METHODS FOR INTERACTIVE FORM FILLING
    3.
    Patent application
    SYSTEMS AND METHODS FOR INTERACTIVE FORM FILLING [Status: under examination, published]

    Publication No.: US20120063684A1

    Publication date: 2012-03-15

    Application No.: US12878972

    Filing date: 2010-09-09

    IPC classification: G06K9/34

    Abstract: Systems and methods for interactive, user-driven detection, creation and completion of form fields in a digital document are provided. A document with form fields that require completion by a user is received, after which form fields are detected at the direction of the user. Once the user selects a possible form field, the system creates the appropriate fillable form field based on size, type, location, related text and other parameters of the form field and surrounding document. Additional levels of interaction include predictive text, pattern development and automatic completion of previously completed fields.

    SYSTEM AND METHOD FOR VIDEO SUMMARIZATION
    4.
    Patent application
    SYSTEM AND METHOD FOR VIDEO SUMMARIZATION [Status: in force]

    Publication No.: US20090080853A1

    Publication date: 2009-03-26

    Application No.: US11860436

    Filing date: 2007-09-24

    IPC classification: G11B27/00

    Abstract: The subject invention relates to a system and method for video summarization, and more specifically to a system for segmenting and classifying data from a video in order to create a summary video that preserves and summarizes relevant content. In one embodiment, the system first extracts appearance, motion, and audio features from a video in order to create video segments corresponding to the extracted features. The video segments are then classified as dynamic or static depending on the appearance-based and motion-based features extracted from each video segment. The classified video segments are then grouped into clusters to eliminate redundant content. Video segments from each cluster are then selected as summary segments, and the summary segments are compiled to form a summary video. The parameters for any of the steps in the summarization of the video can be altered so that a user can adapt the system to any type of video, although the system is designed to summarize unstructured videos where the content is unknown. In another aspect, audio features can also be used to further summarize video with certain audio properties.

    System and method for video summarization
    5.
    Granted patent
    System and method for video summarization [Status: in force]

    Publication No.: US08200063B2

    Publication date: 2012-06-12

    Application No.: US11860436

    Filing date: 2007-09-24

    IPC classification: H04N9/80

    Abstract: The subject invention relates to a system and method for video summarization, and more specifically to a system for segmenting and classifying data from a video in order to create a summary video that preserves and summarizes relevant content. In one embodiment, the system first extracts appearance, motion, and audio features from a video in order to create video segments corresponding to the extracted features. The video segments are then classified as dynamic or static depending on the appearance-based and motion-based features extracted from each video segment. The classified video segments are then grouped into clusters to eliminate redundant content. Video segments from each cluster are then selected as summary segments, and the summary segments are compiled to form a summary video. The parameters for any of the steps in the summarization of the video can be altered so that a user can adapt the system to any type of video, although the system is designed to summarize unstructured videos where the content is unknown. In another aspect, audio features can also be used to further summarize video with certain audio properties.

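    The classify, cluster and select steps in the abstract above can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Segment` fields, the scalar motion feature, the two thresholds, the greedy clustering and the "keep the longest segment per cluster" rule are all assumptions made for illustration, and feature extraction is assumed to have happened already.

```python
from dataclasses import dataclass
from math import dist


@dataclass
class Segment:
    start: float       # segment start time, in seconds
    end: float         # segment end time, in seconds
    motion: float      # hypothetical scalar motion feature
    appearance: tuple  # hypothetical appearance feature vector

    @property
    def duration(self):
        return self.end - self.start


def classify(segment, motion_threshold=0.5):
    """Label a segment 'dynamic' or 'static' from its motion feature."""
    return "dynamic" if segment.motion >= motion_threshold else "static"


def cluster(segments, max_distance=1.0):
    """Greedy clustering: a segment joins the first cluster whose first
    member has a similar appearance, otherwise it starts a new cluster."""
    clusters = []
    for seg in segments:
        for group in clusters:
            if dist(seg.appearance, group[0].appearance) <= max_distance:
                group.append(seg)
                break
        else:
            clusters.append([seg])
    return clusters


def summarize(segments, motion_threshold=0.5, max_distance=1.0):
    """Keep one segment per cluster and return them in playback order."""
    summary = []
    for label in ("dynamic", "static"):
        same_class = [s for s in segments
                      if classify(s, motion_threshold) == label]
        for group in cluster(same_class, max_distance):
            # Assumed selection rule: keep the longest segment of the cluster.
            summary.append(max(group, key=lambda s: s.duration))
    return sorted(summary, key=lambda s: s.start)
```

    Exposing `motion_threshold` and `max_distance` as arguments mirrors the abstract's point that the parameters of each step can be altered to suit different kinds of video.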

    Efficient, user-friendly system to stream screens inside video using a mobile device
    6.
    Granted patent
    Efficient, user-friendly system to stream screens inside video using a mobile device [Status: in force]

    Publication No.: US08934024B2

    Publication date: 2015-01-13

    Application No.: US12687831

    Filing date: 2010-01-14

    Abstract: A system helps filter and correct video captured and streamed from a mobile device. In particular, the system detects and streams content shown on screens, allowing anyone to stream screen content immediately without needing to develop hooks into external software (i.e., without installing screen-recording software on the computer). The system can use a variety of user-selectable techniques to detect the screen, and utilizes the mobile device's touchscreen to allow users to manually override detected corners. Some of these approaches could also be applied to other types of content, such as TV screens, appliance LCD screens, other mobile devices' screens, and multifunction devices (MFDs); for example, a remote technician could help troubleshoot a malfunctioning MFD by having the end user point a cellphone at the MFD's LCD screen.

    System and method for detecting user actions in a video stream
    8.
    Patent application
    System and method for detecting user actions in a video stream [Status: in force]

    Publication No.: US20060090134A1

    Publication date: 2006-04-27

    Application No.: US10973198

    Filing date: 2004-10-26

    IPC classification: G06F9/00 G11B27/00

    Abstract: Embodiments of the present invention include a video server that can detect and track the image of a pointing indicator in an input video stream representation of a computer display. The video server checks ordered frames of the video signal and determines movements for a pointing indicator such as a mouse arrow. Certain motions by the pointing indicator, such as lingering over a button or menu item or circling a button or menu item, can trigger a control action on the server.

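    As a rough illustration of the "lingering over a button" case described above, the following Python sketch checks a sequence of already-tracked pointer positions for a dwell inside a target region. The function name, the region format and the `min_frames` and `max_jitter` parameters are hypothetical; the abstract does not say how lingering is measured, and locating the pointer in each frame is assumed to be done elsewhere.

```python
from math import hypot


def detect_linger(positions, region, min_frames=15, max_jitter=5.0):
    """Return the index of the frame at which a linger is first detected,
    or None if the pointer never lingers.

    positions: per-frame (x, y) pointer coordinates, in frame order.
    region: (left, top, right, bottom) bounds of a button or menu item.
    """
    left, top, right, bottom = region
    run_start = None  # index of the first frame of the current dwell
    for i, (x, y) in enumerate(positions):
        inside = left <= x <= right and top <= y <= bottom
        if inside and run_start is not None:
            ax, ay = positions[run_start]
            if hypot(x - ax, y - ay) > max_jitter:
                run_start = i  # pointer moved too far: restart the dwell here
        elif inside:
            run_start = i
        else:
            run_start = None
        if run_start is not None and i - run_start + 1 >= min_frames:
            return i
    return None
```

    A server could call a function like this once per tracked frame sequence and fire its control action when a frame index is returned; circling gestures would need a separate detector.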

    Systems and Methods for Instructional Video Navigation and Note Taking
    9.
    Patent application
    Systems and Methods for Instructional Video Navigation and Note Taking [Status: in force]

    Publication No.: US20140099071A1

    Publication date: 2014-04-10

    Application No.: US13647248

    Filing date: 2012-10-08

    IPC classification: H04N9/87

    Abstract: A method for navigating instructional video presentations is disclosed. The method includes determining a pause mode of a video presentation, and playing the video presentation on a display device. The video presentation has one or more predetermined pause positions. The method also includes, while playing the video presentation, determining that the video presentation has reached one of the one or more pause positions. The method further includes, in accordance with a determination that the video presentation is in a first pause mode, pausing the video presentation at the one of the one or more pause positions and maintaining a display of a paused frame of the video presentation, and, in accordance with a determination that the video presentation is in a second pause mode distinct from the first pause mode, continuing to play the video presentation through the one of the one or more pause positions.

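    The two pause modes described above amount to a single branch in the playback loop, as the small Python simulation below suggests. The `PauseMode` names, the whole-second tick and the `play` function are illustrative assumptions, not terms from the application.

```python
from enum import Enum


class PauseMode(Enum):
    AUTO_PAUSE = "pause at each pause position"  # the "first pause mode"
    PLAY_THROUGH = "ignore pause positions"      # the "second pause mode"


def play(duration, pause_positions, mode, start=0):
    """Simulate playback in whole-second ticks.

    Returns the position at which playback stopped: the next pause
    position in AUTO_PAUSE mode, otherwise the end of the presentation.
    """
    pauses = set(pause_positions)
    position = start
    while position < duration:
        position += 1
        if position in pauses and mode is PauseMode.AUTO_PAUSE:
            return position  # stop here and keep showing the paused frame
    return position
```

    Calling `play` again with `start` set to the returned position resumes playback from the paused frame, which is one way a player could step through an instructional video one pause position at a time.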

    DOCUMENT IMAGING WITH TARGETED ADVERTISING BASED ON DOCUMENT CONTENT ANALYSIS
    10.
    Patent application
    DOCUMENT IMAGING WITH TARGETED ADVERTISING BASED ON DOCUMENT CONTENT ANALYSIS [Status: under examination, published]

    Publication No.: US20100145808A1

    Publication date: 2010-06-10

    Application No.: US12329856

    Filing date: 2008-12-08

    IPC classification: G06Q30/00 G06Q20/00

    Abstract: A method and system for delivering targeted advertisements via multifunction document imaging devices are provided. Imaging devices used for copying, scanning, faxing and printing documents are used to deliver advertisements, coupons, and other promotional material to users. The imaging device is capable of delivering targeted promotional material based on analysis of the content of the documents passing through the device. Targeting is based on device history, user history or user demographics. Device history and user history are compiled from the contents of the documents processed respectively at a device and by a user. Demographics are inferred from a demographics model using user identity or document content input to the model. Advertisements may be delivered via paper, the device display, and other means.

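    One simple way to picture history-based targeting is keyword counting, sketched below in Python. This is only an assumed stand-in for the content analysis the abstract mentions: the `History` class, the word-count representation, the keyword lists and the scoring rule are all hypothetical, and the demographics model is not covered.

```python
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase alphabetic words."""
    return re.findall(r"[a-z]+", text.lower())


class History:
    """Running word counts over documents processed at a device or by a user."""

    def __init__(self):
        self.counts = Counter()

    def add_document(self, text):
        self.counts.update(tokenize(text))


def pick_ad(history, ads):
    """Return the name of the best-matching ad, or None if nothing matches.

    ads: mapping of ad name to a list of lowercase keywords.
    """
    scores = {name: sum(history.counts[k] for k in keywords)
              for name, keywords in ads.items()}
    best = max(scores, key=scores.get, default=None)
    return best if best is not None and scores[best] > 0 else None
```

    Keeping one `History` per device and one per user would correspond to the abstract's distinction between device history and user history.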