Speaker authentication by fusion of voiceprint match attempt results with additional information
    11.
    Granted patent (status: in force)

    Publication number: US07240007B2

    Publication date: 2007-07-03

    Application number: US10392549

    Filing date: 2003-03-20

    CPC classification number: G10L15/24

    Abstract: A speaker authentication system includes a data fuser operable to fuse voiceprint match attempt results with additional information to assist in authenticating a speaker providing audio input. In other aspects, the system includes a data store of speaker voiceprints and a voiceprint matching module adapted to receive an audio input and operable to attempt to assist in authenticating a speaker by matching the audio input to at least one of the speaker voiceprints. The voiceprint matching module adjusts a confidence of voiceprint match attempt results by at least one of: (a) a number of utterance repetitions upon which a matching speaker voiceprint has been trained; or (b) a passage of time since a training occurrence associated with a matching speaker voiceprint.
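
The confidence adjustment and fusion described in this abstract can be illustrated with a minimal sketch. The abstract does not give any formula, so the saturation count, the half-life decay, and the weighted-average fusion below are assumptions chosen for illustration, and every name in the block is hypothetical.

```python
def adjusted_confidence(raw_score, repetitions, days_since_training,
                        saturation=5, half_life_days=180.0):
    """Scale a raw voiceprint match score in [0, 1].

    Assumed rule (not from the patent): trust grows with the number of
    training repetitions up to `saturation`, and halves every
    `half_life_days` since the voiceprint was last trained.
    """
    repetition_factor = min(repetitions, saturation) / saturation
    age_factor = 0.5 ** (days_since_training / half_life_days)
    return raw_score * repetition_factor * age_factor


def fuse(voice_confidence, extra_confidence, voice_weight=0.7):
    """Fuse the voiceprint confidence with one additional information
    source using a weighted average (an assumed fusion rule)."""
    return voice_weight * voice_confidence + (1.0 - voice_weight) * extra_confidence
```

For example, a raw score of 0.8 from a voiceprint trained on five repetitions 180 days ago would be halved under these assumed parameters before being fused with the additional information.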

    Simultaneous support of isolated and connected phrase command recognition in automatic speech recognition systems
    12.
    Patent application (status: in force)

    Publication number: US20070150288A1

    Publication date: 2007-06-28

    Application number: US11312785

    Filing date: 2005-12-20

    CPC classification number: G10L15/22 G10L2015/228

    Abstract: A system for operating one or more devices using speech input including a receiver for receiving a speech input, a controller in communication with said receiver, software executing on said controller for converting the speech input into computer-readable data, software executing on said controller for generating a table of active commands, the table including a portion of all valid commands of the system, software executing on said controller for identifying at least one active command represented by the data, and software executing on said controller for transmitting the active command to at least one device operable by the active command.
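
The active command table in this abstract can be sketched as a filtered view of all valid commands. The command phrases, device names, and substring matching below are hypothetical simplifications: the sketch assumes the speech input has already been converted to text and does not model the recognizer itself.

```python
# Hypothetical full command set: phrase -> device it operates.
VALID_COMMANDS = {
    "light on": "lamp",
    "light off": "lamp",
    "table up": "table",
    "table down": "table",
}


def build_active_table(valid_commands, active_devices):
    """Keep only the portion of the valid commands that is currently active."""
    return {phrase: device for phrase, device in valid_commands.items()
            if device in active_devices}


def identify_commands(recognized_text, active_table):
    """Return (phrase, device) pairs found in the text, in spoken order.

    A single phrase models an isolated command; several phrases in one
    utterance model connected commands.
    """
    hits = []
    for phrase, device in active_table.items():
        start = recognized_text.find(phrase)
        if start != -1:
            hits.append((start, phrase, device))
    return [(phrase, device) for _, phrase, device in sorted(hits)]
```

Restricting matching to the active table means a phrase for an inactive device is ignored even when it is a valid command of the system.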

    Technique for developing discriminative sound units for speech recognition and allophone modeling
    13.
    Granted patent (status: in force)

    Publication number: US06711541B1

    Publication date: 2004-03-23

    Application number: US09390434

    Filing date: 1999-09-07

    CPC classification number: G10L15/063 G10L2015/025

    Abstract: A set of models is developed to represent sound units and these models are then used with the incorrect sound units to determine which generate high likelihood scores. The models generating high likelihood scores for the incorrect sound units represent those that are more likely to be confused. The resulting confusability data may then be used in generating more discriminative speech models and in subsequent pruning of the acoustic decision tree. The confusability data may also be used to develop confusability predictors used for rejection during search and in developing continuous speech recognition models that are optimized to minimize confusability.

    Voice activated controller for recording and retrieving audio/video programs
    14.
    Granted patent (status: in force)

    Publication number: US06643620B1

    Publication date: 2003-11-04

    Application number: US09270262

    Filing date: 1999-03-15

    Abstract: The system includes a database of program records representing A/V programs which are available for recording. The system also includes an A/V recording device for receiving a recording command and recording the A/V program. A speech recognizer is provided for receiving the spoken request and translating the spoken request into a text stream having a plurality of words. A natural language processor receives the text stream and processes the words for resolving a semantic content of the spoken request. The natural language processor places the meaning of the words into a task frame having a plurality of key word slots. A dialogue system analyzes the task frame for determining if a sufficient number of key word slots have been filled and prompts the user for additional information for filling empty slots. The dialogue system searches the database of program records using the key words placed within the task frame for selecting the A/V program and generating the recording command for use by the A/V recording device.
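
The task frame and dialogue steps in this abstract can be sketched as follows. The slot names, the sample program records, the keyword spotting, and the prompt wording are all hypothetical; the sketch assumes the speech recognizer has already produced a list of words.

```python
# Hypothetical database of program records available for recording.
PROGRAMS = [
    {"title": "Evening News", "channel": "4", "time": "18:00"},
    {"title": "Nature Hour", "channel": "9", "time": "20:00"},
]


def fill_task_frame(words, programs):
    """Place recognized key words into the slots of a task frame."""
    frame = {"title": None, "channel": None}
    text = " ".join(words).lower()
    for program in programs:
        if program["title"].lower() in text:
            frame["title"] = program["title"]
    lowered = [word.lower() for word in words]
    if "channel" in lowered[:-1]:
        # The word following "channel" is taken as the channel value.
        frame["channel"] = words[lowered.index("channel") + 1]
    return frame


def next_action(frame, programs):
    """Prompt while too few slots are filled, else emit a recording command."""
    if frame["title"] is None and frame["channel"] is None:
        return ("prompt", "Which program would you like to record?")
    matches = [p for p in programs
               if frame["title"] in (None, p["title"])
               and frame["channel"] in (None, p["channel"])]
    if len(matches) != 1:
        return ("prompt", "Please name the program or its channel.")
    return ("record", matches[0])
```

Calling `next_action(fill_task_frame("please record nature hour".split(), PROGRAMS), PROGRAMS)` selects the matching record, while an utterance that fills no slot produces a prompt for more information.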

    Voice Control System with Multiple Microphone Arrays
    15.
    Patent application (status: in force)

    Publication number: US20160125882A1

    Publication date: 2016-05-05

    Application number: US14531798

    Filing date: 2014-11-03

    Abstract: A voice controlled medical system with improved speech recognition includes a first microphone array, a second microphone array, a controller in communication with the first and second microphone arrays, and a medical device operable by the controller. The controller includes a beam module that generates a first beamed signal using signals from the first microphone array and a second beamed signal using signals from the second microphone array. The controller also includes a comparison module that compares the first and second beamed signals and determines a correlation between the first and second beamed signals. The controller also includes a voice interpreting module that identifies commands within the first and second beamed signals if the correlation is above a correlation threshold. The controller also includes an instrument control module that executes the commands to operate said medical device.
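
The comparison step in this abstract can be sketched as a correlation gate between the two beamed signals. The abstract does not say which correlation measure is used, so the Pearson correlation and the 0.8 threshold below are assumptions; beamforming is not modeled, and both inputs are taken to be already-beamed, equal-length sample sequences.

```python
import math


def correlation(a, b):
    """Pearson correlation of two equal-length sample sequences."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # a constant signal carries no correlation information
    return cov / math.sqrt(var_a * var_b)


def should_interpret(beam_1, beam_2, threshold=0.8):
    """Pass the signals on for command interpretation only when the two
    beamed signals agree strongly enough (threshold is an assumed value)."""
    return correlation(beam_1, beam_2) > threshold
```

Two beams that carry the same waveform at different gains correlate at 1.0 and pass the gate, whereas unrelated signals fall below the threshold and are not interpreted.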

    Device control system employing extensible markup language for defining information resources
    16.
    Granted patent (status: in force)

    Publication number: US08037179B2

    Publication date: 2011-10-11

    Application number: US11555945

    Filing date: 2006-11-02

    CPC classification number: G06F17/2247

    Abstract: A device control system including at least one device operable by the system, at least one processor, software executing on the at least one processor for receiving message data and determining a corresponding XML document type, software executing on the at least one processor for generating a XML document based on the XML document type, the XML document including the message data, software executing on the processor for packetizing the XML document, and two or more communication components, each communication component including an XML parser for parsing the XML document and extracting the message data.
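
The generate, packetize, and parse flow in this abstract can be sketched with Python's standard `xml.etree.ElementTree` module. The document type name, the message fields, and the fixed-size byte packets are hypothetical; the sketch also assumes every message key is a valid XML element name.

```python
import xml.etree.ElementTree as ET


def build_document(doc_type, message_data):
    """Generate an XML document of the given type that wraps the message data."""
    root = ET.Element(doc_type)
    for key, value in message_data.items():
        ET.SubElement(root, key).text = str(value)
    return ET.tostring(root, encoding="unicode")


def packetize(xml_text, size=32):
    """Split the serialized document into fixed-size byte packets."""
    data = xml_text.encode("utf-8")
    return [data[i:i + size] for i in range(0, len(data), size)]


def parse_packets(packets):
    """Reassemble the packets, parse the XML, and extract the message data."""
    root = ET.fromstring(b"".join(packets).decode("utf-8"))
    return root.tag, {child.tag: child.text for child in root}
```

Each communication component would run something like `parse_packets` on its side, recovering both the document type and the original message fields as strings.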

    Audio, Visual and device data capturing system with real-time speech recognition command and control system
    17.
    Patent application (status: in force)

    Publication number: US20080062280A1

    Publication date: 2008-03-13

    Application number: US11519315

    Filing date: 2006-09-12

    CPC classification number: G06F19/321 G06F19/00 G06F19/3481 G10L15/26 G16H10/60

    Abstract: An audio, visual and device data capturing system including an audio recorder for recording audio data, at least one visual recorder for recording visual data, at least one device data recorder for receiving device data from at least one device in communication with the system, a speech recognition module for interpreting the audio data, a transcript module for generating transcript data from the interpreted audio data, a data capturing module for generating a data record including at least a portion of each of the audio data, the transcript data, the visual data and the device data, and at least one storage device for storing the data record.

    Speech recognition system with user profiles management component
    18.
    Patent application (status: in force)

    Publication number: US20070294081A1

    Publication date: 2007-12-20

    Application number: US11455248

    Filing date: 2006-06-16

    CPC classification number: G10L15/26

    Abstract: A speech recognition and control system continuously operable by two or more users including a receiver for receiving a speech input, a processor in communication with the receiver, a database in communication with the processor, the database including a plurality of user profiles, profile management software executing on the processor for determining an active profile from the plurality of user profiles, and software executing on the processor for identifying at least one command from the speech input based on the active profile.

    Method and system for intuitive text-to-speech synthesis customization
    19.
    Patent application (status: under examination, published)

    Publication number: US20050177369A1

    Publication date: 2005-08-11

    Application number: US10776892

    Filing date: 2004-02-11

    CPC classification number: G10L13/08

    Abstract: A system for tuning the text-to-speech conversion process having a text-to-speech engine that converts the input text into a processed text form which includes speech features. A visual editing interface displaying the processed text form using graphical indicators on an output device to allow a user to edit the text and graphical indicators to modify the speech features of the text input.

    Method for goal-oriented speech translation in hand-held devices using meaning extraction and dialogue
    20.
    Granted patent (status: in force)

    Publication number: US06233561B1

    Publication date: 2001-05-15

    Application number: US09290628

    Filing date: 1999-04-12

    CPC classification number: G10L15/1822 G10L15/1815

    Abstract: A computer-implemented method and apparatus is provided for processing a spoken request from a user. A speech recognizer converts the spoken request into a digital format. A frame data structure associates semantic components of the digitized spoken request with predetermined slots. The slots are indicative of data which are used to achieve a predetermined goal. A speech understanding module which is connected to the speech recognizer and to the frame data structure determines semantic components of the spoken request. The slots are populated based upon the determined semantic components. A dialog manager which is connected to the speech understanding module may determine at least one slot which is unpopulated based upon the determined semantic components and in a preferred embodiment may provide confirmation of the populated slots. A computer generated-request is formulated in order for the user to provide data related to the unpopulated slot. The method and apparatus are well-suited (but not limited) to use in a hand-held speech translation device.
