Speech recognition employing key word modeling and non-key word modeling
    2.
    发明授权
    Speech recognition employing key word modeling and non-key word modeling 失效
    语音识别采用关键词建模和非关键词建模

    公开(公告)号:US5509104A

    公开(公告)日:1996-04-16

    申请号:US132430

    申请日:1993-10-06

    IPC分类号: G10L15/00 G10L15/14 G10L5/00

    CPC分类号: G10L15/142 G10L2015/088

    摘要: Speaker independent recognition of small vocabularies, spoken over the long distance telephone network, is achieved using two types of models, one type for defined vocabulary words (e.g., collect, calling-card, person, third-number and operator), and one type for extraneous input which ranges from non-speech sounds to groups of non-vocabulary words (e.g. `I want to make a collect call please`). For this type of key word spotting, modifications are made to a connected word speech recognition algorithm based on state-transitional (hidden Markov) models which allow it to recognize words from a pre-defined vocabulary list spoken in an unconstrained fashion. Statistical models of both the actual vocabulary words and the extraneous speech and background noises are created. A syntax-driven connected word recognition system is then used to find the best sequence of extraneous input and vocabulary word models for matching the actual input speech.

    摘要翻译: 使用两种类型的模型来实现对长途电话网络上的小词汇的独立识别,一种用于定义的词汇单词(例如,收集,呼叫卡,人,第三号码和运营商)的模型,以及一种类型 对于从非语音声音到非词汇单词组(例如“我想要收集电话”)的无关输入。 对于这种类型的关键词发现,对基于状态转换(隐马尔科夫)模型的连接词语音识别算法进行修改,这允许其识别来自以无限制方式说明的预定义词汇列表中的单词。 创建实际词汇单词和无关语音和背景噪声的统计模型。 然后使用语法驱动的连接词识别系统来找到用于匹配实际输入语音的外来输入和词汇词模型的最佳序列。

    Method and apparatus for generating speech pattern templates
    4.
    发明授权
    Method and apparatus for generating speech pattern templates 失效
    用于生成语音模式模板的方法和装置

    公开(公告)号:US4454586A

    公开(公告)日:1984-06-12

    申请号:US322748

    申请日:1981-11-19

    CPC分类号: G10L15/12

    摘要: A system for generating speech pattern templates for use with either speech recognition or speech synthesis. Reference demisyllable templates are first generated from a reference first speaker using both manual and automatic analysis. The analysis for a second speaker is simplified and automated by comparing with the first speaker's templates. The second speaker speaks the same words at a rate time-warped to match the first speakers rate and template. We define a demisyllable as each of the two halves of a syllable, assuming a syllable starts and ends with a noisy consonant, and the syllable is split at its vowel center, thereby simplifying concatenation and comparison. Key features of the invention include generating a set of signals representative of the time alignment between the first and second speaker's templates, and the time-of-occurence boundaries of each syllable in a word.

    摘要翻译: 一种用于产生用于语音识别或语音合成的语音模式模板的系统。 首先使用手动和自动分析从参考的第一个扬声器生成参考分解模板。 通过与第一个演讲人的模板进行比较,简化和自动化第二个演讲者的分析。 第二个发言人以相同的时间说出一致的话,以匹配第一个演讲者的速度和模板。 我们将一个分音节定义为音节的两个半部分,假设音节开始和结尾是一个嘈杂的辅音,并且音节在其元音中心分裂,从而简化了连接和比较。 本发明的主要特征包括产生一组代表第一和第二说话者模板之间的时间对准的信号以及一个单词中每个音节的发生时间边界。

    Spoken word controlled automatic dialer
    5.
    发明授权
    Spoken word controlled automatic dialer 失效
    口语字自动拨号器

    公开(公告)号:US4348550A

    公开(公告)日:1982-09-07

    申请号:US128842

    申请日:1980-06-09

    CPC分类号: H04M1/271 G10L15/22

    摘要: A speech controlled dialing circuit identifies input utterances which may be a command word (mode select), repertory word (dialing name or number), or non-recognized ("Other"). Responsive to the identification of each occurring input utterance, a set of predetermined templates are selected to identify the next occuring utterance. A programmed microprocessor system is described to implement the main controller function.

    摘要翻译: 语音控制拨号电路识别可以是命令字(模式选择),汇总字(拨号名称或号码)或未识别(“其他”)的输入话语。 响应于每个发生的输入话语的识别,选择一组预定模板以识别下一个发生的话语。 描述了编程的微处理器系统来实现主控制器功能。

    Automatic speech recognizer
    6.
    发明授权
    Automatic speech recognizer 失效
    自动语音识别器

    公开(公告)号:US5329608A

    公开(公告)日:1994-07-12

    申请号:US108839

    申请日:1993-08-18

    CPC分类号: G10L15/187

    摘要: Apparatus and method for recording data in a speech recognition system and recognizing spoken data corresponding to the recorded data. The apparatus and method responds to entered data by generating a string of phonetic transcriptions from the entered data. The data and generated phonetic transcription string associated therewith is recorded in a vocabulary lexicon of the speech recognition system. The apparatus and method responds to receipt of spoken data by constructing a model of subwords characteristic of the spoken data and compares the constructed subword model with ones of the recorded lexicon vocabulary recorded phonetic transcription strings to recognize the spoken data as the data identified by and associated with a phonetic transcription string matching the constructed subword string.

    摘要翻译: 用于在语音识别系统中记录数据并识别对应于记录数据的口语数据的装置和方法。 该装置和方法通过从输入的数据生成一串语音转录来响应输入的数据。 与其相关联的数据和产生的语音转录串被记录在语音识别系统的词汇词典中。 该装置和方法通过构建口语数据特征的子词的模型来响应口头数据的接收,并将构建的子词模型与所记录的词典词汇记录的语音转录字符串进行比较,以将口语数据识别为由相关联的 具有与构造的子字符串匹配的语音转录字符串。

    Endpoint detector
    7.
    发明授权
    Endpoint detector 失效
    端点检测器

    公开(公告)号:US4821325A

    公开(公告)日:1989-04-11

    申请号:US669654

    申请日:1984-11-08

    IPC分类号: G10L11/02 G10L5/00

    CPC分类号: G10L25/87

    摘要: An arrangement for endpoint detection improves speech recognition accuracy where the input signal includes nonstationary noise. Energy pulses are found by looking for local energy level peaks, then analyzing surrounding energy levels to determine pulse boundaries. Energy pulses are combined according to predetermined criteria to form longer pulses corresponding to words or phrases in the input signal.

    摘要翻译: 用于端点检测的布置改善了语音识别精度,其中输入信号包括非平稳噪声。 通过寻找局部能级峰值,然后分析周围的能级以确定脉冲边界,找到能量脉冲。 根据预定标准组合能量脉冲以形成对应于输入信号中的单词或短语的较长脉冲。

    Word recognizer
    8.
    发明授权
    Word recognizer 失效
    字识别器

    公开(公告)号:US4400828A

    公开(公告)日:1983-08-23

    申请号:US248547

    申请日:1981-03-27

    CPC分类号: G10L15/12 G10L15/00

    摘要: An input word is recognized as one of a set of reference words. A set of word distance signals representative of the correspondence of the input word to the reference words is generated. A set of weighted word distance signals is also generated. Responsive to the word distance signals and the weighted word distance signals, the reference word that most closely corresponds to the input word is selected.

    摘要翻译: 输入字被识别为一组参考词之一。 产生表示输入字与参考字的对应关系的一组字距离信号。 还产生一组加权字距离信号。 响应于字距离信号和加权字距离信号,选择最接近输入字的参考字。