Text-to-speech using clustered context-dependent phoneme-based units
    51.
    Patent type: Granted invention patent
    Legal status: Expired

    Publication number: US6163769A

    Publication date: 2000-12-19

    Application number: US949138

    Filing date: 1997-10-02

    IPC classification: G10L13/06 G10L13/00

    CPC classification: G10L13/07

    Abstract: A text-to-speech system includes a storage device for storing a clustered set of context-dependent phoneme-based units of a target speaker. In one embodiment, decision trees are used, wherein each decision-tree-based context-dependent phoneme-based unit is arranged based on the context of at least one immediately preceding and succeeding phoneme. At least one of the context-dependent phoneme-based units represents other, non-stored context-dependent phoneme-based units that sound similar because their contexts are similar. A text analyzer obtains a string of phonetic symbols representative of the text to be converted to speech. A concatenation module selects stored decision-tree-based context-dependent phoneme-based units from the set of decision-tree-based context-dependent phoneme-based units based on the context of the phonetic symbols, and synthesizes the selected phoneme-based units to generate speech corresponding to the text.
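
    The selection step described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the phoneme classes, the tree questions, the stored "waveform" values, and the names `Leaf`, `Node`, `TREES`, `select_unit`, and `synthesize` are all hypothetical, and the units are joined by plain concatenation with no smoothing.

```python
from dataclasses import dataclass
from typing import Callable, List, Union

# Hypothetical phoneme classes used by the tree questions.
NASALS = {"m", "n", "ng"}
VOWELS = {"aa", "ae", "ih", "iy", "uw"}


@dataclass
class Leaf:
    unit: List[float]  # stored samples for one clustered unit (toy values)


@dataclass
class Node:
    question: Callable[[str, str], bool]  # asks about (left, right) context
    yes: Union["Node", Leaf]
    no: Union["Node", Leaf]


def select_unit(tree: Union[Node, Leaf], left: str, right: str) -> List[float]:
    """Walk one phoneme's decision tree down to the leaf for this context."""
    node = tree
    while isinstance(node, Node):
        node = node.yes if node.question(left, right) else node.no
    return node.unit


# One tree per phoneme (assumed layout). Many contexts share a leaf, so a
# context that was never stored still resolves to a similar-sounding unit.
TREES = {
    "ae": Node(
        lambda left, right: right in NASALS,
        Leaf([0.1, 0.2]),
        Node(lambda left, right: left in VOWELS, Leaf([0.3]), Leaf([0.4, 0.5])),
    ),
    "k": Leaf([0.9]),
    "t": Leaf([0.7]),
    "n": Leaf([0.6]),
}


def synthesize(phonemes: List[str]) -> List[float]:
    """Select a unit per phoneme from its context and concatenate them."""
    samples: List[float] = []
    for i, phoneme in enumerate(phonemes):
        left = phonemes[i - 1] if i > 0 else "sil"
        right = phonemes[i + 1] if i + 1 < len(phonemes) else "sil"
        samples.extend(select_unit(TREES[phoneme], left, right))
    return samples
```

    In this sketch, "ae" followed by "m" and "ae" followed by "n" reach the same leaf, which mirrors the idea of one stored unit standing in for other, non-stored units with similar contexts.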

Method and apparatus for automatically invoking a new word module for unrecognized user input
    52.
    Patent type: Granted invention patent
    Legal status: Expired

    Publication number: US5852801A

    Publication date: 1998-12-22

    Application number: US538919

    Filing date: 1995-10-04

    IPC classification: G10L15/18 G10L15/22 G01L5/06

    Abstract: A method for reducing recognition errors in a speech recognition system. The system has a user interface that instructs the user to invoke a new word acquisition module upon a predetermined condition, and the method improves the recognition accuracy for poorly recognized words. The user interface suggests to the user which unrecognized words may be new words that should be added to the recognition program lexicon. It advises the user to enter into a new word lexicon any words that fail to appear in the alternative word list for two consecutive tries. The invention also provides a method for improving the recognition accuracy of poorly recognized words via language model adaptation. It increases the unigram probability of an unrecognized word in proportion to the score difference between the unrecognized word and the top-ranked word, to guarantee recognition of the same word in a subsequent try. If the score of the unrecognized word is unknown (i.e., the word is not in the alternative word list), the unigram probability of the unrecognized word is increased in proportion to the difference between the score of the top-ranked word and the smallest score in the alternative list.
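
    The language model adaptation described in the abstract can be illustrated with a small sketch. Several details here are assumptions that the abstract does not specify: scores are treated as higher-is-better numbers, the boost is added directly to the probability with a hypothetical proportionality constant `rate`, and the distribution is renormalized afterwards. The function name `adapt_unigram` is also hypothetical.

```python
from typing import Dict, List, Tuple


def adapt_unigram(
    unigram: Dict[str, float],
    alternatives: List[Tuple[str, float]],
    target: str,
    rate: float = 0.1,  # assumed proportionality constant
) -> Dict[str, float]:
    """Boost the unigram probability of a misrecognized word.

    alternatives holds (word, score) pairs from the recognizer's
    alternative word list; higher scores are assumed to be better.
    """
    scores = dict(alternatives)
    top_score = max(scores.values())
    if target in scores:
        # Score known: use the gap between the top word and the target.
        gap = top_score - scores[target]
    else:
        # Score unknown: use the gap between the top and the smallest score.
        gap = top_score - min(scores.values())

    updated = dict(unigram)
    updated[target] = updated.get(target, 0.0) + rate * gap

    # Renormalize so the result is still a probability distribution
    # (an assumption of this sketch).
    total = sum(updated.values())
    return {word: prob / total for word, prob in updated.items()}
```

    For example, with `unigram = {"write": 0.5, "right": 0.4, "rite": 0.1}` and an alternative list of `[("right", 10.0), ("write", 8.0)]`, calling `adapt_unigram(unigram, alternatives, "write")` raises the share of "write", while calling it for "rite" (which is absent from the list) uses the gap between the top and the smallest listed scores.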
