摘要:
During a training sequence, a speaker-trained speech recognizer detects and signals the speaker when vocabulary word pairs are potentially confusing to the recognizer. Each vocabulary word is converted into feature signals and then parameters representing a predetermined reference model of that word. The feature signals of a subsequent potential vocabulary word are compared against the reference model of each vocabulary word previously stored in the recognizer memory. The speaker is signaled when the potential vocabulary word is confusingly similar to one of the existing vocabulary words.