Abstract:
A text-to-speech system includes a storage device for storing a clustered set of context-dependent phoneme-based units of a target speaker. In one embodiment, decision trees are used, wherein each decision-tree-based context-dependent phoneme-based unit is arranged based on the context of at least one immediately preceding and succeeding phoneme. At least one of the context-dependent phoneme-based units represents other, non-stored context-dependent phoneme units that sound similar because their contexts are similar. A text analyzer obtains a string of phonetic symbols representative of text to be converted to speech. A concatenation module selects stored decision-tree-based context-dependent phoneme-based units from the set of decision-tree-based context-dependent phoneme-based units based on the context of the phonetic symbols, and synthesizes the selected phoneme-based units to generate speech corresponding to the text.
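The selection step described in this abstract can be illustrated with a minimal sketch. It assumes one small decision tree per phoneme whose questions look only at the immediately preceding and succeeding phoneme; the tree contents, the question functions, and the unit identifiers such as "t_1" are hypothetical and are not taken from the patent. Waveform concatenation is not shown.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Union


@dataclass
class Leaf:
    unit_id: str  # hypothetical identifier of one stored, clustered unit


@dataclass
class Node:
    # A yes/no question about the (left, right) phoneme context.
    question: Callable[[Optional[str], Optional[str]], bool]
    yes: Union["Node", Leaf]
    no: Union["Node", Leaf]


VOWELS = {"aa", "ae", "ih", "iy"}  # illustrative phoneme class


def left_is_vowel(left: Optional[str], right: Optional[str]) -> bool:
    return left in VOWELS


def right_is_nasal(left: Optional[str], right: Optional[str]) -> bool:
    return right in {"m", "n"}


# One tree per phoneme (assumed layout). Each leaf stands for every
# context that reaches it, including contexts never stored on their own.
TREES: Dict[str, Union[Node, Leaf]] = {
    "t": Node(left_is_vowel, Leaf("t_1"),
              Node(right_is_nasal, Leaf("t_2"), Leaf("t_3"))),
    "ae": Leaf("ae_1"),
    "k": Leaf("k_1"),
}


def select_units(phonemes: List[str]) -> List[str]:
    """Walk each phoneme's tree using its neighbours as context."""
    units = []
    for i, phoneme in enumerate(phonemes):
        left = phonemes[i - 1] if i > 0 else None
        right = phonemes[i + 1] if i + 1 < len(phonemes) else None
        node = TREES[phoneme]
        while isinstance(node, Node):
            node = node.yes if node.question(left, right) else node.no
        units.append(node.unit_id)
    return units
```

In this sketch, `select_units(["k", "ae", "t"])` picks the "t" unit clustered for a preceding vowel, while a word-initial "t" falls through to a different leaf. A real system would then concatenate the stored waveform or parameter data for the selected units.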
Abstract:
A method is provided for reducing recognition errors in a speech recognition system that has a user interface which instructs the user to invoke a new word acquisition module upon a predetermined condition; the method also improves the recognition accuracy for poorly recognized words. The user interface of the present invention suggests to the user which unrecognized words may be new words that should be added to the recognition program lexicon. The user interface advises the user to enter into a new word lexicon any word that fails to appear in the alternative word list for two consecutive tries. A method to improve the recognition accuracy for poorly recognized words via language model adaptation is also provided by the present invention. The present invention increases the unigram probability of an unrecognized word in proportion to the score difference between the unrecognized word and the top-ranked word, to guarantee recognition of the same word in a subsequent try. If the score of the unrecognized word is unknown (i.e., the word is not in the alternative word list), the present invention increases the unigram probability of the unrecognized word in proportion to the difference between the top-ranked word's score and the smallest score in the alternative list.
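The adaptation rule and the two-try suggestion rule described in this abstract can be sketched as follows. The abstract does not say in which domain scores and probabilities are expressed, so this sketch assumes log-domain scores where higher is better, applies the proportional increase as an addition to the unigram log probability, and skips renormalization. The names `score_gap`, `boost_unigram`, `should_suggest_new_word`, `scale`, and `floor` are hypothetical.

```python
from typing import Dict, List, Tuple

AltList = List[Tuple[str, float]]  # (word, score) pairs from one recognition try


def score_gap(alternatives: AltList, intended: str) -> float:
    """Gap between the top score and the intended word's score.

    If the intended word is not in the alternative list, fall back to
    the gap between the top score and the smallest score in the list.
    """
    scores = dict(alternatives)
    top = max(scores.values())
    if intended in scores:
        return top - scores[intended]
    return top - min(scores.values())


def boost_unigram(unigram_logprob: Dict[str, float], alternatives: AltList,
                  intended: str, scale: float = 1.0,
                  floor: float = -20.0) -> Dict[str, float]:
    """Return a copy with the intended word's unigram log probability raised.

    `scale` is an assumed proportionality constant; `floor` is an assumed
    starting value for words missing from the table. No renormalization.
    """
    gap = score_gap(alternatives, intended)
    updated = dict(unigram_logprob)
    boosted = updated.get(intended, floor) + scale * gap
    updated[intended] = min(0.0, boosted)  # a log probability cannot exceed 0
    return updated


def should_suggest_new_word(alt_lists: List[AltList], intended: str,
                            tries: int = 2) -> bool:
    """True when the word was absent from the last `tries` alternative lists."""
    recent = alt_lists[-tries:]
    return len(recent) == tries and all(
        intended not in dict(alts) for alts in recent)
```

With an alternative list of `[("wreck", -10.0), ("recognize", -12.5), ("rack", -15.0)]`, the gap for "recognize" is 2.5, while a word absent from the list gets the fallback gap of 5.0. Whether such a boost actually guarantees recognition on the next try depends on how the recognizer combines acoustic and language model scores, which the abstract does not specify.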