Invention Grant
US06366885B1 Speech driven lip synthesis using viseme based hidden markov models
有权
使用基于Viseme的隐马尔可夫模型的语音驱动唇形合成
- Patent Title: Speech driven lip synthesis using viseme based hidden markov models
- Patent Title (中): 使用基于Viseme的隐马尔可夫模型的语音驱动唇形合成
-
Application No.: US09384763Application Date: 1999-08-27
-
Publication No.: US06366885B1Publication Date: 2002-04-02
- Inventor: Sankar Basu , Tanveer Atzal Faruquie , Chalapathy V. Neti , Nitendra Rajput , Andrew William Senior , L. Venkata Subramaniam , Ashish Verma
- Applicant: Sankar Basu , Tanveer Atzal Faruquie , Chalapathy V. Neti , Nitendra Rajput , Andrew William Senior , L. Venkata Subramaniam , Ashish Verma
- Main IPC: G10L2106
- IPC: G10L2106

Abstract:
A method of speech driven lip synthesis which applies viseme based training models to units of visual speech. The audio data is grouped into a smaller number of visually distinct visemes rather than the larger number of phonemes. These visemes then form the basis for a Hidden Markov Model (HMM) state sequence or the output nodes of a neural network. During the training phase, audio and visual features are extracted from input speech, which is then aligned according to the apparent viseme sequence with the corresponding audio features being used to calculate the HMM state output probabilities or the output of the neutral network. During the synthesis phase, the acoustic input is aligned with the most likely viseme HMM sequence (in the case of an HMM based model) or with the nodes of the network (in the case of a neural network based system), which is then used for animation.
Information query