-
公开(公告)号:US20190198022A1
公开(公告)日:2019-06-27
申请号:US15850814
申请日:2017-12-21
Applicant: MOTOROLA SOLUTIONS, INC.
Inventor: Stephen VARNER , Wei LIN , Randy L. EKL , Daniel A. LAW
CPC classification number: G10L15/25 , G06K9/00335 , G06K9/00711 , G10L15/26 , G10L25/57 , G10L25/60
Abstract: A device and method for generating text representative of lip movement is provided. One or more portions of video data are determined that include: audio with an intelligibility rating below a threshold intelligibility rating; and lips of a human face. A lip-reading algorithm is applied to the one or more portions of the video data to determine text representative of detected lip movement in the one or more portions of the video data. The text representative of the detected lip movement is stored in a memory. A transcript that includes the text representative of the detected lip movement may be generated. Captioned video data may be generated from the video data and the text representative of detected lip movement.