-
11.
公开(公告)号:US20220084502A1
公开(公告)日:2022-03-17
申请号:US17205624
申请日:2021-03-18
Inventor: Mingming Ma , Tianshu Hu , Zhibin Hong
Abstract: The present application discloses a method and apparatus for determining the shape of the lips of a virtual character, a device and a computer storage medium, and relates to an artificial intelligence technology, and particularly to computer vision and deep learning technologies. An implementation includes: determining a phoneme sequence corresponding to a voice, the phoneme sequence including a phoneme corresponding to each time point; determining lip-shape key point information corresponding to each phoneme in the phoneme sequence; searching a pre-established lip shape library according to each piece of determined lip-shape key point information, so as to obtain a lip shape image of each phoneme; and corresponding the searched lip shape image of each phoneme with each time point to obtain a lip-shape image sequence corresponding to the voice. With the present application, the voice may be synchronized with the shapes of the lips in the images.
-
公开(公告)号:US20210312685A1
公开(公告)日:2021-10-07
申请号:US17348452
申请日:2021-06-15
Inventor: Hanqi Guo , Tianshu Hu , Mingming Ma , Zhibin Hong
Abstract: A method for synthesizing a figure of a virtual object includes: obtaining a figure image of the virtual object, and original face images corresponding to a speech segment; extracting a first face key point of the face of the virtual object, and a second face key point of each of the original face images; processing the first face key point to generate a position and posture information of a first three-dimensional 3D face; processing each second face key point to generate vertex information of a second 3D face; generating a target face image corresponding to each original face image based on the position and the posture information of the first 3D face and the vertex information of each second 3D face; and synthesizing a speaking figure segment of the virtual object, corresponding to the speech segment, based on the figure image of the virtual object and each target face image.
-
公开(公告)号:US11074437B2
公开(公告)日:2021-07-27
申请号:US15930714
申请日:2020-05-13
Inventor: Shihu Li , Xiangda Yan , Yuanzhang Chang , Zhibin Hong , Tianshu Hu , Kun Yao , Junyu Han , Jingtuo Liu , Shengxian Zhu
Abstract: A method, an electronic device and a storage medium for expression driving are disclosed. The method may include: performing facial key point detection on a driven character in a first image to obtain a first facial key point sequence; performing the following processing for each second image of a plurality of second images obtained successively: performing facial key point detection on a driving character in the second image to obtain a second facial key point sequence; obtaining a difference between the second facial key point sequence and an expressionless key point sequence which has been determined previously according to an analysis on the second facial key point sequence for a previous second image, and performing expression drive rendering on the driven character based on the difference and the first facial key point sequence. The technical solution may enhance flexibility, interactivity, accuracy etc.
-
公开(公告)号:US20210201886A1
公开(公告)日:2021-07-01
申请号:US17204167
申请日:2021-03-17
Inventor: Tonghui Li , Tianshu Hu , Mingming Ma , Zhibin Hong
Abstract: This application discloses a method and a device for dialogue with a virtual object, a client end and a storage medium. A specific implementation scheme of the method applied to the client end includes: converting a first voice collected by the client end into a first text content, in a case that the client end is in an offline mode; acquiring a second text content responding to the first text content based on offline natural language processing (NLP) and/or a target database pre-stored by the client end; performing voice synthesis on the second text content to acquire a second voice; simulating a lip shape of the second voice by using the virtual object to acquire a target video in which the virtual object says the second voice; and playing the target video.
-
-
-