-
公开(公告)号:US11645801B2
公开(公告)日:2023-05-09
申请号:US17348452
申请日:2021-06-15
Inventor: Hanqi Guo , Tianshu Hu , Mingming Ma , Zhibin Hong
CPC classification number: G06T13/40 , G06T7/73 , G06T15/04 , G06T17/20 , G06V40/171 , G06T2207/20221 , G06T2207/30201
Abstract: A method for synthesizing a figure of a virtual object includes: obtaining a figure image of the virtual object, and original face images corresponding to a speech segment; extracting a first face key point of the face of the virtual object, and a second face key point of each of the original face images; processing the first face key point to generate a position and posture information of a first three-dimensional 3D face; processing each second face key point to generate vertex information of a second 3D face; generating a target face image corresponding to each original face image based on the position and the posture information of the first 3D face and the vertex information of each second 3D face; and synthesizing a speaking figure segment of the virtual object, corresponding to the speech segment, based on the figure image of the virtual object and each target face image.
-
2.
公开(公告)号:US20210209732A1
公开(公告)日:2021-07-08
申请号:US17208651
申请日:2021-03-22
Inventor: Tianshu Hu , Jiaming Liu , Zhibin Hong
Abstract: The present application discloses a face super-resolution realization method and apparatus, an electronic device and a storage medium, and relate to fields of face image processing and deep learning. The specific implementation solution is as follows: a face part in a first image is extracted; the face part is input into a pre-trained face super-resolution model to obtain a super-sharp face image; a semantic segmentation image corresponding to the super-sharp face image is acquired; and the face part in the first image is replaced with the super-sharp face image, by utilizing the semantic segmentation image, to obtain a face super-resolution image.
-
3.
公开(公告)号:US11710215B2
公开(公告)日:2023-07-25
申请号:US17208651
申请日:2021-03-22
Inventor: Tianshu Hu , Jiaming Liu , Zhibin Hong
CPC classification number: G06T3/4046 , G06N3/08 , G06T3/4053 , G06T7/174 , G06V10/26 , G06V40/165 , G06V40/171 , G06T2207/20081 , G06T2207/20084 , G06T2207/30201
Abstract: The present application discloses a face super-resolution realization method and apparatus, an electronic device and a storage medium, and relate to fields of face image processing and deep learning. The specific implementation solution is as follows: a face part in a first image is extracted; the face part is input into a pre-trained face super-resolution model to obtain a super-sharp face image; a semantic segmentation image corresponding to the super-sharp face image is acquired; and the face part in the first image is replaced with the super-sharp face image, by utilizing the semantic segmentation image, to obtain a face super-resolution image.
-
公开(公告)号:US20210232932A1
公开(公告)日:2021-07-29
申请号:US17201681
申请日:2021-03-15
Inventor: Jiaming Liu , Tianshu Hu , Shengyi He , Zhibin Hong
Abstract: A method and an apparatus for generating an image, a device and a medium are provided The method includes: acquiring a first random vector set; determining an image category to which at least one random vector in the first random vector set belongs, based on a trained classifier; and inputting a random vector belonging to the image category into a trained image generator to generate a virtual image belonging to the image category. Through the technical solution in the present disclosure, a large number of images that are definitely classified can be automatically generated.
-
公开(公告)号:US11670029B2
公开(公告)日:2023-06-06
申请号:US17191444
申请日:2021-03-03
Inventor: Tianshu Hu , Mingming Ma , Shengyi He , Hanqi Guo , Tonghui Li , Zhibin Hong
CPC classification number: G06T13/40 , G06N20/00 , G06V40/168 , G06V40/176
Abstract: A method and apparatus for processing character image data are provided. A specific embodiment of the method can include: acquiring a video including character images of a first character as first data, and acquiring a video including character images of a second character as second data, the video including character images of the first character including a plurality of types of movement-state images; replacing a head image extracted from the first data with a head image extracted from the second data; replacing a face extracted from the first data with a face extracted from the second data; and replacing the face in the replaced head image with the replaced face to obtain a target head image.
-
公开(公告)号:US11423907B2
公开(公告)日:2022-08-23
申请号:US17204771
申请日:2021-03-17
Inventor: Tianshu Hu , Mingming Ma , Tonghui Li , Zhibin Hong
Abstract: The application provides a virtual object image display method and apparatus, an electronic device and a storage medium, relates to the field of artificial intelligence, in particular to the field of computer vision and deep learning, and may be applied to virtual object dialogue scenarios. The specific implementation scheme includes: segmenting acquired voice to obtain voice segments; predicting lip shape sequence information for the voice segments; searching for a corresponding lip shape image sequence based on the lip shape sequence information; performing lip fusion between the lip shape image sequence and a virtual object baseplate to obtain a virtual object image; displaying the virtual object image. The application improves ability to obtain virtual object image.
-
公开(公告)号:US20220084502A1
公开(公告)日:2022-03-17
申请号:US17205624
申请日:2021-03-18
Inventor: Mingming Ma , Tianshu Hu , Zhibin Hong
Abstract: The present application discloses a method and apparatus for determining the shape of the lips of a virtual character, a device and a computer storage medium, and relates to an artificial intelligence technology, and particularly to computer vision and deep learning technologies. An implementation includes: determining a phoneme sequence corresponding to a voice, the phoneme sequence including a phoneme corresponding to each time point; determining lip-shape key point information corresponding to each phoneme in the phoneme sequence; searching a pre-established lip shape library according to each piece of determined lip-shape key point information, so as to obtain a lip shape image of each phoneme; and corresponding the searched lip shape image of each phoneme with each time point to obtain a lip-shape image sequence corresponding to the voice. With the present application, the voice may be synchronized with the shapes of the lips in the images.
-
公开(公告)号:US20210312685A1
公开(公告)日:2021-10-07
申请号:US17348452
申请日:2021-06-15
Inventor: Hanqi Guo , Tianshu Hu , Mingming Ma , Zhibin Hong
Abstract: A method for synthesizing a figure of a virtual object includes: obtaining a figure image of the virtual object, and original face images corresponding to a speech segment; extracting a first face key point of the face of the virtual object, and a second face key point of each of the original face images; processing the first face key point to generate a position and posture information of a first three-dimensional 3D face; processing each second face key point to generate vertex information of a second 3D face; generating a target face image corresponding to each original face image based on the position and the posture information of the first 3D face and the vertex information of each second 3D face; and synthesizing a speaking figure segment of the virtual object, corresponding to the speech segment, based on the figure image of the virtual object and each target face image.
-
公开(公告)号:US11074437B2
公开(公告)日:2021-07-27
申请号:US15930714
申请日:2020-05-13
Inventor: Shihu Li , Xiangda Yan , Yuanzhang Chang , Zhibin Hong , Tianshu Hu , Kun Yao , Junyu Han , Jingtuo Liu , Shengxian Zhu
Abstract: A method, an electronic device and a storage medium for expression driving are disclosed. The method may include: performing facial key point detection on a driven character in a first image to obtain a first facial key point sequence; performing the following processing for each second image of a plurality of second images obtained successively: performing facial key point detection on a driving character in the second image to obtain a second facial key point sequence; obtaining a difference between the second facial key point sequence and an expressionless key point sequence which has been determined previously according to an analysis on the second facial key point sequence for a previous second image, and performing expression drive rendering on the driven character based on the difference and the first facial key point sequence. The technical solution may enhance flexibility, interactivity, accuracy etc.
-
公开(公告)号:US20210201886A1
公开(公告)日:2021-07-01
申请号:US17204167
申请日:2021-03-17
Inventor: Tonghui Li , Tianshu Hu , Mingming Ma , Zhibin Hong
Abstract: This application discloses a method and a device for dialogue with a virtual object, a client end and a storage medium. A specific implementation scheme of the method applied to the client end includes: converting a first voice collected by the client end into a first text content, in a case that the client end is in an offline mode; acquiring a second text content responding to the first text content based on offline natural language processing (NLP) and/or a target database pre-stored by the client end; performing voice synthesis on the second text content to acquire a second voice; simulating a lip shape of the second voice by using the virtual object to acquire a target video in which the virtual object says the second voice; and playing the target video.
-
-
-
-
-
-
-
-
-